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Preface 



The International Conference on Theoretical Physics, TH-2002, took place in Paris 
from July 22 to 27 in the Conference Center of the UNESCO, the United Nations 
Educational Scientific and Cultural Organization, under aegis of the lUPAP, the 
International Union of Pure and Applied Physics and of the French and Euro- 
pean Physical Societies, with a large support of several French, European and 
international Institutions. 

International and crossdisciplinary, TH-2002 welcomed around 1200 partic- 
ipants representing all domains of modern theoretical physics. The conference 
offered a high-level scientific program, including 18 plenary lectures, 45 general 
lectures in thematic sessions and 140 more specialized lectures, partly invited and 
partly selected among proposals received from participants. Around 500 contribu- 
tions were also presented as posters. 

Plenary lectures as well as general thematic lectures were addressed to a 
general audience of theoricians, not only to specialists. 

According to our commitments towards UNESCO and other sponsoring insti- 
tutions, TH-2002 attributed more than 200 fellowships, mostly to scientists from 
developing countries and Eastern Europe, covering registration fees and, for more 
than half of them, stay expenses with student type accomodation. 

Special highlights of the conference included 

• the opening ceremony on July 22, with the participation of Mrs Claudie 
Haigneré, French Minister of Research, and M. Walter Erdelen, General Ad- 
joint Director for Sciences at UNESCO. Their opening addresses were espe- 
cially appreciated and are reproduced below. This ceremony preceded the 
first lecture by Professor Cohen- Tannoudji, Physics Nobel prize winner. 

• the special session on Theoretical Physics in developing countries on July 24 
afternoon. The presentations of the CAMS (Center for Advanced Mathemat- 
ical Sciences, Lebanon) and of the AIMS (African Institute for Mathematical 
Sciences, South Africa) are included at the end of these Proceedings. 

• the cocktail and reception in the prestigious Salle des Fêtes of the Hôtel de 
Ville of Paris on July 24, evening. 

• the general scientific lectures open to the local scientific public on Satursday, 
July 27, in particular by Professors P.W.A. Anderson et C.N. Yang, Physics 
Nobel prize winners. 

Both of them as well as Professor Michael Fisher who delivered the concluding 
address, stressed the interest and success of this Conference which gathered the- 
oretical physicists and allowed exchanges of scientific ideas and methods between 
different domains of theoretical physics, beyond the specificities of particular sub- 
fields: particle physics, string theory, cosmology, statistical and condensed matter 




physics, dynamical systems and quantum chaos, physics/ biology interface, infor- 
mation theory and quantum computing, ... 

The previous analogous Conference on Theoretical Physics had taken place 
in Trieste in 1968. Everyone expressed the wish that the time interval to the next 
one should be reduced . . . 

An exhibition of scientific books was organized during the conference with 
the participation of Birkhauser-Verlag, Cambridge University Press, the Editions 
de Physique- Sciences, the Institute of Publishing, Imperial College Press, Kluwer, 
Oxford University Press, Springer-Verlag and World Scientific. 

These Proceedings present all invited contributions received. After the gen- 
eral lectures of Nobel prize winners Anderson and Yang, they are presented accord- 
ing to their theme. In each theme, plenary lectures are presented first. We also edit 
a CD-Rom which contains all contributions above as also all other contributions 
received corresponding to selected oral lectures and the Book of Abstracts. 

We express our warm thanks to all our committees: National Committee, 
International Advisory Committee, Thematic Committees and Fellowship Com- 
mittee; to all our invited lecturers, to the chairpersons of all our sessions and to 
all scientists who contributed to the success of this conference by presenting oral 
or poster contributions. 

We are also happy to warmly thank the High French, European and inter- 
national Institutions that supported the conference, and more particularly the 
UNESCO and lUPAP at the international level, the Division des Sciences of CEA 
(DSM, Commissariat à l’Énergie Atomique), the CNRS (Centre National de la 
Recherche Scientifique, Departments SPM and IN2P3) and the Ministère de la 
Recherche at the french level. Their very generous support both directly and in- 
directly were crucial. We also thank all Institutions of different countries that 
supported our participants, in particulier the NSF (National Science Foundation) 
which gave a generous number of grants to young high-level scientists coming from 
the USA. Our sincere thanks also go to the Mairie de Paris which welcomed us in 
its prestigious Salle des Fêtes of the Hôtel de Ville. 

The Conference was mainly organized at the Service de Physique Théorique 
de Saclay, and we are pleased to thank its Director, Jean-Paul Blaizot. There we 
benefit ted from the generous and very valuable help of our colleague Annie Gervois, 
main organizer of the StatPhys20 Conference. We are happy to thank her very 
warmly. Finally we benefitted from the technical assistance of Chantal Delongeas 
(in particular for the preparation of these Proceedings), Véronique Leclercq- Ort al 
and Paolo Ribeca (our web master, who moreover prepared our Book of Abstracts). 
We also adress our warm thanks to them. 

To conclude we wish to express a special thought in memory of our respected 
colleague Dominique Vautherin, active participant to our National Committee, 
who passed away during the preparation of this Conference. 

The Editors, June 2003 




Préface 



La Conférence Internationale en Physique Théorique, TH-2002, a eu lieu à Paris, 
dans le magnifique Centre de Conférences de l’UNESCO, du Lundi 22 au Samedi 
27 Juillet 2002, sous l’égide de l’IUPAP et des Sociétés française et européenne 
de physique et avec un soutien très important de plusieurs institutions françaises, 
européennes et internationales. 

Internationale et pluridisciplinaire, elle a rassemblé environ 1200 participants 
de tous pays, représentant tous les domaines de la physique théorique contempo- 
raine, et a présenté un programme scientifique diversifié de haut niveau, avec 18 
exposés pléniers, 45 exposés généraux en sessions thématiques et 140 exposés plus 
spécialisés, en partie invités et en partie sélectionnés parmi les propositions reçues 
des participants. Environ 500 contributions ont par ailleurs été présentées sous 
forme d’affiches murales. 

Les exposés pléniers, ainsi que les exposés thématiques généraux, étaient 
destinés à une large audience de théoriciens, et non seulement aux spécialistes. 

Conformément à nos engagements envers l’UNESCO et les autres institutions 
nous ayant accordé leur aide, TH-2002 a attribué plus de 200 bourses, principale- 
ment aux scientifiques de pays en voie de développement et d’Europe de l’Est, 
couvrant les droits d’inscription et pour plus de la moitié d’entre eux les frais de 
séjour en résidences étudiantes à Paris. 

Des événements marquants ont ponctué la conférence, en particulier 

• la cérémonie d’ouverture le Lundi 22 Juillet à 10 heures, en présence de Mme 
Claudie Haigneré, Ministre déléguée à la Recherche, et de M. Walter Erde- 
len. Directeur Général Adjoint de l’UNESCO pour les Sciences, dont les dis- 
cours de haute tenue ont été très appréciés et sont reproduits ci- après. Cette 
cérémonie a précédé la conférence du Professeur Claude Cohen- Tannoudji, 
prix Nobel de physique. 

• la session spéciale sur la physique théorique dans les pays en voie de dévelop- 
pement le Mercredi après-midi. On trouvera dans ce livre, après les exposés 
purement scientifiques, les présentations du CAMS (Center for Advanced 
Mathematical Sciences) au Liban et de l’AIMS (Institut Africain pour les 
Sciences Mathématiques) en Afrique du Sud. 

• la réception-cocktail dans la prestigieuse Salle des Fêtes de l’Hôtel de Ville 
de Paris le Mercredi en fin d’après-midi 

• les conférences d’intérêt général, ouvertes au public scientifique local, le 
Samedi matin 27 Juillet, en particulier par les professeurs P.W.A. Ander- 
son et C.N. Yang, prix Nobel de physique. 

Tous deux, ainsi que le professeur Michael Fisher qui a conclu, ont souligné le 
succès et l’intérêt de cette conférence ayant permis l’échange des idées et méthodes 




entre les différents domaines de la physique théorique, au-delà des spécificités de 
chaque discipline: physique des particules, théorie des cordes, cosmologie, physique 
statistique, matière condensée, systèmes dynamiques et chaos quantique, interfaces 
physique /biologie, information et calcul quantiques. . . La conférence précédente 
analogue en physique théorique avait eu lieu à Trieste en 1968. Tous ont exprimé 
le souhait que l’intervalle avec la prochaine soit réduit. 

Une exposition de livres scientifiques a eu lieu pendant la conférence avec la 
participation de Birkhàuser, Cambridge University Press, les Editions de Physique- 
Sciences, rinstitute of Publishing, Imperial College Press, Kluwer, Oxford Univer- 
sity Press, Springer-Verlag et World Scientific. 

Ces comptes-rendus présentent l’ensemble des contributions invitées reçues. 
Après les exposés généraux des Prix Nobel Anderson et Yang, elles sont regroupées 
par thèmes avec, dans chaque thème, les exposés pléniers puis les autres exposés 
invités. Nous éditons par ailleurs un CD Rom contenant l’ensemble des contribu- 
tions ci-dessus, ainsi que les autres contributions reçues correspondant aux exposés 
oraux sélectionnés, et le livre contenant tous les résumés reçus des participants. 

Nous exprimons nos chaleureux remerciements à tous nos comités: Comité 
National, Comité Consultatif International, Comités thématiques et Comité des 
bourses, à tous nos conférenciers invités, aux présidents de séance de toutes nos 
sessions ainsi qu’à tous les scientifiques qui ont contribué au succès de la conférence 
en présentant des contributions orales ou sous forme d’affiches murales. 

Nous sommes également heureux de remercier chaleureusement les hautes 
institutions françaises, européennes et internationales qui nous ont apporté leur 
soutien, et tout particulièrement l’UNESCO et l’IUPAP au niveau international, 
la Division des Sciences du CEA (DSM), le CNRS (SPM et IN2P3) et le Ministère 
de la Recherche au niveau français, dont les soutiens à la fois directs et indirects 
ont été si généreux. Nous remercions aussi toutes les institutions des différents 
pays qui ont accordé un soutien à nos participants, en particulier la NSF qui a 
attribué une vingtaine de bourses à des jeunes scientifiques de haut niveau venant 
des Etats-Unis. Notre sincère reconnaissance va aussi à la Mairie de Paris qui a 
bien voulu nous accueillir dans sa magnifique Salle des Fêtes de l’Hôtel de Ville. 

L’organisation de la conférence a été principalement assurée au Service de 
Physique Théorique de Saclay. Nous remercions son Directeur, Jean-Paul Blaizot. 
Nous y avons bénéficié de l’aide considérable de notre collègue Annie Gervois, 
principale organisatrice de la conférence StatPhys20, que nous sommes heureux 
de remercier très chaleureusement. Nous avons enfin bénéficié de l’assistance tech- 
nique de Chantal Delongeas (en particulier pour la préparation de ce manuscrit), 
Véronique Leclercq- O rt al et Paolo Ribeca (assistance informatique et préparation 
du livre des Résumés), auquels nous adressons aussi nos plus vifs remerciements. 

Nous souhaitons pour terminer saluer tout particulièrement la mémoire de 
notre respecté collègue Dominique Vautherin, prématurément disparu avant la 
conférence, et qui avait activement participé à notre Comité National. 

Daniel lagolnitzer, Vincent Rivasseau et Jean Zinn- Justin, Juin 2003 
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Opening Ceremony 



Opening Address 



Claudie Haigneré 

Ministre déléguée à la Recherche 



Une réunion d’excellences 

First of all, I would like to thank the organisers of the International Conference on 
Theoretical Physics, namely rUnion Internationale de Physique Pure et Applique 
and les Sociétés Françaises et Européennes de Physique. 

I would also like to welcome the numerous foreign scientists who joined us on this 
occasion and I ask them to forgive me because I have now to switch from English 
to French. 



Monsieur le Président 

Mesdames et Messieurs les professeurs 

Mesdames, Messieurs 

C’est pour moi un grand bonheur de saluer ce matin les quelques 1200 participants 
de la Conférence Internationale de Physique Théorique organisée par l’Union Inter- 
nationale de Physique Pure et Appliquée et les Sociétés Françaises et Européennes 
de Physique, dans cette superbe salle de l’UNESCO, dont le cadre se prête à 
merveille à cette réunion d’excellence. Que cette assemblée de chercheurs se tienne, 
à Paris, dans cette enceinte, symbole éclatant de la coopération internationale la 
plus ouverte et la plus libre, en sciences, et plus largement encore dans tous les 
domaines de l’esprit et de la culture, est un motif supplémentaire de satisfaction. 

Au nom de la France, je souhaite à tous nos amis étrangers, tout partic- 
ulièrement à ceux qui ont fait l’effort de longs déplacements, la bienvenue dans 
notre pays. Qu’ils s’y sentent chez eux. 




XVI 



Je remercie vivement les deux sociétés savantes organisatrices de leur choix 
de lieu et de programme, ainsi que du travail remarquable qu’elles ont accompli 
pour que votre réunion soit une réussite complète. 

Je salue tout particulièrement les plus jeunes d’entre vous dont la présence 
ici augure d’un avenir riche de promesses. Une rencontre scientifique de cette am- 
pleur leur permettra, dans les meilleures conditions possibles, de s’informer auprès 
de leurs pairs des progrès les plus récents accomplis dans votre spécialité et de 
confronter les points de vue. Ils y trouveront aussi, j’en suis sûre, une formidable 
occasion de nouer des liens personnels fructueux. 

Je souhaite souligner la présence de vos nombreux collègues venus d’Afrique, 
du Proche ou du Moyen-Orient et pour lesquels les conditions de recherche dans 
leurs laboratoires sont souvent moins favorable que celle que nous connaissons 
en Europe ou dans les pays de longue tradition scientifique. Leurs mérites de 
participer pleinement à la production scientifique de très haut niveau n’en sont 
que plus grands et appellent nos encouragements les plus chaleureux. Que soient 
donc également remerciés les organisateurs de ce colloque qui ont permis, grâce à 
l’atribution de bourses, la venue de certains d’entre vous. 

Cette Conférence ne s’inscrit pas dans une srie de rencontres thématiques 
régulières. Il faut remonter jusqu’en 1968 pour trouver une Conférence du même 
type. Il s’agit de celle organisée par Abus Salam, physicien pakistanais, prix Nobel 
1979. 

Cette interruption de plus de 20 années s’explique peut-être par une cer- 
taine réticence à accoler, de manière exclusive, l’épithète ’’théorique” au mot 
’’physique”, manifestant ainsi une crainte à souligner la différence d’approche 
entre théoriciens et expérimentateurs. Ces débats sont aujourd’hui dépassés et 
chacun sait qu’une telle conception s’inscrirait à contre-courant des synergies in- 
dispensables souhaitées par les deux communautés, convaincues qu’elles sont que 
les vrais progrès en physique viendront d’approches menées en étroite coopération, 
comportant à la fois le développement de théories et de concepts audacieux, des 
expériences habiles et lumineuses et des simulations numériques les plus raffinées. 

Cette Conférence doit, à n’en pas douter, son succès à votre soif d’échanger, 
entre théoriciens, les outils et les pratiques de votre discipline, tout autant qu’à 
confronter les débats que vous engagez avec les meilleurs expérimentateurs. 

Une aventure intellectuelle prodigieuse dans l’histoire 
des sciences 

La physique théorique représente une des aventures intellectuelles les plus pro- 
digieuses du siècle. 

Les grandes étapes de ce pan fascinant de l’histoire des sciences sont bien 
connues : à la dcouverte des relativités restreinte et générale au début du XX^^® 
siècle succèdent la fondation de la mécanique quantique dans l’immédiat après- 
guerre puis, en 1956, l’explication de la supraconductivité en matière condensée. 




L’Ecole française de physique théorique, quant à elle, a largement contribué 
au progrès de cette discipline, comme en témoigne, au cours des dernières dé- 
cennies, l’attribution de prix Nobel, à Louis Néel en 1970 pour sa théorie de l’anti- 
ferromagnétisme, à Pierre-Gilles de Gennes en 1991 pour sa théorie de la matière 
molle, à Claude Cohen- Tannoudji en 1997 pour ses études sur le refroidissement 
laser. 

On ne peut qu’espérer que le siècle qui s’ouvre devant nous voit se 

poursuivre l’exploration de champs encore nouveaux. 

De beaux défis restent encore à relever pour les années à venir, que ce 
soient l’unification des interactions élémentaires et de la gravité, l’élaboration 
d’une théorie cosmologique permettant d’expliquer l’ensemble des observations 
astronomiques - en particulier le rôle des trous noirs - ou encore l’analyse des 
propriétés d’enchevêtrement des fonctions d’onde ainsi que de leur propriété de 
cohérence. Songeons enfin aux interactions électroniques de la matière condensée, 
à l’origine de la supraconductivité de haute température et du magnétisme. 

Plus important encore, il demeure une frange d’ignorance à la lisière de ter- 
ritoires déjà explorés. 

Au-delà de ces découvertes majeures, la physique théorique a forgé des outils 
de recherche d’une efficacité tout à fait remarquable : elle a oeuvré, en particulier, 
pour une modélisation aussi simple que possible des phénomènes observés et pour 
une conceptualisation sous forme de lois élémentaires de divers mécanismes. 

C’est précisément ce type de démparche qui pourrait être étendue à des 
champs disciplinaires nouveaux, plus éloignés de la physique au sens strict, que l’on 
désigne couramment par l’expression ’’systèmes complexes” et particulièrement au 
plus complexe d’entre eux, le vivant. La cellule vivante offre un autre champ de 
recherche encore neuf. L’explication des mécanismes fondamentaux de la vie sup- 
pose, entre autres, une modélisation élaborée et audacieuse des mécanismes de 
rétroaction dans les cissemblages moléculaires. 

Des avancées considérables peuvent être attendues de l’application des 
méthodes de la physique théorique aux champs nouveaux que sont le calcul et 
l’information quantique ainsi que oies nano-objets en interaction. 

Les nanotechnologies en plein essor, pour ne prendre que cet exemple, vont 
certainement mobiliser dans un futur proche un grand nombre de contributions 
théoriques. 

Il en va de même de l’application des méthodes de la physique statistique 
aux objets fragiles de la matière molle, comme les mousses, les émulsions ou les 
tas de sable. 

Des communautés riches d’une culture scientifique différente attendent de 
vous une participation active à leurs recherches. Je vous encourage vivement à 
demeurer à l’écoute des besoins de collaboration qui peuvent s’exprimer et à en- 
treprendre avec enthousiasme cette démarche interdisciplinaire. 




Un soutien nécessaire à la recherche fondamentale 

Comme d’autres champs de la science, la physique théorique illustre de façon 
exemplaire la nécessité, dans le cadre d’une politique ambitieuse de recherche, 
d’apporter un soutien résolu à la recherche fondamentale. 

Au-delà de son champ d’investigation propre, elle est appelée à féconder les 
autres secteurs de la recherche ainsi qu’à proposer des développements aux appli- 
cations économiques et sociales. 

La physique théorique, par la notoriété que font rejaillir sur elle ses 
représentants les plus éminents, et surtout en vertu de sa nature même, intrigue et 
fascine tout à la fois. Son rayonnement s’étend bien au-delà de la seule communauté 
des chercheurs. 

C’est pourquoi la diffusion auprès des jeunes, filles et garçons, de cette image 
attirante de la physique pourrait constituer un puissant attracteur en direction 
des carrières scientifiques. 

Je forme le voeu que ce colloque soit pour vous tous l’occasion de travaux et 
d’échanges fructueux. Puisse le dynamisme qu’on voit s’exprimer dans ces journées 
de travail témoigner avec éloquence de l’excellence de la communauté scientifique. 

I do hope that this meeting will give rise to promising work and fruitful 
synergies. May your dynamism testify to the excellence of the scientific commu- 
nauty. 
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Your Excellency, Minister of Research and Technology, 

Professors lagolnitzer and Zinn-Justin, Presidents of the Conference, 
Distinguished participants, 

Friends and colleagues. 

It is a pleasure for me to welcome you to UNESCO House on this, the opening 
session of the International Conference on Theoretical Physics. I wish to convey 
to you the warm greetings of the Director-General of UNESCO, Mr. Koichiro 
Matsuura, who could not be with you today since he is on travel abroad. He and 
all of us at UNESCO extend to you our best wishes for a successful and fruitful 
meeting. 

UNESCO has hosted similar conferences in the past few years - the Interna- 
tional Congress in Mathematical Physics in 1994, the International Conference in 
Statistical Physics and the International Conference in Nuclear Physics, both in 
1998. This coming together at UNESCO of the international communities of scien- 
tists serves to demonstrate and symbolize UNESCO’s wholehearted commitment 
to the development of science on a world scale. 

Ladies and Gentlemen, 

Theoretical Physics deals with studies of very fundamental problems in Quantum 
Mechanics, Statistical Physics, High Energy Physics, Condensed Mater Theory, 
Astrophysics, to name but a few, and ranging from the sub- atomic level to the 
cosmic scale of the Universe. Theoretical Physics is concerned with research at the 
most fundamental level, with brilliant theories that are gradually confirmed by 
ingenious experiments, which in turn result in practical devices that are entirely 
transforming technology. 

The real world is a very complicated place: the knowledge of the properties 
of constituent particles (subsystems) does not allow us to reconstruct the diversity 
and richness of phenomena observed in these complex systems. As emphasised 30 
years ago by one of our plenary speakers, Professor Phil Anderson, in his article in 
Science entitled “MORE IS DIFFERENT!” , there is a hierarchy of scales and com- 
plexities, from “elementary” particles, to atoms, molecules, solid and soft matter. 
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biosystems, the mind, and social interactions. The analysis of each of these levels 
requires different concepts and principles. It is remarkable that at all levels the 
methods and way of thinking adopted in theoretical physics are playing a major, 
if not the leading, role. 

A very good example is the beautiful concept of broken symmetry at phase 
transitions particularly developed to explain the striking phenomena of supercon- 
ductivity in certain metals and alloys, and superfluidity in liquid helium. The 
discovery and interpretation of the integer and fractional quantum Hall effects are 
another demonstration of the strength of theoretical physics thinking. Recently, 
we have witnessed a boom of new ideas and concepts resulting from the statisti- 
cal physics of disordered systems, in particular the spin glasses. These concepts 
offer essential tools in the approach to one of the key issues in the theory of 
computation : the random satisfiability problem^ a typical case of the well-known 
NP-complete problem in mathematics. The progress on this subject is set to have a 
profound impact on theoretical disciplines like graph theory and game theory, and 
important applications such as the optimisation process, bio-informatics, error- 
corrected coding and crypto graphy. Theoretical physics is fast becoming the very 
foundation of these thriving interdisciplinary research activities. 

In elementary particle physics. String Theory seems to be the only possi- 
bility to unify all known interactions, including gravity. Superstring theory has 
made impressive progress in the last 6-7 years. The derivation of the Hawking- 
Beckenstein relation between the area and the entropy of a black hole is one of the 
most striking physical implications of the superstring theory. One other impres- 
sive achievement in Cosmology in recent years is a strong support for the idea of 
inflation, stemming from measurements of the anisotropy in the cosmic microwave 
background radiation. The data by Maxima and Boomerang essentially give a pic- 
ture of our Universe at the decoupling time, around 100 thousand years after the 
Big Bang. This was the moment when the background electromagnetic radiation 
decoupled from matter and from then on propagated freely. The measurement of 
acoustic peaks in its nearly thermal spectrum indicates initial density fluctuations 
believed to be the origin of bulk matter observed in our present-day Universe. 
The impressive agreement between theoretical prediction and data measured has 
made physicists stard talking about precision cosmology in a way similar to pre- 
cision measurements in high-energy accelerators. As a result of these and some 
other measurements, it is believed that our three-dimensional space is essentially 
flat and our universe has a non-zero cosmological constant. This implies that our 
present-day Universe is undergoing an accelerated expansion. 

Understanding the origin of this non- zero cosmological constant is one of the 
most important problems of theoretical physics. Other new ideas include theories 
with large extra dimensions and brane world scenarios, which, as a consequence, 
modify Newton’s Law of Gravitation at short distances. Experiments are underway 
to verify such modifications. 

In recent years, the standard Weinberg-Salam model of strong and elec- 
troweak interactions has been further confirmed by a number of various exper- 
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iments. This model has been tested with unprecedented accuracy at the level of 
high-order radiative corrections. In particular, new results on CP- violât ion are in 
good agreement with the predictions of the model. At the same time, the model 
is not complete and the search for the theory beyond the standard model is the 
main issue in high-energy physics. 

A significant number of these research projects in theoretical physics are pur- 
sued at the Abdus Salam International Centre for Theoretical Physics (ICTP) in 
Trieste. Many of you are familiar with the ICTP and have participated in its activ- 
ities. You will know that ICTP is a joint centre of UNESCO and the International 
Atomic Energy Agency (IAEA) that functions with generous financial support 
from the Government of Italy. More than 3000 scientists from developing coun- 
tries visit Trieste every year and spend on average about 6 weeks in the Centre to 
take part in courses and schools on advanced topics of physics and mathematics 
and to carry out research. We at UNESCO are proud to provide the framework 
and administrative support needed for the operation and functioning of the ICTP 
and with it, to assume the responsibility of maintaining and developing the unique 
heritage that Professor Abdus Salam left to the world. 

I am happy to know that this conference has been successful in attract- 
ing many well-known and outstanding scientists worldwide. Some of you are ofld 
friends and many have participated in various physics activities that UNESCO or- 
ganized or sponsored. The most recent of these activities was the Eighth Session of 
the International Interim Council of SESAME. This two-day meeting was the first 
to be held since the Executive Board of UNESCO decided, at its 164^'^ session, to 
establish an International Centre for Synchrotron Light for Experimental Science 
and Applications in the Middle East. The Centre, which is to be housed in a 6200 
square metre facility at Jordan’s A1 Balqua’ Applied University in Amman, will be 
responsible for installing, operating, maintaining and upgrading the synchrotron 
light source and providing training for technicians while promoting international 
cooperation in this field. 

I am pleased that many participants to this particular conference come from 
Africa, Eastern Europe, Latin America, the Middle East, countries that made up 
the former Soviet Union, and South Asia. It was with these participants in mind, 
that UNESCO decided early on to make its conference facilities available under 
exceptional conditions. This has meant that the funds thus saved could be used 
to provide full or partial grants to participants who could otherwise not attend. I 
am pleased to announce that about two hundred participants were helped in this 
way. 

Finally, I am happy to acknowledge that the Conference will devote a Special 
Session on Theoretical Physics in Developing Countries. 

Ladies and Gentlemen, 

I wish all of you a rewarding and enjoyable time during this week at UNESCO. 
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Rise of Complexity, 1953-2002 

Philip W. Anderson 



Here is a summary of the three rather different types of topic that I should like 
to cover today. Just because I am one of the few people here who has attended 
all four of the lUPAP meetings of all of theoretical physics, Fd like to start with 
a historical slant, first touching on some reminiscences of the first three meetings. 
Then I will, still speaking historically, try to outline some of the main intellectual 
themes of condensed matter theory - a very personal selection to be sure, one 
aiming at emphasizing the open frontier at each stage towards dealing with greater 
and greater real-world complexity. Finally, Fll introduce a more specialized, but 
still broad-brush, thread in one of those major themes. 




Some reminiscences of the three predecessors of TH 2002 can be introduced 
by first showing some pictures from the world of theoretical physics as it appeared 
in 1953. The Japanese were the first to have the idea - or the temerity - of bringing 
together all flavors of theory in a single conference, their first postwar attempt at 
a major congress - the peace treaty had only just been signed, and their facilities 
were very strained. I was there by accident, rather, starting a Fulbright with Kubo, 
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and for different reasons Frank Yang and I had both brought along our five-year 
olds, who turned out to be much more popular with press photographers than the 
panjandrums of physics - this is the Asahi’s photograph of the greeting ceremony at 
the Kyoto station, with the coheads of lUPAP, Mott and Gorter, in the background 
behind my Susan. 

For Yang’s sake, I also show a snap from the ladies’ program showing his 
Franklin and my Susan learning tea ceremony. 
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A last non- physics photo: the pre- meeting festivities were interrupted by a 
typhoon, but nonetheless the intrepid Japanese bussed us to Nara over the rather 
bad roads of postwar Japan a washout in the edge of one pitched our bus into 
the ditch. After much polylingual jabber, Onsager took over leadership of a crew 
of farmers and physicists and leveraged us out of the ditch with beams borrowed 
from a bridge - as you see. The photographer, incidentally, was John Bardeen. 







Here, finally, is the whole group assembled, or most of them - Feynman was 
off somewhere, but there are Wheeler, Tomonaga, Maria Mayer, Bhabha, Wigner, 
Peierls, and many others who were in the process of making theoretical physics; 
only the Russians are missing, for obvious political reasons. 
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The Japanese were disappointed that they had not attracted any of the then 
Nobelists; but I think this meeting had possibly a uniquely large percentage of 
“Nobelists-to-be” . I note a few of the faces which will play a special role in the 
discussion of condensed matter which follows: Slater, Onsager, Mott, Yang, Frank. 

The successor meetings came at rapidly increasing intervals: 1956, 1968, 2002; 
I guessed at the algorithm and came up with around 2071 for the next one, which 
leaves us with a great responsibility here, I guess. Many people remember the mag- 
nificent talk at Seattle by Feynman on superfluid He (though after reflection one 
realizes that many of the ideas had been foreshadowed 3 years earlier in remarks 
by Onsager, who was not the expositor that Feynman was, to say the least), at the 
end of which he admitted complete bafflement about superconductivity - Blatt, 
representing an Australian group, claiming thereupon that the answer was electron 
pairs, which it was; but not in the form they were using. Some Russians appeared 
already at this one - Bogoliubov, for instance - looking very uncomfortable in 
their ill-fitting Soviet-style wardrobe. I do not know if there exists any printed 
material on this meeting. 

Trieste was the grand opening shot of Salam’s ICTP. It was magnificently 
managed but overshadowed in my mind by political events - Robert Kennedy 
was shot on the eve of the meeting, for instance. Trieste as a UN institution was 
hospitable to Russians - I remet Abrikosov after 10 years, and the Landau group 
was well represented. But I am afraid that in Condensed Matter, at least, the 
scientific program was a bit retro. 

Now I would like to limn with a VERY broad brush some main intellectual 
strands of the progress of condensed matter physics during those 50 years. [1] John 
Slater, already in 1953, was obsessed with what I have rudely called the Great Solid 
State Dream machine. He envisioned that the then new electronic computer could 
be applied to the task of automatically providing the electronic structure of any 
desired solid; and he literally believed that then we would have all the answers to 
any conceivable question. The latter idea was wrongheaded; but the former has 
gradually become a reality, with the rise of what is now known as LDA. It is not 
enough appreciated that Slater himself provided the key element in that method, 
which in modern terms can be simply expressed as assuming that the self-energy is 
well approximated as purely local in both space and time. (Therefore determined 
by the only local electronic parameter, the density.) His problem was with the 
instruments (electronic and human) and the mathematical methods necessary to 
make the dream a reality. 

The basic locality assumption is not quite right, particularly with regard to 
band gaps, which are often vital; the reason and the necessary improvements in 
tractable materials were worked out by a list of people too long to detail here, 
except to mention that the earliest and the least often credited was my student 
John Inkson. But there are many cases where it fails spectacularly: essentially all 
of the interesting class of substances with magnetic inner shell electrons, most of 
which exhibit what has been called the “Mott Phenomenon” - a dominance of the 
local repulsion between those inner shell electrons. (Mott’s is another of those 1953 
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faces). A new version of the Dream Machine has recently been invented which is 
quite successful in most of these cases - DMFT, dynamic mean field theory, cooked 
up by Georges and Kotliar, where the assumption of locality of the self-energy in 
time is abandoned, generalizing more or less the scheme for dealing with magnetic 
impurities that was worked out for the Kondo effect. 

A second theme is the wonderful theory of critical points and phase tran- 
sitions which has been so heartening in bringing together statistical mechanics, 
condensed matter, and field theorists - but this is not the appropriate subject for 
me to talk about here, as a field in which I am not a player and the meeting’s 
president emphatically is. 

My third theme begins the march up the ladder of complexity from pure, 
perfect solids, which one can think of as one or another renormalized vacuum. Al- 
ready in Kyoto in 1953 Mott spoke about what at the time was his chief concern, 
theory and experiment on dislocations in solids, and Prank was about to invent 
his famous growth mechanism which relied on spiral dislocations; while Onsager’s 
cryptic remarks introduced the subject of quantized vortices in Hell. But it was 
not until much later that we began to realize that there were generalities relating 
all such phenomena, which came with the understanding of broken symmetry as a 
common description of condensed phases in the ’60’s, and with the general topolog- 
ical theory of defects originating with Volovik and Toulouse- Kleman in 1975. This 
mathematical apparatus was made necessary by the complexities of liquid crystal 
phases (brought out by Frank and de Gennes particularly) and of superfiuid He-3. 
We recognized that essentially all dissipative and breakdown phenomena, from the 
noisy sliding of charge-density waves in “blue-bronzes” and Barkhausen noise in 
magnets to glitches in neutron stars, could be seen as related to the symmetry 
properties of the condensed phases. 

Fourth Theme: From disorder to complexity. The original liberating idea may 
have been to stop treating the dirt and disorder in our condensed matter systems 
as a nuisance to be averaged over, and to try to renormalize the properties of the 
regular solid; instead to treat disorder as a subject of study in itself. This was the 
conceptual breakthrough of localization, which was simply a gleam in my eye in 
1956, and had not yet achieved respectability in 1968 (Salam’s program was firmly 
respectable). The next step was spin glass and the idea of frustration, appearing in 
the early ’70s, and before we knew it - late ’70s, early ’80s - we were dabbling in 
complex optimization, neural networks, and many of the subjects which are parts 
of complexity theory (if there is such a thing.) Others were arriving in the same 
territory from different starting points: Edwards and de Gennes from polymers 
and glass, for instance, Bak from critical fluctuations applied to sandpiles, Hopfield 
from biophysics. We are only beginning to sample the enormous playground we 
see ahead of us. 

A final major theme was the “supers” and macroscopic quantum coherence, 
from He-II in 1953-56, to BEG in 2002. In a way, this is climbing back down the 
ladder of complexity, but I wanted to make a small historical point having to do 
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with a subset of this gigantic and active field. I’ll call my subject “the rise and fall 
and rise again of ’anomalous’ - non-BCS - superconductivity.” 

Within a year of the BCS paper which explained superconductivity as what 
we would now describe as a condensate of isotropic, “s-wave singlet” pairs made 
up roughly equally from all parts of the Fermi surface, there were suggestions of 
the possibility of anisotropic pairings, by Thouless, Pitaevskii and others. Formal 
theory (Morel and PWA) brought out the possibility of novel properties: one could 
have gapless superconductivity violating the then sacred, but wrong, Landau cri- 
terion for superconductivity, “orbital ferromagnetism” and other anisotropic prop- 
erties, and, characteristically, one could see a complex phase diagram with several 
phases having the same critical point or nearly so because of the approximate 
degeneracy of the linearized gap equation for lower pair symmetries. Although sin- 
glet s is necessarily the lowest bound state for free particles, this is not the case in 
the presence of the Fermi sea, and in fact as all authors pointed out, basically re- 
pulsive pair potentials could have higher-L bound pair states, which might seem to 
open the prospect for a very common phenomenon. In fact Balian and Werthamer 
even showed that a particular L = 1 triplet state could masquerade as ordinary, 
isotropic superconductivity. 

Unfortunately, that prospect seemed blocked by my own “dirty superconduc- 
tor theorem” which showed that, except for the s-wave case of the original BCS 
which is protected by time-reversal invariance, all such states would be very sensi- 
tive to small quantities of disorder scattering - quantities which at the time were 
unavoidable in most metals. The psychological response to this and to unsuccessful 
searches for such states was a kind of “irrational non-exuberance” - an automatic 
assumption that superconductivity could only occur in BCS form, in response to 
the phonon mechanism of dynamic screening which had been elucidated in the 
’60s. 

As Brueckner and Pitaevskii early pointed out, dirt problems would not occur 
in the Fermi liquid He-3, which brooks no impurities whatever; and as the late 
Vic Emery, in particular, pointed out, the spin fiuct nation theory of Schrieffer 
and Doniach would favor triplet superfiuidity in this case - BCS being impossible 
because the underlying potential is gigantically repulsive. As everyone knows, these 
predictions were borne out by experiment in 1972: the complex phase diagram, the 
anisotropy seen in NMR beautifully interpreted by Leggett, the spin-fluctuation 
theory partially confirmed by Brinkman & PWA. 

In 1978 and 1979 the field of superconductivity was suddenly opened up 
by the discovery of two unexpected and exotic types of superconductor. First 
Frank Steglich discovered the superconductor CeCu 2 Si 2 , a material which at low 
temperatures was a metal with renormalized electrons having an effective mass of 
the order of hundreds of electron masses, such that the transition temperature of 2 
degrees K is an appreciable fraction of the effective Fermi energy. The characteristic 
feature of this material which is most striking is the gigantic specific heat peak at 
the phase transition, since an entropy of order unity per atom is involved in the SC 
transition. These electrons are basically /-shell states, but at normal temperatures 
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all / electrons are not free, but make up local magnetic moments. Soon Ott and 
Fisk had discovered three more of these “heavy-electron superconductors” based 
on uranium, the most exotic perhaps being UBeia, and they and others showed 
that these indeed showed complex phase diagrams, impurity-sensitivity, and other 
characteristics of anomalous SC. Nonetheless the community displayed a deep 
reluctance to accept the reality of such states or of a non- phonon mechanism of 
pairing. Perhaps an important aspect causing this resistance was the general failure 
to realize that the quality of materials had steadily improved since the 1950’s, so 
that the “dirty superconductor” barrier had quietly evaporated. 

Shortly - in 1979 - Bechgaard discovered the first of another type of super- 
conductor: an organic “charge-transfer” salt in which the metallic electrons - 1/2 
per molecule travel along one-dimensional or two-dimensional stacks of one kind 
of molecule (a selenium or sulfur-containing aromatic ring structure), the other 
simply providing a charge reservoir. It is very hard to imagine how the conven- 
tional phonon mechanism could cause superconductivity in these ultra-low density, 
highly anisotropic metals, in which the intramolecular Coulomb repulsion must be 
the major interaction, but nonetheless this seems to have been the mechanism 
of choice. Several more classes of similar compounds have come forward over the 
years, with Tc’s ranging up to 13 degrees K. Thus there were already exotically 
interesting superconductors being studied before the great discovery of the high 
Tc cuprates in 1986. 

It is often forgotten that the cuprates were suggested to be d-wave already 
in 1987-88, by Affleck, Kotliar, Rice and collaborators, and in a disguised form 
Laughlin. But that dirty superconductor argument deceived me, at least, into 
rejecting the existence of gap nodes until they were forced down my throat by 
massive experimental evidence in 1994. It is still one of their great mysteries that, 
alone among all known cases of non-BCS superconductivity, they still seem very 
insensitive to scattering. The reason seems very likely to be that some form of 
“spin-charge separation” protects the pairing, which is a singlet pairing of spins 
and only develops true electron pairing at lower temperatures. 

The floodgates seem to have been thrown open by yet another exotic dis- 
covery, Sr 2 Ru 04 , proven conclusively by Maeno to be a triplet case. Since then it 
has been shown using NMR Knight shift that Bechgaard’s original substance is 
a triplet case, while another organic is definitely singlet with nodes, according to 
several experiments. 

Among the heavy-electron cases there is also considerable complexity. At 
least one new one, UGe 2 , is superconducting and ferromagnetic with the same set 
of electrons, therefore necessarily a case of triplet equal-spin pairing. On the other 
hand, Lonzarich’s group has found a new class of heavy electron cases of which 
CeCoIns is typical, which share with the cuprates association with antiferromag- 
netism, and, apparently, a singlet gap with nodes - but not impurity immunity. 
I think it is time that one recognizes that these exotics must be treated as guilty 
unless proved innocent. 
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That being accepted, the question should be: why did we not assume this 
from the start? The magnetic phenomena associated with all of them strongly 
argue that the basic interaction is strong “Mott” local electronic repulsion, and 
one can argue persuasively that the dynamic screening mechanism of BCS which 
allows phonon- mediated SC cannot work in them. There seems to be, for reasons 
one thinks one understands, an association of antiferromagnetism with singlet, and 
of ferromagnetic interaction with triplet, cases. 

I am aware that some heavyweight names have argued otherwise, that there 
is a forward peak of phonon scattering, but not at all convincingly to me. In a 
final figure I show where the fallacy of the school which argues for a strong peak 
in forward scattering by phonons comes from - such a strong peak combined with 
short-range repulsion could, according to Abrikosov and others, give anomalous 
SC. Eventually, the point is that long- wavelength phonons in a metal are charge 
neutral and do not couple to electrons. I think that in all cases the mechanism is 
electronic and related to magnetic interactions. 

It is interesting that recent work of Capone et al, and Nozieres, seems to 
suggest that “Mottness” can also enhance Tc in a special class of singlet s cases - 
but here I am abandoning my historical theme. 

In conclusion, it seems likely that non-BCS is not a strange, exotic phe- 
nomenon but, like BCS, dirt-common. After all, in nature there are huge classes of 
magnetic compounds, though the stablest materials tend to be nonmagnetic. It is 
always a mistake to underestimate the complexity of which Nature is capable. It 
also can be a mistake to believe too faithfully in theories, perhaps especially those 
promulgated by PWA. 

Philip W. Anderson 
Department of Physics 
Princeton University 
Princeton, New Jersey 08544 
USA 
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The Thematic Melodies of Twentieth Century 
Theoretical Physics* 

Chen Ning Yang 



1 Introduction 

Among the main historic achievements brought forth by physics in the 20th cen- 
tury: 

• Man discovered, for the first time since our ancestors discovered fire, the 
second and vastly stronger source of energy: nuclear power. 

• Man learned to manipulate electrons to create the transistor which led to the 
modern computer, thereby increasing human productivity greatly. 

• Man learned how to probe into structures of atomic dimensions, thereby 
discovered the double-helix, leading to the birth of biotechnology. 

• Man took first steps on the moon. 

However, from the viewpoint of physicists, the most important advances are the 
profound revolutions in our understanding of the basic concepts of physics: Space, 
Time, Motion, Energy and Force. 

There are three main strands that had persistently woven through all con- 
ceptual advances in physics in the 20th century, three Thematic Melodies in Sym- 
phonic Music. They are Quantization, Symmetry and Phcise Factor. 

2 Quantization 

Main dates are 1900: Planck, 1905: Einstein, 1913: Bohr. 

“It was the spring of hope, it was the winter of despair” 

At present I am myself most optimistic as regards the future of the theory. 
Bohr to Rutherford 1918. 

Physics is once again at a dead end at this time. For me, at any rate, it is 
much too difficult. Pauli to Kronig, May 21, 1925. 

Heisenberg ’s mechanics has restored my zest for life. Pauli to Kronig, October 
9, 1925. 

*This lecture is derived from the PowerPoint lecture of Professor Yang, respecting the exact 
original style as much as possible. The many accompanying photographs were not included for 
lack of space. 
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Do not enter into this conflict, we are both much too kind and gentle to par- 
ticipate in that kind of struggle. Both Bohr and Heisenberg are tough, hard nosed, 
uncompromising and indefatigable. We would just be crushed in that juggernaut. 
Kramers to Klein 1927, quoted in Pais “Genius of Science”, p.l59 (2000). 

It was a period of patient work in the laboratory, of crucial experiments and 
daring action, of many false starts and many untenable conjectures. It was a time 
of earnest correspondence and hurried conferences, of debate, criticism, and bril- 
liant mathematical improvisation. For those who participated, it was a time of 
creation; there was terror as well as exaltation in their new insight. It will proba- 
bly not be recorded very completely as history. As history, its recreation would call 
for an art as high as the story of Oedipus or the story of Cromwell, yet in a realm 
of action so remote from our common experience that it is unlikely to be known to 
any poet or any historian. J.R. Oppenheimer, Reith Lectures 1953. 

The actors characteristics were 

• Pauli: Power 

• Fermi: Solidity, Strength 

• Heisenberg: Deep Insigh 

• Dirac: Cartesian Purity 

3 Symmetry (= invariance) 

Main dates are 1905: Einstein, 1908: Minkowski. 

Superfluous learnedness? 

that the basic demand of the special theory of relativity (invariance of the laws 
under Lorentz- transformations) is too narrow, i.e. that an invariance of the laws 
must be postulated also relative to non-linear transformations of the coordinates in 
the four- dimensional continuum. This happened in 1908. Einstein, Autobiograph- 
ical Notes, in “Albert Einstein”, ed. P.A. Schilpp, p.67. 

With the introduction of quantum mechanics in 1925, symmetry became very 
important. The mathematical language for symmetry is groups. 

It has been rumored that the group pest is gradually being cut out of quantum 
physics. H. Weyl, Nov. 1930. 

Symmetry gradually became a thematic melody (1927-1970) of atomic, 
molecular physics, nuclear physics and elementary particle physics. A great shock 
was created by Prof. C. S. Wu in 1957: Parity Nonconservation in Weak Interac- 
tions. 

Now, where shall I start? It is good that I did not make a bet. It would have 
resulted in a heavy loss of money (which I cannot afford); I did make a fool of 
myself, however (which I think I can afford). Pauli 1957. 

Never before or afterward have I seen him so excited about physics. Heisenberg 

1978. 
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Figure 1 : The five regular solids with maximum symmetry. Reprinted from A.V. 
Shubnikov and V.A. Koptsik, Symmetry in Science and Art (Plenum, 1974). 

4 Phase Factor 

So if one asks what is the main feature of quantum mechanics, I feel inclined now 
to say that it is not non- commutative algebra, it is the phase. Dirac 1972. 

The phase factor is ^ = 0 to 360. The phase factor became important 
through the proposal of Weyl in 1918, modified in 1927. Weyl introduced in 1918 
the gauge factor e^ Then London and Fock added in 1927 i = so that the 

gauge factor e^ became the phase factor 

Gauge Theory follows from the flexibility of the phase factor. It leads to the 
electromagnetic equation 

The Phase Factor of Weyl in 1918, exp[— (e/ 7 )A^d:r^], is a stretch factor. In 
1922 Schrodinger remarks that along a Bohr orbit, the exponent is — nh /7 with n 
integer, a “Remarkable Property”. Since 7 = —ip this stretch factor is 1. 

The de Broglie interpretation of the quantum rules seems to me to be re- 
lated in some ways to my note in the Zeitschrift Fiir. Phys. 12, 13, 1922,. . . The 
mathematical situation is, as far as I can see, the same, only from me much more 
formal, less elegant and not really shown generally. Schrodinger to Einstein, Nov 
3, 1925. 

The three thematic melodies were introduced in the first half of the century, 
their developments in the next half century were Developments, Variations and 
Inter twinings. 
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Figure 2: History of the Phase Factor 



The generalization of Gauge Symmetry is from p—eA to p—eB. It is motivated 
by the discovery of more and more strange particles which need a general principle 
for interaction. 

Conservation of charge => electromagnetic field 

Conservation of energy => gravitational field 

Why other conservation laws do not lead to specific field? 

Conservation laws were related to global gauge transformation. It is not con- 
sistent with the localized field concept. 

Non- Abelian Gauge Field uses the Mathematical Language of Symmetry, 
namely Groups, created by Galois (1811-1832) and Lie (1842-1899). The simplest 
Lie Group is the Phase Factor Non- Abelian Lie groups are generalizations of 

this Phase Factor. 

Flexibility in Definition of Phase lead in 1929 to the understanding that 
ElectroMagnetism is Gauge Theory. 

Flexibility Generalized lead in 1954 to Non-Abelian Gauge Theory. Non- 
Abelian gauge field, which was introduced in 1954, was initially found not consis- 
tent with experimental results. 

1960s: Breaking of Symmetry. Standard Model Symmetry Dictates Interac- 
tion. 

The propagator is / exp\^^{action)\d{path) (Feynman). 

The relationship between gauge theory and 20th century mathematics lead 
to consider Fiber bundles and Topology. 
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5 Conclusion 

The three thematic melodies of the 20th century led to a new understanding of 
the basic concepts of physics, Space, Time, Motion, Energy, Force. 

One can trace back the origin of the three thematic melodies: 

Early concepts related to Quantization: 

• Democritus (450 be), Atoms 

• Zeno (300 be). Continuity 

• Zhuang-zhou (300 be). Continuity 

=> Quantization of action (not of matter) 

Early concepts related to Symmetry; 

• Anaximander (600 be) 

• Pythagoras (510 be). Harmony of the Spheres 
=> Non- Abelian Lie Groups 

Early concepts related to Phases: 

• Phases of the Moon 

• Cycling of four seasons 

=> Flexibility of phases determines equations governing fundamental forces 

Through more than a century of hard work by mathematicians and physicists, 
these three primordial and inaccurate concepts became the thematic melodies 
of twentieth century theoretical physics. And these thematic melodies are the 
underlying spirit of todays theoretical physics. They will continue to lead the 
development of physics in the next thirty to fifty years. 

Chen N. Yang 

Room 352A 3rd Floor 

Science Centre North Block 

The Chinese University of Hong Kong 

Shatin, Hong Kong 

China 

Email: cnyang@cuhk.edu. hk 
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Probing Physics at the Planck Scale 



G.G. Ross 



Abstract. We review the ideas for extending the theory of the strong, electromag- 
netic and weak interactions to the Planck Scale and a unification with the gravita- 
tional interaction. 



1 Introduction 

One of the triumphs of modern physics has been the construction of the “Standard 
Model”, the theory of the strong, electromagnetic and weak interactions [1]. The 
Standard Model is a quantum field theory based on the principle of local gauge 
invariance. It describes essentially all the observed phenomena of the fundamental 
interactions as proceeding through the exchange of the elementary quanta of the 
force carriers; the photon for the electromagnetic interaction, the gluons for the 
strong interaction and the and Z bosons for the weak interactions. However 
many physicists think that the Standard Model is incomplete and are pursuing 
the elusive “Theory of Everything” which they think represents the ultimate uni- 
fication of the fundamental forces, including gravity. In this talk I will discuss how 
such searches have led to a discussion of Physics at the Planck Scale where the 
unification of the fundamental forces, including gravity, is thought to occur. 



2 The Standard Model 
2.1 The states of matter 

The Standard Model describes all the matter we observe in the universe in terms 
of a small number of elementary particles with interactions described by simple 
laws. The protons and neutrons which make up the atomic nucleus are made from 
elementary constituents, the (charge |) up and the (charge - ^) down type quarks 
and the “gluons”, the carriers of the strong force which holds them together. In 
addition the Standard Model includes the electron (charge -1) as an elementary 
state and its neutral partner, the neutrino. These “leptons”, like the quarks, feel 
the electromagnetic and weak forces but, unlike the quarks, do not feel the strong 
force. To complete the Standard Model, three families or copies of the quarks and 
leptons are needed to describe the states currently observed c.f. Figure 1. 
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Figure 1: The matter states of the Standard Model 



2.2 The interactions 

The laws that govern the strong and weak interactions follow from the same local 
gauge principle applied to a relativistic field theory that led to Quantum Electro- 
dynamics (QED), the quantum version of Maxwell’s theory of electromagnetism. 
The underlying ingredient is the recognition of a symmetry relating the states of 
the theory. Such a symmetry is based on patterns and the patterns of the Stan- 
dard Model are illustrated in Figure 1. The experimental observation of the weak 
interactions of the left-handed electron and its neutrino suggest that these states 
are intimately related and have similar properties. In the Standard Model these 
states are assigned to doublets of the group SU (2) as is shown in Figure 1. The 
weak force is due to the exchange forces generated by with the gauge bosons of 
SU (2)^ the and Z bosons. The charged weak interaction arises through the 
exchange of a W boson which changes a left-handed electron into a left-handed 
electron neutrino. The neutral weak interaction arises through the exchange of a 
Z boson which couples an electron. The representation is “chiral” in the sense 
that the transformation properties of the left- and right- handed states differ. The 
right-handed components of the electron, muon and tau are singlets under 51/(2), 
hence do not change under an SU (2) rotation. In turn this means they carry no 
weak charge and do not couple to the weak gauge bosons and do not participate in 
the weak nuclear force. Note that we have not observed right-handed components 
of neutrinos and they are not included in the Standard Model particle content. As 
may be seen from Figure 1 the same pattern is repeated for the quarks with the 
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left-handed components assigned to doublets and the right handed states singlets 
under SU {2). 

The strong interactions result from the gauge bosons of an SU (3) local gauge 
symmetry, with each quark coming in three “colours” and belonging to a triplet 
of states related by SU (3) rotations as shown in Figure 1. There are eight gauge 
bosons, “gluons”, associated with SU (3) corresponding to the different ways they 
can couple to the three colours of quarks. 

In order to include electromagnetism in the Standard Model, it is necessary to 
add a “1/(1)” gauge symmetry corresponding to the local conservation of electric 
charge. This gives rise to a single gauge boson B (since there is only one type 
of electric charge). The photon is made up of a mixture of B and the gauge 
boson of Figure 3a. The orthogonal combination of the B and gauge bosons 
is the neutral weak boson, the Z. This completes the gauge symmetry structure 
of the Standard Model. The full gauge symmetry is SU{3) (Z) SU{2) (g) ^7(1). 

The Standard Model has been found to be amazingly successful in describing 
all phenomena associated with the strong and electroweak interactions, the success 
culminating with the discovery of the W and Z bosons in 1983 at CERN [3] and 
the gluons at DESY [4]. 

2.3 Partial Unification 

The exchange forces associated with these gauge bosons provides us with a par- 
tially unified quantum description of the strong, the weak and the electromagnetic 
interactions and, to date, essentially all observed phenomena are consistent with 
the predictions of the Standard Model. A very important property of these inter- 
actions is that the strength of the interactions depends on the length scale being 
probed, a property due to the screening effects of the virtual states necessarily 
present. 

This is illustrated in Figure 2 [5]. It may be seen that the strength of the 
strong force reduces as the momentum scale increases (or the distance scale de- 
creases) the effect being dominated by the antiscreening effect of the virtual gluons. 
As a result at high momentum scales the strength of the strong force approaches 
that of the electromagnetic force which is also shown. Finally it may be seen 
that the strength of the weak force also approaches that of the other forces at 
high scales. In addition to a small screening effect there is a significant change in 
the effective weak coupling due to the weak propagator. The dimensionless cou- 
pling characterizing the underlying strength of the weak interactions is given by 
Gf{Q)Q^ where Gf{Q) is the Fermi coupling. In the Standard Model this cou- 
pling is due to the exchange of a FF boson so Gf{Q)Q^ = 9^ /^-Q^ /{Q‘^ + ^w) 
where g 12^/2 is the gauge coupling constant and T M^) is the gauge boson 

propagator. At low scales Gf(Q)Q^ — jM^ and is weak due to the large 

W mass Mw — SOGeV. At high scales Gf{Q)Q^ — g^ / 3 and, as may be seen from 
Figure 2, is intermediate in strength between the strong and weak interactions. 
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GK(Q)Q" = g^/8.QW+^V) 

Figure 2: The experimental meeisurement of the running of the strong force with 
the momentum scale Q at which it is measured. Also shown are the electromagnetic 
and weak interactions as a function of the momentum scale. 



However the approach of the coupling strengths, while suggestive, represents 
only a partial unification in as much as the couplings are not equal and there 
is no explanation in the Standard Model why they should even approach each 
other because the gauge group, SU{3) (g) SU{2) 0 ^7(1), involves three unrelated 
factors. Moreover, as may be seen from Figure 1, there is a complicated (chiral) 
multiplet structure needed which has been input by hand to generate the observed 
structure. Perhaps most importantly, the unification is of only three of the four fun- 
damental interactions and there is no connection with the gravitational force. The 
gravitational coupling is proportional to Newton’s constant with inverse mass di- 
mensions, Gat oc Mpianck is the Planck mass, approximately lO^^GeP/c^ 

(the unit of mass used in particle physics is 1 GeV j(? ^ approximately the mass of 
the proton). The weak coupling is given at low energies by the Fermi constant, 
Gp oc Af^^, where the W boson mass Mw is approximately X^^GeV j (? some 17 
orders of magnitude smaller than the Planck mass. The quark and charged lepton 
masses range from ^V^~^GeV j (? for the electron to approximately VJhGeV j (? for 
the top quark while the neutrinos in the Standard Model are massless. For many 
the origin of this hierarchy is the most pressing of all the questions raised by the 
Standard Model. 
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3 Grcind Unification 

Apart from the inclusion of gravity, an answer to the questions raised by the partial 
unification of the Standard Model is provided by Grand Unification. Grand Unified 
theories (GUTs) seek to embed the Standard Model in a unified structure which 
can relate its multiplet structure and interactions by extending the patterns that 
led to the Standard Model. The archetypical GUT is based on the symmetry 
group 5U(5), chosen because it is the smallest group which can accommodate the 
SU{3) G SU {2) (g) U{1) gauge group of the Standard Model [6, 7, 2]. In this the 
strong, the weak and the electromagnetic forces are just different facets of the 
one underlying {SU{U)) local gauge interaction. The quarks and leptons are also 
related in SU (5) because they belong to the common multiplets as shown in eq.(l). 
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Of course there must be an explanation for the non- observât ion of the additional 
gauge bosons implied by SU{b). Provided the mass of these bosons, Mx, is larger 
than the current energy of particle accelerators we will not have been able to find 
the X bosons directly. However a much stronger bound is available through the 
virtual effects of the X bosons since they mediate new processes which have not, 
so far, been observed. In particular the new interactions of SU (5) mediate proton 
decay. The current experimental limit on the decay lifetime of the proton is an im- 
pressive 10^^ years giving a limit on the X boson mass of Mx > / c^ \ Note 

also that in SU{5) there is no room for a right-handed neutrino component. This 
offers an explanation for the different properties of the neutrino for the absence 
of this state is what forces the neutrino to be massless in the Standard Model. 
In extensions of SU(5) to more unified GUTs such as 6'O(10) the missing right- 
handed neutrino appears. However it is expected to acquire a mass at the stage of 
spontaneous symmetry breaking of5'O(10) to SU (5) which we expect to be very 
large and greater than Mx- As a result the left-handed neutrino states remain 
very light with mass 0{M^ fMx) with the result that the left-handed neutrino 
states can only acquire a very small mass, in agreement with recent measurement. 
In fact 50(10) is a very attractive extension of SU (5) because a single 16 dimen- 
sional representation contains all the states of a single family plus the right handed 
neutrino. Thus it enormously simplifies the multiplet structure of Figure 1, leaving 
only the family replication to be explained, perhaps by extending the symmetry 
to include a family symmetry. 

There remains, of course, the question of determining the couplings and 
masses in the Standard Model. In a GUT, such as 5U(5), with a single group 
factor, there is a single gauge coupling constant describing the strong, weak and 
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Figure 3: Plot of the (inverse) couplings of the Standard Model gauge group factors 
up to the Grand Unified mass scale 



electromagnetic couplings. Thus it offers an explanation of the partial unification 
of couplings observed in Figure 2. 

However the unification of couplings should apply at a scale at which the 
differences between the gauge bosons of SU (5) can be neglected. This means we 
should continue the couplings above the mass of the X bosons. This is shown in 
Figure 3 where it may be seen that the couplings continue to approach each other, 
suggesting a true unification [9] . Moreover the scale at which they come together is 
very high, in agreement with the scale needed to inhibit proton decay. However, in 
detail, the couplings never meet, the discrepancy being 8 or 9 standard deviations. 

4 Supersymmetry 

Grand Unification, while very attractive, introduces a fundamental problem. This 
occurs because in a field theory with pointlike couplings, radiative corrections 
probe all virtual momentum scales. As a result they can generate large correc- 
tions to particle masses unless the mass is protected by a symmetry. Most of the 
states of the Standard Model are protected. The gauge bosons cannot acquire a 
mass while the gauge symmetry is unbroken and thus the gluons and the pho- 
ton remain massless even after radiative corrections are included. The fermions 
belong to chiral representations and similarly cannot acquire mass while the Stan- 
dard Model gauge group is unbroken. The problem occurs in the sector needed 
to spontaneously break the SU (2) gauge symmetry. In this spontaneous breaking 
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the order parameter is supplied by the vacuum expectation value of a scalar field, 
the Higgs boson. However its mass is not protected by a symmetry and so receives 
large radiative corrections which drive it up to the Grand Unified scale. This would 
mean that the SU (2) symmetry is broken at the scale Mx, giving mass of 0(Mx) 
to the W and Z bosons and to the quarks and lepton. To avoid this unaccept- 
able conclusion requires that the symmetry of the Standard Model be extended to 
include a new symmetry capable of protecting the Higgs scalar mass from large 
radiative corrections. 

The only possible symmetry capable of keeping a Higgs boson light is su- 
persymmetry (plus a chiral symmetry) . Supersymmetry [8, 2] assigns scalars to 
supermultiplets with fermion partners and thus their properties are related : 

S {J = 0) il) (J = i) requires 

Thus if the fermion mass my, is forbidden by a chiral symmetry as discussed 
above the scalar mass will also vanish. In this case there must be supersymmetric 
partners to the Standard Model states. In order to prevent radiative corrections 
reintroducing the hierarchy problem, the states must be quite light, less than or 
of order 1 TeV. 

The extension of the Standard Model to include supersymmetry requires the 
addition of many new states. These include the “squarks” and “sleptons”, spin 
zero partners of the spin-one-half quarks and leptons respectively. Also needed are 
the “gluinos” , the “Wino” , the “Zino” and the “photino” , spin-one-half partners 
of the Standard Model gauge bosons, the gluon, the W, the Z and the photon 
respectively. Although these new states are thought to be considerably heavier than 
their partners, and hence should not yet have been found directly in laboratory 
experiments, they will contribute as virtual states to the radiative corrections 
discussed above. The exciting observation that solves the hierarchy problem is that 
the new SUSY radiative corrections have the same magnitude as the Standard 
Model contributions but opposite sign. Thus, when SUSY is exact, there is no 
correction to the Higgs mass. When SUSY is broken at a scale Msusv the new 
supersymmetric states acquire mass of 0{Msusy) and this cancellation is spoilt. 
However the resultant contribution is of 0{Msusv) and provided this is not too 
large (z.e. < O(lTeU)) it is consistent with the observed electroweak breaking. 
In summary the supersymmetric solution to the hierarchy problem requires new 
physics beyond the Standard Model in the form of the spectrum of new SUSY 
states at a scale accessible to the next generation of accelerators. 

5 Unification of couplings 

Although there is no direct evidence for a new supersymmetric state, the is strong 
indirect evidence for the existence of such states coming from the unification of 
gauge couplings. This follows because the new supersymmetric states must be light 
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Figure 4: Evolution of the gauge couplings in the supersymmetric extension of the 
Standard Model 



to protect the Higgs from receiving large radiative corrections to its mass. These 
light states also contribute to the screening of the gauge couplings and thus affect 
the gauge unification prediction. In Figure 4 1 show the evolution of the couplings in 
the Supersymmetric Standard Model. One may see that that the supersymmetric 
prediction is quantitatively better than the non-supersymmetric case shown in 
Figure 3. This only works if the new supersymmetric states are “light” on the 
Grand Unified scale with masses less than ITeU/c^ just as is required if one is to 
keep the Higgs boson light. 

A better way to illustrate this may be obtained by using the assumed uni- 
fication to predict the value of the strong coupling in terms of the weak and 
electromagnetic couplings [11]. This is shown in Figure 5 where one may see that 
the predicted region in the sin^vv ~ <^s plane occupies only a very small part of the 
a priori allowed region. This is quite impressive already as it appears to be better 
than one part in one hundred. If one adds the constraint that Mx must be large 
to avoid rapid proton decay this region shrinks even further. Using the measured 
value of the strong coupling one may use this plot to determine the prediction for 
sin^ One finds 



sin^ Ow = 0.2314 ± 0.002 - 0.25(o, - 0.119)) 
to be compared with the experimental value sin^ 6w = 0.2312 ± 0.0002. 
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Figure 5: Graph (bottom left) illustrating the precision of the test of gauge unifi- 
cation. The area between the two curves is the predicted region allowing for the 
uncertainties in the supersymmetric particle masses with no constraint on the uni- 
fication scale. The graph (upper right) indicates the prediction for the unification 
scale that results from the requirement that the gravitational strength should also 
unify. 



5.1 Unification with gravity 

Given the success of the unification prediction one may ask whether the unification 
can extend to include gravity as well. Following our discussion of the weak force 
where the relevant dimensionless coupling was Gf{Q)Q^- In this case the relevant 
coupling is Gn{Q)Q‘^ where Gn{Q) is Newtons constant. In Figure 5 I show that 
this coupling also comes close to unification with the other couplings provided 
the unification scale is vey high, close to the Planck scale, In my opinion this is 
a very strong hint that the four fundamental interactions do unify and motivates 
the search for the underlying theory capable of doing so. The only candidate is 
the (super) string. 

6 String Unification 

Since others at this meeting will talk about string theories themselves, I will con- 
centrate on the question how is unification changed in superstring theories? String 
theory is the only candidate we have for a unification of all the fundamental in- 
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teractions including gravity [12]. Of course at low energies one is looking at the 
four dimensional effective field theory which results from string theory defined 
at the Plack scale. Often this involves a stage of compactification of some of the 
space-time dimensions and in the low energy theory these act as internal degrees 
of freedom. The resulting 4D theory has several promising properties : 



• The symmetries are specified by the underlying string. For consistency the 
string theory should be supersymmetric and this symmetry may survive in 
the low energy effective theory offering an explanation for the hierarchical 
structure of masses observed. In the heterotic string case there is an Es ^ Es 
gauge symmetry before compactification leading to an Es Eq on compact- 
ification on a Calabi Yau manifold. The resulting gauge group can easily 
accommodate the Standard Model but also can be larger. For example it 
can incorporate the Grand Unified groups SU(5), SO (10) or Ee- The only 
symmetries that are known to descend from the string are gauge symmetries, 
either continuous or discrete. Unfortunately there are many possible vacua 
in the compactified 4 dimensional theory and we do not know how to select 
between them. 



• The multiplet structure is specified in a given string vacua. Usually this in- 
cludes some number Ug of chiral families + some number ny oï vector like 
states which come in complex conjugate pairs. The latter are expected to 
acquire mass at a high scale through a stage of symmetry breaking below 
the compactification scale. Many 3 generation examples are known. More- 
over, if the string theory is built from level- 1 Kac Moody level theories the 
representation content of the theory is restricted offering an explanation to 
the question why only low lying representations of quark and leptons are 
observed. 



• String theory has only one fundamental parameter, the string scale which 
can be related to the Planck scale. All other parameters are determined in 
terms of the vacuum expectation values (vevs) of moduli fields, For 
example the string coupling constant is determined by 

gy\Mstr^ng) = ( 2 ) 



where ki is a parameter associated with the particular string construction 
and 



2 

9 string 



1 

< 5 >’ 



(3) 



Here 5 is a moduli field known as the dilaton. In the absence of supersym- 
metry breaking the moduli fields have no potential and any value for their 
vacuum expectation value gives a viable string theory. Once supersymmetry 
is broken, the moduli’s vev will be fixed and the coupling will be determined 
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as in eq 3. Similarly the Yukawa couplings which determine the quark and 
lepton masses are determined as functions of additional moduli known as 
complex structure moduli. (Unfortunately we are still far from an under- 
standing of the string vacuum structure and hence from a prediction of the 
string coupling.) 



6.1 M-theory determination of the gauge unification scale 

The prediction for the gauge unification scale in the weakly coupled heterotic string 
follows from the general form of the 4D Lagrangian [15] 

Le// = -Jd^<^œ^a^,\±R+±TrF^ + --.) 

= -Jé:,V^aTo\~R+—TrF^ + ---) (4) 



In this we may see that Newton’s constant, Gni and the value of running gauge 
couplings at the unification scale, acuT^ are given in terms of the lOD string 
coupling a 10 , the string tension a' volume of the 6D compactified 

space V by 



Gn 



fA 

647tU ’ 



^GUT = 



/3 

aioa 

167tU 



(5) 



For the case that o^io is small the volume V is approximately and one 

obtains eq(6) eliminating V between the two equations. 



■^string ~ Q string ^ (5.2 X lO^^GeF) ^ 3.6 x lO^^GeV (6) 



which, c.f. Figure 3, is only a factor of 20 above the “observed” gauge unification 
scale [16]. 

It is difficult to overemphasize the potential importance of this result. Our 
belief that there is a stage of Grand Unification of the strong, weak and electro- 
magnetic interactions rests largely on the quantitative success of the unification of 
the associated couplings. The prediction of the unification scale would be the first 
indication of unification with gravity. 

While the prediction within a factor of 20 is encouraging, the residual dis- 
crepancy raises some doubts. One promising explanation comes from the string 
itself. If one uses the “measured” value for V = 0((1 — 3).10^^GeU~^) one may 
obtain the value of «lo instead. This gives an enormous value, quite inconsistent 
with the assumption of weak coupling that went into the derivation of eq(5). Thus 
the failure of the prediction of eq(6) is not surprising - it was the wrong calculation. 
Instead one should go to the strongly coupled case. 

That it is now possible to discuss strong coupling relies on an understanding 
on relations between strong coupling and weak coupling regimes known as duali- 
ties. It has long been known that there are five distinct classes of string theory, a 
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fact that has caused concern because one might hope that the “Theory of Every- 
thing” is unique. However developments in string theory have shown that there 
is a rich class of duality symmetries relating various string theories [13, 14] so 
now the expectation is that all string theories are related to each other and to an 
underlying theory known as “M-theory”. This theory lives in eleven dimensions 
and the various string theories may be viewed as different limits of this underlying 
theory. In addition it is now realised that eleven dimensional supergravity also 
belongs to this web of inter-related theories. A feature of the dualities relating 
different theories is that they often relate one theory in the weak coupling (pertur- 
bative) limit to another theory in the strong (nonperturbative) limit or one theory 
in the small compactification radius limit to another in the large compactification 
limit. By making use of these dualities one may gain insight into non-perturbative 
physics by performing calculations in the dual theory in which the perturbation 
series makes sense. 

In the strongly coupled heterotic string case Horava and Witten [15] showed 
that the theory is equivalent to M-theory. Although it is not possible to construct 
M-theory explicitly it is possible to write down the long-distance effective field 
theory coming from M-theory. It is just 11 dimensional supergravity compactified 
on a six-dimensional Calabi-Yau space times a line interval, CY 3 (g) The 

effective Lagrangian in this case is given by [15] 



giving 



Gn = 



Wtt^VRu 



OLGUT = 



(47TKf,)2/3 



2V 



(8) 



The reason this form changes is that the gravitational fields now propagate in the 
bulk of the 11th dimension while the gauge fields live on the end points of the line 
integral (one Eg on each fixed point) and propagate only in lOD. One may see 
that this changes the relation between Newton’s constant and otcuT through the 
appearance of the radius of the 11th dimension, Rn. Given the appearance of a 
new parameter it is clear one may adjust it to eliminate the discrepancy between 
scales found in the weakly coupled case. This gives [16] 



ttRh = (5.10^^GeH)“^ 



(9) 



While it is encouraging that the discrepancy between scales has been eliminated, 
apparently the predictive power has been lost. In order to determine the size of the 
compactified dimensions it is necessary to understand the vacuum structure of the 
theory which determines the magnitude of the moduli field setting this scale. At 
present this is beyond our ability so the unification with gravity, while suggestive, 
remains unproven. 
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7 Probing the Planck scale 

In this talk I have described some very promising ideas for the complete unifi- 
cation of the fundamental forces, including gravity. Will these ideas be tested in 
the foreseeable future? As we have discussed there is already evidence for unifica- 
tion coming from the unification of the gauge couplings of the theory. Additional 
predictions of relations between the masses of the fundamental matter fields, the 
quarks and leptons are also in good agreement with measurement. In addition 
there is a good explanation for why the SU (2) symmetry is broken to give masses 
to the W and Z bosons leaving the gluons and photon massless. These indirect 
tests of the underlying theory are encouraging. What about more direct tests of 
unification? One of the central ingredients is the need for supersymmetry with its 
associated states with a mass in the TeV range. Such states will be accessible to 
the next generation of accelerators, the LHC at CERN and the Tevatron at Fer- 
milab in Chicago. It may be that the new supersymmetric states provide a source 
of the missing dark matter in the universe and searches for supersymmetric dark 
matter are being actively pursued. 



7.1 Epilogue 

In this talk I have concentrated on the “classic” unification prediction which applies 
close to the Planck scale some fifteen orders of magnitude above the energy scale 
accessible to present accelerators. In this case it is impossible to produce the string 
states which would directly verify an underlying string unification. The best we 
can hope for is that these states may leave an imprint on the development of 
the universe after the Big Bang which is visible today in the structure of the 
universe and the microwave background. However recent developments in the study 
of string theory have raised another interesting possibility that may make the 
string states more accessible to experimental test. As discussed in the previous 
Section it is possible that the relation between the string scale and the Planck 
scale gets modified through the appearance of a new space dimension with radius 
Rii much larger than the Compton wavelength of the string states. As a result the 
underlying unification scale can be reduced. This has raised the question how large 
can such new dimensions be and how low can the string scale be? Surprisingly it is 
possible to have new space dimensions as large as 0.1mm [17]! The first indication 
of such a new space dimension would be new massive “Kaluza Klein” excitations 
of the graviton, perhaps as light as fractions of an electron volt! However these 
states couple with gravitational strength and so would not yet have been observed. 
Associated with a large new dimension is the reduction in the string scale and the 
extreme possibility is that the string scale should be as low as a TeV. In this 
case the Planck scale reflects the existence of a very small coupling and not the 
existence of very heavy states. In this extreme case one might even do without 
supersymmetry because the virtual momenta involved in radiative corrections are 
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cut-off at the TeV scale. In such schemes the next generation of accelerators would 
even be able to directly produce string excitations. 

While an interesting possibility, I consider such low scale unification schemes 
to be much less likely than the “classic” picture. If the gauge couplings are to unify 
at a low scale there must be additional light states which change the running of 
the couplings. As a result the remarkably precise prediction for the unification of 
gauge couplings discussed above would then have to be considered to be a complete 
accident. 

8 Conclusion 

The 20th century has seen the construction of a relativistic quantum field theory 
of the strong, electromagnetic and weak interactions. This theory is remarkably 
successful and its predictions are consistent with all observed phenomena. How- 
ever the theory leaves unanswered several important questions, perhaps the most 
important being what is the origin of the mass of the fundamental states and how 
does the unification extend to the fourth fundamental force, gravity? Remarkably 
there is already indication that there is an underlying “Theory of Everything” ca- 
pable of providing answers to these questions. Provided there is a new symmetry, 
supersymmetry, the observed strengths of the interactions become equal at a scale 
close to the Planck scale. In string theories the unification scale is related to the 
string or Planck scale offering a simple explanation for high scale unification. The 
small mass of the neutrinos is a natural outcome of the large scale of unification. 
The parameters determining the quark, charged lepton and neutrino masses fit well 
with an underlying GUT combined with a spontaneously broken family symmetry. 
If these ideas are correct we should be on the threshold of observing completely 
new phenomena associated with the new supersymmetric states. The very real 
hope is that the 21st century will see our understanding of the origin of mass and 
the construction of a fully unified theory of all the fundamental interactions. 
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Tachyons in String Theory 

Ashoke Sen 



Abstract. We describe some of the recent developments in our understanding of the 
tachyon in string theory. 

In this talk I shall try to address the following topics: 

1. What are tachyons? 

2. Dealing with tachyons in quantum field theory 

3. Tachyons in string theory 

4. Making sense of string theory tachyons 

5. Possible application to cosmology 

6. Outline of the derivation of various results 

7. Summary and open questions 

Let us begin with the question: What are tachyons? Historically tachyons 
were described as particles which travel faster than light. In modern days we think 
of tachyons as particles with negative mass^, i.e. imaginary mass. Both descriptions 
sound equally bizarre. On the other hand tachyons have been known to exist in 
string theory almost since its birth, and hence we need to make sense of them. 

Actually tachyons do appear in conventional quantum field theories as well. 
Consider, for example, a classical scalar field cj) with potential P(0). In p-space 
and 1-time dimension labeled by the time coordinate and space coordinates x'^ 
(1 < i < p) the lagrangian of the scalar field is: 

L = j <Fx[{docj)f - - V{d>)] . (1) 

Normally we choose the origin of (j) so that the potential V {<p) has a minimum at 
(j) = 0. In this case quantization of (j) gives a scalar particle of mass^ = P"(0)|0=o- 
This gives a positive mass^ particle. But now suppose the potential has a maximum 
at 0 = 0. Then P"(0)|0:=o is negative. Naive quantization will give a particle of 
negative maiss^. Thus we have a tachyon! 

In this case however it is clear what we are doing wrong. When we identify 
P"(0) as the mass^ of the particle, we are making an approximation. We expand 
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V{(j)) in a Taylor series expansion in 0, and treat the cubic and higher order 
terms as small corrections to the quadratic term. This is true only if the quantum 
fluctuations of (f) around 0 = 0 are small. But for if V (0) has a maximum at 0 = 0 , 
then 0 = 0 is a classically unstable point. Hence we cannot expect the fluctuations 
of 0 to be small. The remedy to this difficulty is to find the minimum 0o of the 
potential V^(0), and quantize the system around this point. More precisely this 
means that we can expand the potential around 0 = 0o, and treat the cubic and 
higher order terms in the expansion to be small. The mass^ of the particle now 
can be identified as F"(0o). This is positive since F(0) has a minimum at 0 = 0 q. 
Hence the theory does not have tachyons. 

Note that if there had been no reason to choose the origin of 0 at zero, we 
could have defined a new field ip = 0 — 0o sind expressed the potential as a function 
of 0: 

VW = V{rP + M • (2) 

If from the beginning we worked with the field 0 then we would not have en- 
countered the tachyon in the first place since \^"(0 = 0 ) is positive. But there 
are often reasons why we choose the origin of field space in a specific way. For 
example, 0 = 0 could be a point with higher symmetry {e.g. 0 — > —0). This 
symmetry will be manifest in V(0) but not so explicit in F(0). This leads to the 
phenomenon of spontaneous symmetry breaking. In such cases instead of having a 
single minimum, the potential has more than one minimum related by symmetry. 
e.g. if y(0) = F(— 0), then a minimum at 0o also means a minimum at — 0o- 

If the potential has more than one degenerate minimum, we can consider field 
configurations where 0 approaches different minimum in different regions of space. 
An example of such a field configuration is the domain wall, where we consider a 
field configuration where 1) 0 depends on one spatial coordinate 2) as ^ oo, 
0^00, 3) as — 00,0— > —00, and 4) the total energy is minimized subject to 

these constraints. For this configuration the energy density is concentrated around 
x^ ~ 0. This gives rise to a ‘codimension 1 defect’. For more complicated cases 
we can have more complicated defects (of higher codimension) . Examples of such 
defects are vortices which are codimension 2 defects, t’Hooft Polyakov monopoles 
which are codimension 3 defects, etc. In general for a codimension k defect the 
energy density is localized around a subspace of dimension {p — k). 

The lessons learned from the field theory examples may be summarized as 
follows: 

• Existence of tachyons in the spectrum tells us that we are expanding the 
potential around its maximum rather than its minimum. 

• Associated with the existence of tachyons we often have spontaneous sym- 
metry breaking and existence of defects. 

We now turn to the discussion of tachyons in string theory. String theory 
contains infinite number of single ‘particle’ states, as if it is a field theory with 
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infinite number of fields. But the conventional description of string theory is based 
on ‘first quantized’ formalism rather than a field theory. We take a string (closed or 
open) and quantize it maintaining Lorentz invariance. This gives infinite number 
of states characterized by momentum p and other discrete quantum numbers n. 
It turns out that the energy of the nth state carrying momentum p is given by 
En = yjp^ + where is some constant. Thus this state clearly has the 
interpretation of being a particle of mass rrin^ 

Quantization of some closed or open strings gives rise to states with negative 
for some n. This corresponds to a tachyon! For example, the original bosonic 
string theory formulated in (25+1) dimensions has a tachyon in the spectrum of 
closed strings. This theory is thought to be inconsistent due to this reason. 

Superstring theories are free from closed string tachyons. But for certain 
boundary conditions, there can be tachyon in the spectrum of open strings even 
in superstring theories. Thus the question is: Does the existence of tachyons make 
the theory inconsistent? Or does it simply indicate that we are quantizing the 
theory around the wrong point? The problem in analyzing this question stems 
from the fact that unlike the example in a scalar field theory, the tachyon in string 
theory does not obviously come from quantization of a scalar field. Thus in order 
to understand the tachyon, we have to reconstruct the scalar field and its potential 
from the known results in string theory, and then analyze if the potential has a 
minimum. 

It turns out that for open string tachyons we now know the answer in many 
cases. On the other hand, closed string tachyons are only beginning to be explored. 
Hence we shall focus mainly on open string tachyons in this talk. 

There are five consistent, apparently different, superstring theories in 9-space 
and 1-time dimension. We shall focus on two of them, known as type II A and type 
IIB string theories. Elementary excitations in this theory come from quantum 
states of the closed strings. But besides these elementary excitations these theories 
also contain ‘composite’ objects known as D-branes or more explicitly Dirichlet 
p-branes. 

A Dp-brane is a p-dimensional objects. Thus for example DO-brane corre- 
sponds to a particle like object, a Dl-brane corresponds to a string-like object, a 
D2-brane corresponds to a membrane like object and so on. But unlike the kinks 
and other defects in field theory which are associated with classical solutions of 
the equations of motion of the fields, D-branes are defined by saying what hap- 
pens in their presence rather than by saying what they are. Consider, for example, 
a static flat Dp-brane in flat space-time, lying along a p-dimensional subspace. 
The definition of a Dp-brane is simply that fundamental strings can end on the 
p-dimensional hypersurface along which the D-brane lies. This has been illustrated 
in Fig.l 

Quantum states of a fundamental open string with ends on a D-brane repre- 
sent quantum excitation modes of the D-brane. D-branes need to satisfy various 
consistency requirements, and as a result D-branes for different p have different 
properties. For type IIA string theory, these properties are summarized as follows: 
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Open string 

' 'A 




D-brane 



Figure 1: Fundamental strings (shown by dashed line) ending on a D-brane (shown 
by solid line). 



1. For even p, Dp-branes are oriented and are known as BPS D-branes due to 
some special properties which they possess. For these branes, the mass per 
unit p- volume, also known as the tension Tp of the brane, is given by 

'X = ^ 

" (27T)Pp, ’ 

in a unit in which the tension of the fundamental string is We shall use 
this unit throughout this talk. Qs is a dimensionless constant known as the 
string coupling constant. We shall do all our analysis to lowest order in the 
perturbation expansion in 

It turns out that all open string states on a BPS D-brane have mass^ > 0. 
Hence there are no tachyons in the spectrum. 

2. For odd p, the Dp-branes are unoriented (non-BPS). The tension 7^ of a 
non-BPS D-p-brane is given by, 

" (27r)Pp, • 

Each such D-brane has one open string mode with mass^ = — In other 
words, there is a tachyonic mode on each of these non-BPS D-p-branes. 



For type IIB string theory the situation is reversed. There are now oriented 
(BPS) Dp-branes for odd p and unoriented (non-BPS) Dp-branes for even p. The 
results that we shall discuss will be valid both for type IIA and type IIB string 
theory. Whether we are talking about type IIA theory or type IIB theory should 
be understood from the context. For example, if we are refering to a non-BPS D- 
p-brane, then it should be understood that we are talking about type IIA theory 
if p is odd, and type IIB theory if p is even. 

For oriented D-branes we define an anti-D-brane (D-brane) to be a D-brane 
with opposite orientation It turns out that a coincident BPS Dp-brane D-p-brane 
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Figure 2: The tachyon on a Dp-D-j[?-brane pair cornes from open strings whose two 
ends lie on two different branes. 



pair has two tachyonic modes, each of mass^ ~ “è’ from the open strings with 
one end on the brane and one end on the antibrane (sectors (c) and (d) in Fig.2). 

Since string theory is formulated in a way that is different from a field theory, 
the method of analysis in string theory is very different from that in a field theory. 
Nevertheless it is useful to use the language of field theory to describe various 
situations in string theory. In particular, if we use the analogy with field theory 
origin of tachyons, then for a non-BPS D-p-brane, the dynamics of the single 
tachyonic mode should be described by a real scalar field T with negative mass^ 
in p-space and one time dimensions. We shall refer to T as tachyon field. For the 
D-p-D-p system, the dynamics of the pair of tachyonic modes should be described 
by a complex scalar field T with negative mass^. The results in string theory can 
be stated as if the dynamics of the tachyon T is described by an effective potential 
Veff{T) or more generally an effective action Seff{T). This is what we shall do. 

Let us begin by reviewing the properties of Seff{T) and Veff{T) which follow 
from simple considerations. First of all it is known that Seff(T) has simple sym- 
metry properties. For example, for a non-BPS D-p-brane Seff{—T) = Seff(T), 
On the other hand for a D-D system, Seff(e^^T) = Seff{T). The other property 
of that is obvious is that Veff{T) has a maximum at T = 0, since the field T is 
tachyonic. 

The interesting questions to which we would like to know the answer are: 

1. Does Veff{T) have a minimum? 

2. If it does have a minimum, then what kind of mass spectrum do we get by 
quantizing the theory around the minimum? 

3. Do we get topological defects involving the tachyon? 

etc. It turns out that the answers to many of these questions are now known. These 
results can be summarized as follows: 




S36 



Ashoke Sen Ann. Henri Poincaré 




T 



Figure 3: The tachyon potential on a non-BPS D-p-brane. The tachyon potential 
on a brane-antibrane system can be obtained by revolving this diagram around 
the vertical axis, so that we get a mexican hat potential. 



1. Veff(T) does have a minimum at some value |T| = Tq. Furthermore, at this 
minimum [1, 2] 

^eff{To)-\-Sp = 0^ (5) 

where Sp denotes the total energy density of the original system. Thus Sp 
= 7p for a non-BPS Dp brane, and Sp = 27^ for Dp - Dp system. Thus 
at |T| = To the total energy density vanishes identically. This situation has 
been illustrated in Fig. 3. 

2. |T| — To configuration describes the closed string vacuum without any D- 
brane[l, 2]. Thus around this minimum there are no physical open string 
excitations. This is natural from the point of view of string theory, since the 
total energy vanishes at T = To, and hence we can identify this configuration 
as vacuum without any D-brane. Since there is no D-brane, there should be 
no open strings in the spectrum. However, this result is very surprising from 
the point of view of a normal field theory. Shifting the point around which we 
expand the potential can make a negative mass^ state into a positive mass^ 
state, but we do not eliminate the state altogether. On the other hand here 
expanding the action around the minimum of the potential not only gets rid 
of the original tachyon state, but also gets rid of the infinite number of other 
open string states which were present. 

3. There are classical solutions of the equations of motion of T, representing 
lower dimensional D-branes [3, 4, 5, 6, 7, 8]. For example, on a non-BPS 
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Figure 4: Tachyonic kink solution representing a BPS D-(p — l)-brane. 



Dp-brane, a kink represents a D-(p — l)-brane. For this solution the energy 
density is localized around a codimension 1 subspace (x^ = 0) This looks 
like an ordinary kink solution in a field theory, but there is an important 
difference. In a conventional field theory, a defect lives on the space in which 
the field theory lives. Here, at the bottom of the potential, the object (original 
D-p-brane) whose dynamics the field theory describes disappears altogether. 
Nevertheless defects in the field can survive and describe non-trivial objects 
in the (9+l)-dimensional space-time in which full string theory lives. 

There are also other more complicated examples of ‘tachyonic defects’. For 
example, a vortex solution on a Dp- Dp pair describes a BPS D-(p — 2)-brane 
[4]. On the other hand, a ’t Hooft Polyakov monopole on a pair of coincident 
non-BPS Dp-branes describes a non-BPS D-(p — 3)-brane [7]. In this way all 
D-branes can be regarded as defects in the tachyon field living on D-branes of 
maximal dimension. This gives a more conventional description of D-branes 
as defects in the tachyon field. But more importantly this description gives 
a way to classify Dp-branes based on a branch of mathematics known as 
K-theory [5, 7]. Several new stable D-branes in various string theories have 
been discovered using this general scheme. 

So far we have only described the properties of static solutions of the tachyon 
effective field theory. Let us now turn to dynamics, namely time dependent solu- 
tions of the equations of motion. In the case of a conventional scalar field, if we 
displace the field from its maximum and let it roll down the potential, the scalar 
field will oscillate about its minimum. Energy- momentum tensor for this so- 
lution will have the form: 



Tij = -p{x°)Sij, 



Too = 6 , 



Tio = 0 . 



(6) 
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Here E denotes the energy density, and remains constant due to energy conserva- 
tion. p denotes the pressure, and will typically oscillate about an averge value (0 
for a conventional scalar field) as the scalar field oscillates about its minimum. We 
can now ask; What happens if we displace the tachyon field on a D-D pair (or a 
non-BPS D-brane) and let it roll down the hill? It turns out that in this case the 
energy density remains constant as usual by energy conservation, but the pressure 
goes to zero asymptotically instead of oscillating about 0. Thus the final state is 
a gas of non- zero energy density and zero pressure [9, 10]. 

This seems a very strange result from the point of view of a conventional 
scalar field theory in which the action is given by the sum of a kinetic and a 
potential term. We can now ask if it is possible to write down an (unconven- 
tional) scalar field theory that can describe this apparently strange dynamics of 
the tachyon. It turns out that the tachyon dynamics near the minimum of the 
potential is describable in terms of a non-standard action for the tachyon field T 
[ 11 ]: 

- J (F+'^xV{T) V'l + , (7) 

where rj is the diagonal matrix with eigenvalues (—1, 1, 1, ... 1), and V{T) is the 
tachyon effective potential which in this parametrization has its minima at T = 
±oo and its maximum at T = 0. For large T, V [T] falls off as This action as 

a candidate for tachyon effective field theory had been proposed earlier in [12, 13]. 

The classical dynamics of this system is best described in the Hamiltonian 
formalism. The Hamiltonian for this system is given by [14, 11]; 

H = j + (F(T))2 ^J 1 + (VT)2 , (8) 

where H is the momentum conjugate to T. As the tachyon rolls down the potential 
hill, V(T) 0. Thus at late time we can ignore the V{T) term in the Hamilto- 
nian. It can be shown that in this limit the equations of motion derived from the 
Hamiltonian (8) are identical to the equations of motion of a pressureless non- 
interacting fluid (dust) with the identification that |H|y^l + (VT)^ is interpreted 
as the energy density p of the dust, and —d^T is interpreted as the local (p + 1)- 
velocity of the dust particle. Thus at late time the classical solutions in this 
field theory are in one to one correspondence with the configurations of a system 
of non-interacting dust. Since dust particles at rest correspond to a pressureless 
fluid, this automatically explains the result as to why the solutions describing a 
homogeneous rolling tachyon evolve into a system of zero pressure. 

We should however add a caution that all the results quoted here are classical 
and are expected to be modified due to quantum corrections. Thus, for example, 
the pressureless gas obtained from the rolling tachyon may further decay into 
other states of the string. But as long as the coupling constant of the theory is 
small, we expect the life-time of the system to be large. This could be important 
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in cosmology in the early universe, if brane-antibrane annihilation or decay of 
non-BPS D-branes played any role in the history of the universe. 

Let us now briefly discuss a possible application of brane-antibrane annihila- 
tion process to inflationary cosmology. Typically inflation requires a (set of) scalar 
field (s) with reasonably flat potential near the top and a minimum where the po- 
tential vanishes. The question is: can we achieve this by a brane-antibrane pair or 
some variation of this configuration? The brane-antibrane pair has large energy 
near the top of the potential, but the potential is not flat near the top. Thus this 
system does not seem to be suitable for inflation. One possible way to achieve a 
flat potential is to separate the brane and the antibrane in space [15, 16]. To be 
more specific, consider (9+1) dimensional string theory with 6 of the dimensions 
wrapped into a compact space K. Take a D3-brane along the 3 non-compact direc- 
tions, placed at a given point in and an D3-brane along the same direction but 
placed at a different point of K. The D-D system has large energy density, but for 
sufficiently large separation there is no tachyon. They however have an attractive 
weak gravitational potential, and under its influence the brane and the anti-brane 
will slowly roll towards each other. From the point of view of a (3+1) dimensional 
observer, the separation between branes is interpreted as a scalar field 0. For large 
value of (j) the potential is almost flat, but there is a small potential that drives (j) 
towards smaller value. Thus if the universe started out with a large (/> which then 
slowly decreased towards zero the universe will inflate during this phase. When (f) 
becomes smaller than a critical value the tachyon develops and since the tachyon 
potential is very steep, the system quickly rolls down towards the minimum of the 
potential. In this scenario the tachyon dynamics is important for understanding 
end of inflation / reheating process. The details of this reheating process are still 
to be explored. 

So far I have only mentioned various results, but not told you how to derive 
any of these results. Let me now say a few words about the various techniques 
which are used to derive these results. In stating the various results we have rep- 
resented the tachyon by a scalar field. But we have one major problem, - that it 
is inconsistent to deal only with the tachyon and not take into accout its coupling 
with other fields which should represent the massive string states. Thus in order to 
study the classical dynamics of the tachyon field, we actually have to solve infinite 
number of equations involving infinite number of fields. 

There are various approaches to this problem, but I shall mention only two of 
them. We can use an indirect approach where we use the fact that there is a one to 
one correspondence between solutions of equations of motion in string theory and 
two dimensional conformal field theories. In this approach we directly try to get a 
solution of the equations of motion (describing the defect solutions for example) 
by constructing the corresponding conformal field theory in two dimensions. This 
avoids the need to find the tachyon potential or its coupling to other fields. This 
procedure has been used to derive analytical results both for static and dynamical 
properties. In the direct approach (based on string field theory) [17, 18, 19, 20, 21, 
22, 23] we take into accout the coupling of the tachyon to all the other fields and try 
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to solve the coupled equations for all the fields using some approximation scheme, 
known as level truncation. In this scheme, we include only fields below a certain 
fixed mass (say M). This gives a finite number of fields, and the corresponding 
equations can be solved (numerically). Then we include more fields, with mass 
below M' (M' > M) and repeat the procedure. If the procedure converges as we 
go to larger and larger cut-off on the mass, then we are on the right track. So far 
this procedure has been used to study only the static properties of the tachyon. 
In these applications the results converge rapidly to the conjectured answers. 

There are various other approaches all of which has been successful to various 
extents in studying the properties of the tachyon. I shall only list them here without 
giving any details: 

1. Renormalization group flow [24] 

2. Non-commutative geometry [25, 26] 

3. Boundary string field theory [27, 28, 29] 

etc. 

I shall now summarize the talk by emphasizing once again the main points. As 
we have seen, we now have a good understanding of the physics of tachyons which 
arise from open strings living on unstable D-brane systems (non-BPS D-brane or 
D-D system) . In these systems the minimum of the tachyon potential corresponds 
to total disappearance of the original brane. Nevertheless defects in the tachyon 
field can give rise to non-trivial D-branes which live not on the original brane but 
in the full (9+l)-dimensional space-time. Classical dynamics of the tachyon field 
near its minimum is represented by a non-standard field theory whose classical 
solutions represent configurations of non-interacting dust. 

We conclude by listing one of the main open questions. Clearly we would 
like to know if we can make sense of closed string tachyon that appears in the 
original bosonic string theory. Existence of the tachyon is the only thing wrong with 
this theory, and hence by making sense of this tachyon we may make the theory 
consistent. For this we need to 1) establish the existence of the minimum of the 
potential, and 2) find an interpretation of the physics around this minimum. This is 
still an unsolved problem. However some progress has been made in understanding 
other kind of closed string tachyons which appear in superstring theories in non- 
trivial background [30]. In each case that has been understood, the minimum of 
the tachyon potential always corresponds to some kind of stable background. Thus 
the tachyon reflects the instability of the original background to decay into the 
new background. Success of this analysis raises hope that perhaps the tachyon 
in (25+1) dimensional bosonic string theory may also be understood in a similar 



manner. 
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Neutrino Masses as a Probe of Grand Unification 

Guido Altarelli 



Abstract. We briefly review the physics of neutrino masses and mixings as derived 
from neutrino oscillation data and the existing limits. The smallness of neutrino 
masses is explained in terms of Majorana neutrinos with mass inversely proportional 
to a large scale where lepton number is violated. This scale is well compatible with 
the Grand Unification scale. In this general context possible models for neutrino 
masses and mixings are concisely presented. 



1 Introduction 

There is by now convincing evidence, from the experimental study of atmospheric 
and solar neutrinos, for the existence of at least two distinct frequencies of neutrino 
oscillations. This in turn implies non vanishing neutrino masses and a mixing ma- 
trix, in analogy with the quark sector and the CKM matrix. So apriori the study 
of masses and mixings in the lepton sector should be at least as important as that 
in the quark sector. But actually there a number of features that make neutrinos 
especially interesting. In fact the smallness of neutrino masses is related to the 
fact that p's are completely neutral (i.e. they carry no charge which is exactly 
conserved) and are Majorana particles with masses inversely proportional to the 
large scale where lepton number (L) conservation is violated. Majorana masses can 
arise from the see-saw mechanism, in which case there is some relation with the 
Dirac masses, or from higher dimension non renormalisable operators which come 
from a different sector of the lagrangian density than other fermion mass terms. 
The relation with L non conservation and the fact that the observed neutrino 
oscillation frequencies are well compatible with a large scale for L non conser- 
vation, points to a tantalizing connection with Grand Unified Theories (GUT’s). 
So neutrino masses and mixings can represent a probe into the physics at GUT 
energy scales and offer a different perspective on the problem of flavour and the 
origin of fermion masses. There also direct connections with important issues in 
astrophysics and cosmology as for example baryogenesis through leptogenesis and 
the possibly non negligible contribution of neutrinos to hot dark matter in the 
Universe. 

At present there are many alternative models of neutrino masses. This vari- 
ety is mostly due to the considerable experimental ambiguities that still exist. The 
most crucial questions to be clarified by experiment are whether the LSND signal 
will be confirmed or will be excluded and which solar neutrino solution will eventu- 
ally be established. LSND claims that there are oscillations with Am^ ~ O(leU^) 
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and small mixing angle. But KARMEN has excluded most of the signal region of 
LSND. The experiment MiniBOONE now starting will definitely decide the issue. 
If LSND is right we need four light neutrinos (assuming that CPT is conserved) , 
if not we can do with only the three known ones. Which solar solution is correct 
fixes the corresponding mass squared difference and the associated mixing angle. 
Although the Large Angle (LA) solution emerges as the most likely from present 
data, the other solutions LOW and Vacuum Oscillations (VO) are still not ex- 
cluded (the small angle solution, SA, is very disfavoured by now, after SNO data 
appeared, and we will disregard it in most of the following discussion). Indeed 
no solution is actually leading to an imposingly good fit and, for example, the 
discrimination between LA and LOW is only based on a few hints which in the 
end could be contradicted by further evidence. Hopefully very soon, when the first 
results from the KamLAND experiment will be known, one will have new relevant, 
perhaps decisive, results on this matter. Another crucial unknown is the absolute 
scale of neutrino masses. This is in turn related to as diverse physical questions 
as the possible cosmological relevance of neutrinos as hot dark matter or the rate 
of Ov/SP decay. If neutrinos are an important fraction of the cosmological density, 
say Qt, ~ 0.1, then the average neutrino mass must be considerably heavier than 
the splittings that are indicated by the observed atmospheric and solar oscilla- 
tion frequencies. For example, for three light neutrinos, only models with almost 
degenerate neutrinos, with common mass \mi,\ ^ 1 eV, are compatible with a 
large hot dark matter component, but in this case the existing bounds on Oi/PP 
decay represent an important constraint. On the contrary hierarchical three neu- 
trino models (with both signs of Am^s) have the largest neutrino mass fixed by 
\m\ ^ ^ 0.05 eV. In view of all these important questions still pending 

it is no wonder that many different theoretical avenues are open and have been 
explored in the vast literature on the subject. 

Here we will briefly summarize the main categories of neutrino mass models, 
discuss their respective advantages and difficulties and give a number of examples. 
For a more extended review and a guided tour of the existing literature see, for 
example, ref.[l] and the update in ref. [2] (in fact a reasonably complete set of 
references would take several pages). 



2 Neutrino Masses and Lepton Number Violation 

Neutrino oscillations imply neutrino masses which in turn demand either the exis- 
tence of RH (RH) neutrinos (Dirac masses) or lepton number L violation (Majo- 
rana masses) or both. Given that neutrino masses are certainly extremely small, 
it is really difficult from the theory point of view to avoid the conclusion that L 
must be violated. In fact, it is only in terms of lepton number violation that the 
smallness of neutrino masses can be explained as inversely proportional to the very 
large scale where L is violated, of order Mgut or even Mpianck- 
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Once we accept L violation we gain an elegant explanation for the smallness 
of neutrino masses which turn out to be inversely proportional to the large scale 
where lepton number is violated. If L is not conserved, even in the absence of vr, 
Major ana masses can be generated for neutrinos by dimension five operators of 
the form 

_ LJ XijLjcfxf) 

^ M ^ ^ 

with 0 being the ordinary Higgs doublet, L the LH lepton doublet, A a matrix in 
fiavour space and M a large scale of mass, of order Mqut or Mpianck- Neutrino 
masses generated by O 5 are of the order rriy ^ v‘^/M for Xij ^ 0(1), where 
V ^ 0(100 GeV) is the vacuum expectation value of the ordinary Higgs. 

We consider that the existence of ur is quite plausible because all GUT 
groups larger than SU(5) require them. In particular the fact that ur completes 
the representation 16 of SO(IO): 16=5+10+1, so that all fermions of each family 
are contained in a single representation of the unifying group, is too impressive 
not to be significant. At least as a classification group SO (10) must be of some 
relevance. Thus in the following we assume that there are both ur and lepton 
number violation. With these assumptions the see-saw mechanism is possible which 
leads to: 

= rn^M~^mD ( 2 ) 

That is, the light neutrino masses are quadratic in the Dirac m asses an d inversely 
proportional to the large Majorana mass. Note that for rrii, ^ 0.05 eV 

and rrii^ ^ m\)/M with rriD ^ v ^ 200 GeV we find M ^ 10^^ GeV which indeed 
is an impressive indication for Mgut- 

If additional non renormalizable terms like O 5 are comparatively non negli- 
gible, they should simply be added. After elimination of the heavy RH fields, at 
the level of the effective low energy theory, the two types of terms are equivalent. 
In particular they have identical transformation properties under a chiral change 
of basis in flavour space. The difference is, however, that in the see-saw mech- 
anism, the Dirac matrix tyid is presumably related to ordinary fermion masses 
because they are both generated by the Higgs mechanism and both must obey 
GUT-induced constraints. Thus if we assume the see-saw mechanism more con- 
straints are implied. In particular we are led to the natural hypothesis that rriD 
has a largely dominant third family eigenvalue in analogy to rrit , mb and which 
are by far the largest masses among u quarks, d quarks and charged leptons. Once 
we accept that mp is hierarchical it is very difficult to imagine that the effective 
light neutrino matrix, generated by the see-saw mechanism, could have eigenvalues 
very close in absolute value. 

3 Three-Neutrino Models 

We now assume that the LSND signal will not be confirmed, so that there are 
only two distinct neutrino oscillation frequencies, the atmospheric and the solar 
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frequencies. These two can be reproduced with the known three light neutrino 
species. 

Neutrino oscillations are due to a misalignment between the flavour basis, 
v' = (z^e, z/,-), where Pe is the partner of the mass and flavour eigenstate e~ in a 

LH weak isospin SU ( 2 ) doublet (similarly for and Pr)) and the mass eigenstates 
p = (z/i,z/2,z/3): 

p' = Up, ( 3 ) 

where U is the unitary 3 by 3 mixing matrix. Given the definition of U and the 
transformation properties of the effective light neutrino mass matrix 

p'^rriup' = p^U^rriuUp ( 4 ) 

U'^rn^U = Diag{mi,rri2,m3) =rridiag 

we obtain the general form of rriiy (i.e. of the light p mass matrix in the basis where 
the charged lepton mass is a diagonal matrix): 

rrii, = UmdiagU^ ( 5 ) 

The matrix U can be parameterized in terms of three mixing angles ^12, 623 and 
^13 (0 < Oij < 7t/2) and one phase {0 < (f < 2 tt), exactly as for the quark mixing 
matrix Vckm- The following definition of mixing angles can be adopted: 

/l 0 ^ \ ( 0 / C12 S12 0\ 

U = jo C23 S23 ] I 0 1 ^11 C12 0 j 7 (6) 

\0 -S 23 C 23 / \-si 3 e-^'^ 0 Ci 3 / \ 0 0 1/ 

where sij = sin Oij, Cij = cos^ij. In addition we have the relative phases among the 
Majorana masses mi, m2 and m3. If we choose m3 real and positive, these phases 
are carried by mi, 2 = |mi,2|e^^^’^ . Thus, in general, 9 parameters are added to 
the SM when non vanishing neutrino masses are included: 3 eigenvalues, 3 mixing 
angles and 3 CP violating phases. 

In our notation the two frequencies, Am^fAE (/ = sun, atm), are parame- 
trised in terms of the p mass eigenvalues by 

= ml- mf , ( 7 ) 

The numbering 1 , 2,3 corresponds to our definition of the frequencies and in prin- 
ciple may not coincide with the ordering from the lightest to the heaviest state. 

From experiment we know that C23 ~ S23 ~ 1 /V^, corresponding to nearly 
maximal atmospheric neutrino mixing, and that S13 is small, according to CHOOZ, 
si3 < 0.2. The solar angle ^12 is large for the LA, LOW and VO solutions (could 
even be nearly maximal for LOW and VO) If we take maximal S23 and keep only 
linear terms in u = sise^^ from experiment we find the following structure of the 
Ufi {î—e,id,r, i=l, 2 , 3 ) mixing matrix, apart from sign convention redefinitions 

( C12 S12 u \ 

~{si2 Ci2U*)/y/2 {Ci2 — Si2U*)/y/2 1/V^ j 

(si2 - Ci2U*)/\/2 — (ci2 + Si2U*)/\/2 1 /V^/ 



(8) 
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Given the observed frequencies and our notation in eq. (7), there are three 
possible patterns of mass eigenvalues: 

Degenerate : \mi\ \rri2\ ~ ImsI » \m^ - mj\ 

Inverted hierarchy : |mi| |w2| > imal 

Normal hierarchy : Imaj |?n2,i| (9) 

We now discuss pro’s and con’s of the different cases and argue in favour of the 
hierarchical option. 

3.1 Degenerate Neutrinos 

At first sight the degenerate case is the most appealing: the observation of nearly 
maximal atmospheric neutrino mixing and the fact that the solar mixing is also 
large suggests that all p masses are nearly degenerate. Moreover, the common value 
of \m,y\ could conceivably be compatible with a large fraction of hot dark matter 
in the universe for \m,j\ ~ 1 — 2 eV. In this case, however, the existing limits on 
the absence of neutrino- less double beta decay {Oup/S) imply a strong constraint 
on the solar mixing angle. In fact the quantity which is bound by experiments is 
the 11 entry of the i/ mass matrix, which is approximately given by: 

TTiee ~ miCos^^i2 + m2sin^0\2 ^ 0.3 — 0.5 eV (10) 

To satisfy this constraint one needs mi = —m 2 (recall that the sign of fermion 
masses can be changed by a phase redefinition) and cos^ 0 i 2 ~ siii? 0\2 to a good 
accuracy (in fact we need szn^2^i2 > 0.96 in order that |cos2^i2| = |cos^^i2 — 
siin?0i2\ < 0.2). Of course this strong constraint can be relaxed if the common 
mass is below the hot dark matter maximum. It is true in any case that a signal 
of near the present limit (like a large relic density of hot dark matter) would 
be an indication for nearly degenerate i/’s. 

In addition, for naturalness reasons, the splittings cannot be too small with 
respect to the common mass, unless there is a protective symmetry. This is because 
the wide mass differences of fermion masses, in particular charged lepton masses, 
would tend to create neutrino mass splittings via renormalization group running 
effects even if one started from degenerate masses at a large scale. For example, the 
vacuum oscillation solution for solar neutrino oscillations would imply Am/m ~ 
10~^ — which is difficult to obtain. In this respect the LA solution would be 

favoured, but, if we insist that \mi,\ ~ 1 — 2 eV, the corresponding mixing angle 
is probably not sufficiently maximal. 

It is clear that in the degenerate case the most likely origin of u masses is 
from dim-5 operators O5 = Lj \ijLj(f)(j)/M and not from the see-saw mechanism 
mi^ = mJ)M~^mo- In fact we expect the z/-Dirac mass m^ to be hierarchical like 
for all other fermions and a conspiracy to reinstaure a nearly perfect degeneracy 
between mo and M, which arise from completely different physics, looks very 
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unplausible. Thus in degenerate models, in general, there is no direct relation 
with Dirac masses of quarks and leptons and the possibility of a simultaneous 
description of all fermion masses within a grand unified theory is more remote. 

For model building there is an important quantitative difference between the 
LA solution on the one side and the LOW or VO solutions on the other side. 
While for all these solutions the relevant mixing angle 0\2 is large, the value of the 
squared mass difference Am^ is very different for LA, LOW and VO: ~ 10“^ eV^, 
~ 10~^ eV^ and ~ 10”^° eV^, respectively. Thus the gap with respect to the 
atmospheric neutrino oscillation frequency — m^, which is given by 

lAm^al 3-10-^ eV\ is moderate for LA and very pronounced for the other two 
solutions. In the case of the LA solution the ratio of the solar and atmospheric 
Am^ ranges is typically given by r = Ami2/|Am23| ~ 1/20 — 1/100. For LA one 
can reproduce the data either in a nearly degenerate or in a hierarchical model. We 
have seen that a moderate degeneracy is more likely than the extreme case where all 
masses are \rrii\ ~ O(leV). Typically we could have all mf ~ (few 10~^ — 10“^ eV‘^ 
with one comparatively not so small splitting Amf2 10~^ eV^). Actually a 
sufficient hierarchy to reproduce the LA solution (a factor of 5 in mass) can 
arise from no significant underlying structure at all. In particular, the see-saw 
mechanism, being quadratic in the Dirac neutrino masses, tends to enhance small 
fluctuations in the Dirac eigenvalue ratios. This is the point of view of anarchical 
models, where no structure is assumed to exist in the neutrino mass matrix and 
the smallness of r is interpreted as a fluctuation. But one additional feature of 
the data plays an important role in this context and presents a clear difficulty for 
anarchical models. This is the experimental result that the third mixing angle O 13 
is small, \Ues\ = \ sin 6^13 1 ^ 0.2. So, for neutrinos two mixing angles are large and 
one is small. Instead in anarchical models all angles should apriori be comparable 
and not particularly small. Therefore this is a difficulty for anarchy and, for the 
survival of this idea, it is necessary that 613 is found very close to the present 
upper bound. 



3.2 Inverted Hierarchy 

The inverted hierarchy configuration \mi\ ~ \m 2 \ » \m 3 \ consists of two levels 
mi and m 2 with small splitting Ami2 = ^ common mass given by 

r«l,2 ~ r\j 3 10 ^ eV^ (no large hot dark matter component in this case). 

One particularly interesting example of this sort, which leads to double maximal 
mixing, is obtained with the phase choice mi = —m 2 so that, approximately: 

md^ag = M[l,-1,0] (11) 

The effective light neutrino mass matrix 

m,, = UmdiagU^ 



(12) 
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which corresponds to the mixing matrix of double maximal mixing c = s = \j\f2\ 



is given by: 






c —s 
sfy/^ cj^/2 
s/V2 clV2 



0 

-1/v^ 

+1/V2 



rrijy 



M 

71 



0 1 
1 0 
1 0 



1 

0 

0 



(13) 



(14) 



The structure of can be reproduced by imposing a flavour symmetry Lg — 
Lp — Lr. The 1 — 2 degeneracy remains stable under radiative corrections. The 
preferred solar solutions are the LOW or the VO solutions. The LA can also be 
compatible but the solar mixing angle must be as close to maximal as possible. In 
fact in models of this class the solar angle is typically nearly maximal. 



3.3 Hierarchical 



We now discuss the class of models which we consider of particular interest because 
this is the most constrained framework which allows a comprehensive combined 
study of all fermion masses in GUT’s. We assume three widely split i/’s and the 
existence of a RH neutrino for each generation, as required to complete a 16- 
dim representation of *90(10) for each generation. We then assume dominance of 
the see-saw mechanism rrii^ = We know that the third- generation 

eigenvalue of the Dirac mass matrices of up and down quarks and of charged 
leptons is systematically the largest one. It is natural to imagine that this property 
will also be true for the Dirac mass of p’s: diag[mD] ~ [0,0, m^s]. After see-saw 
we expect to be even more hierarchical being quadratic in mjj (barring fine- 
tuned compensations between and M). The amount of hierarchy, m^jm^ = 
^'^sun-> depends on which solar neutrino solution is adopted: the hierarchy 
is maximal for VO, intermediate for LOW and moderate for LA solutions. A 
possible difficulty is that one is used to expect that large splittings correspond to 
small mixings because normally only close- by states are strongly mixed. In a 2 by 
2 matrix context the requirement of large splitting and large mixings leads to a 
condition of vanishing determinant. For example the matrix 



m (X 



X 



X 

I 



(15) 



has eigenvalues 0 and 1 + and for x of 0(1) the mixing is large. Thus in the limit 
of neglecting small mass terms of order mi ^2 the demands of large atmospheric 
neutrino mixing and dominance of m3 translate into the condition that the 2 by 
2 subdeterminant 23 of the 3 by 3 mixing matrix approximately vanishes. The 
problem is to show that this vanishing can be arranged in a natural way without 
fine tuning. Once near maximal atmospheric neutrino mixing is reproduced the 
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solar neutrino mixing can be arranged to be either small or large without difficulty 
by implementing suitable relations among the small mass terms. 

It is not difficult to imagine mechanisms that naturally lead to the approx- 
imate vanishing of the 23 sub-determinant. For example assume that one ur is 
particularly light and coupled to fi and r. In a 2 by 2 simplified context if we have 





A different possibility that we find attractive is that, in the limit of neglecting 
terms of order mi ^2 and, in the basis where charged leptons are diagonal, the 
Dirac matrix m^, defined by RtudL, takes the approximate form: 

"0 0 O' 

niD oc 0 0 0 . (18) 

0 X 1 

This matrix has the property that for a generic Majorana matrix M one finds: 

To 0 O' 

m^y = rn^M~^mD (x Q x . (19) 

0x1 

The only condition on M~^ is that the 33 entry is non zero. But when the ap- 
proximately vanishing matrix elements are replaced by small terms, one must also 
assume that no new o(l) terms are generated in m^y by a compensation between 
small terms in mo and large terms in M. It is important for the following dis- 
cussion to observe that mj) given by eq. (15) under a change of basis transforms 
as m'j^— > V^ttidU where V and U rotate the right and left fields respectively. 
It is easy to check that in order to make mo diagonal we need large left mixings 
(i.e. large off diagonal terms in the matrix that rotates LH fields). Thus the ques- 
tion is how to reconcile large LH mixings in the leptonic sector with the observed 
near diagonal form of Vckm, the quark mixing matrix. Strictly speaking, since 
VcKM = the individual matrices Uu and Ud need not be near diagonal, but 

VcKM does, while the analogue for leptons apparently cannot be near diagonal. 
However nothing forbids for quarks that, in the basis where rriu is diagonal, the d 
quark matrix has large non diagonal terms that can be rotated away by a pure RH 
rotation. We suggest that this is so and that in some way RH mixings for quarks 
correspond to LH mixings for leptons. 

In the context of (Susy) SU(5) there is a very attractive hint of how the 
present mechanism can be realized. In the 5 of SU(5) the singlet appears together 




Vol. 4, 2003 



Neutrino Masses as a Probe of Grand Unification 



S51 



with the lepton doublet {iy,e). The {u,d) doublet and belong to the 10 and 
to the 1 and similarly for the other families. As a consequence, in the simplest 
model with mass terms arising from only Higgs pentaplets, the Dirac matrix of 
down quarks is the transpose of the charged lepton matrix: . Thus, 

indeed, a large mixing for RH down quarks corresponds to a large LH mixing for 
charged leptons. At leading order we may have: 



rrid = {mif 



0 0 
0 0 
0 0 



Vd 



(20) 



In the same simplest approximation with 5 or 5 Higgs, the up quark mass matrix 
is symmetric, so that left and right mixing matrices are equal in this case. Then 
small mixings for up quarks and small LH mixings for down quarks are sufficient 
to guarantee small Vckm mixing angles even for large d quark RH mixings. If 
these small mixings are neglected, we expect: 



rriu 



0 0 
0 0 
0 0 



( 21 ) 



When the charged lepton matrix is diagonalized the large LH mixing of the charged 
leptons is transferred to the neutrinos. Note that in SU(5) we can diagonalize the 
u mass matrix by a rotation of the fields in the 10, the Majorana matrix M by 
a rotation of the 1 and the effective light neutrino matrix m,y by a rotation of 
the 5. In this basis the d quark mass matrix fixes Vckm and the charged lepton 
mass matrix fixes neutrino mixings. It is well known that a model where the down 
and the charged lepton matrices are exactly the transpose of one another cannot 
be exactly true because of the e/d and ji/ s mass ratios. It is also known how to 
remedy to this problem but here we are only interested in the order of magnitude 
of the matrix entries and not in their exact values. Similarly, the Dirac neutrino 
mass matrix m/:) is the same as the up quark mass matrix in the very crude 
model where the Higgs pentaplets come from a pure 10 representation of SO (10): 
rriD = rriu . For m d the dominance of the third family eigenvalue as well as a near 
diagonal form could be an order of magnitude remnant of this broken symmetry. 
Thus, neglecting small terms, the neutrino Dirac matrix in the basis where charged 
leptons are diagonal could be directly obtained in the form of eq. (15). 



4 Conclusion 

While the existence of oscillations appears to be on a ground of increasing solidity, 
many important experimental challenges remain. For atmospheric neutrino oscil- 
lations the completion of the K2K experiment, now stopped by the accident that 
has seriously damaged the Super kamiokande detector, is important for a terrestrial 
confirmation of the effect and for an independent measurement of the associated 
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parameters. In the near future the experimental study of atmospheric neutrinos 
will continue with long baseline measurements by MINOS, OPERA, ICARUS. For 
solar neutrinos it is not yet clear which of the solutions LA, LOW and VO, is 
true, although a preference for the LA solution appears to be indicated by the 
present data. This issue will be presumably clarified in the near future by the 
continuation of SNO and the forthcoming data from KamLAND (eagerly awaited 
for very soon) and Borexino. Finally a clarification by MiniBOONE of the issue 
of the LSND alleged signal is necessary, in order to know if 3 light neutrinos are 
sufficient or additional sterile neutrinos must be introduced, in spite of the ap- 
parent lack of independent evidence in the data for such sterile neutrinos and of 
the fact that attempts of constructing plausible and natural theoretical models 
have not led sofar to compelling results. Further in the future, there are projects 
for neutrino factories and/or superbeams aimed at precision measurements of the 
oscillation parameters and possibly the detection of CP violation effects in the 
neutrino sector. 

Pending the solution of the existing experimental ambiguities a large variety 
of theoretical models of neutrino masses and mixings are still conceivable. Among 
3-neutrino models we have described a variety of possibilities based on degenerate, 
inverted hierarchy and normal hierarchy type of spectra. Most models prefer one 
or the other of the possible experimental alternatives which are still open. 

The fact that some neutrino mixing angles are large and even nearly max- 
imal, while surprising at the start, was eventually found to be well compatible 
with a unified picture of quark and lepton masses within GUTs. The symmetry 
group at Mgut could be either (SUSY) SU(5) or SO (10) or a larger group. For 
example, we have discussed a class of natural models where large RH mixings for 
quarks are transformed into large LH mixings for leptons by the transposition re- 
lation rrid = ml which is approximately realised in SU(5) models. In particular, 
we argued in favour of models with 3 widely split neutrinos. Reconciling large 
splittings with large mixing(s) requires some natural mechanism to implement a 
vanishing determinant condition. This can be obtained in the see-saw mechanism, 
for example, if one light RH neutrino is dominant, or a suitable texture of the 
Dirac matrix is imposed by an underlying symmetry. For example, these mech- 
anisms can be naturally implemented by simple assignments of U(1)f horizontal 
charges that lead to a successful semiquantitative unified description of all quark 
and lepton masses in SUSY SU(5)x U(1)f- Alternative realizations based on the 
SO (10) unification group have also been discussed in the literature. 

In conclusion, the discovery of neutrino oscillations with frequencies that 
point to very small u masses has opened a window on the physics beyond the 
Standard Model at very large energy scales. The study of the neutrino mass and 
mixing matrices, which is still at the beginning, can lead to particularly exciting 
insights on the theory at large energies possibly as large as Mgut- 
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Addressing Challenges of Quantum Gravity Through 
Quantum Geometry; Black holes and Big-bang 

Abhay Ashtekar 



Abstract. In this article I present a brief review of some of the recent advances in 
loop quantum gravity - a non-perturbative, background independent approach to 
quantum gravity based on quantum geometry. The emphasis is on physical issues. 



1 Introduction 

The goal of loop quantum gravity is to address the conceptual and physical issues 
at the interface of general relativity and quantum theory, without recourse to 
a background geometry. Because of space limitation, in this article I will restrict 
myself only to two of these issues: cosmological singularities and black hole entropy. 

• Big-Bang: It is widely believed that the prediction of a singularity, such as 
the big-bang of classical general relativity, is primarily a signal that the theory has 
been pushed beyond the domain of its validity. Therefore, some of the key chal- 
lenges for any quantum gravity theory are: What replaces the big-bang? Is there a 
mathematically consistent description of the quantum state of the universe which 
replaces the classical big-bang? What can we say about the ‘initial conditions’, i.e., 
the quantum state of geometry and matter that correctly describes the big-bang? 
If they have to be imposed externally, is there a physical guiding principle? 

• Black holes: The analogy between the first law of black hole mechanics 
and thermodynamics, when coupled to Stephen Hawking’s discovery that black 
holes radiate quantum mechanically as though they are black bodies at a specific 
temperature, leads one to conclude that the entropy of large black holes should 
be given by Sbu = <^hor/4G^. This conclusion is striking and deep because it 
brings together the three pillars of fundamental physics - general relativity, quan- 
tum theory and statistical mechanics. However, the argument itself is a rather 
hodge-podge mixture of classical and semi-classical ideas, reminiscent of the Bohr 
theory of atom. A natural question then is: what is the analog of the more fun- 
damental, Pauli-Schrôdinger theory of the Hydrogen atom? More precisely, what 
is the statistical mechanical origin of black hole entropy? What is the nature of 
a quantum black hole and what is the interplay between the quantum degrees of 
freedom responsible for entropy and the exterior curved geometry? Can one derive 
the Hawking effect from first principles of quantum gravity? 

Recent advances in quantum geometry have provided illuminating answers to 
several of these questions and opened-up avenues to address others. There are also 
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several other interesting developments - notably the relation between the ‘fun- 
damental’, Planck scale theory and the low energy energy world and background 
independent path integrals provided by ‘spin- foam models - which, unfortunately, 
can not be discussed due to space limitation. 

2 Brief summary of quantum geometry 

Let us begin by summarizing the salient features and current status of quantum 
geometry. The emphasis is on structural and conceptual issues; details can be 
found in references [1-6]. 

2.1 Viewpoint 

In this approach, one takes the central lesson of general relativity seriously: gravity 
is geometry whence, in a fundamental theory, there should be no background 
metric. In quantum gravity, geometry and matter should both be ‘born quantum 
mechanically’. Thus, in contrast to approaches developed by particle physicists, 
one does not begin with quantum matter on a background geometry and use 
perturbation theory to incorporate quantum effects of gravity. There is a manifold 
but no metric, or indeed any other physical fields, in the background.^ 

In classical gravity, Riemannian geometry provides the appropriate math- 
ematical language to formulate the physical, kinematical notions as well as the 
final dynamical equations. This role is now taken by quantum Riemannian geom- 
etry, discussed below. In the classical domain, general relativity stands out as the 
best available theory of gravity, some of whose predictions have been tested to 
an amazing accuracy, surpassing even the legendary tests of quantum electrody- 
namics. Therefore, it is natural to ask: Does quantum general relativity, coupled 
to suitable matter ^r, supergravity, its supersymmetric generalization- exist as 
consistent theories non-perturbatively ? This is a fascinating open question, par- 
ticularly at the level of mathematical physics. 

In the particle physics circles, the answer is often assumed to be in the neg- 
ative, not because there is concrete evidence against non-perturbative quantum 
gravity, but because of an analogy to the theory of weak interactions. There, one 
first had a 4-point interaction model due to Fermi which works quite well at low 
energies but which fails to be renormalizable. Progress occurred not by looking for 
non-perturbative formulations of the Fermi model but by replacing the model by 
the Glashow-Salam- Weinberg renormalizable theory of electro- weak interactions, 
in which the 4-point interaction is replaced by and Z propagators. Therefore, 
it is often assumed that perturbative non-renormalizability of quantum general 

^In 2-hl dimensions, although one begins in a completely analogous fashion, in the final 
picture one can get rid of the back ground manifold as well. Thus, the fundamental theory can 
be formulated combinatorially [2]. To achieve this goal in 3+1 dimensions, one needs a much 
better understanding of the theory of (intersecting) knots in 3 dimensions. 
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relativity points in a similar direction. However this argument overlooks the cru- 
cial fact that, in the case of general relativity, there is a qualitatively new element. 
Perturbative treatments pre-suppose that the space-time can be assumed to be a 
continuum at all scales of interest to physics under consideration. This assump- 
tion is reasonable for weak interactions. In the gravitational case, by contrast, the 
scale of interest is given by the Planck length ip\ and there is no physical basis 
to pre-suppose that the continuum picture should be valid down to that scale. 
The failure of the standard perturbative treatments may simply be due to this 
grossly incorrect assumption and a non-perturbative treatment which correctly 
incorporates the physical micro-structure of geometry may well be free of these 
inconsistencies. 

As indicated above, even if quantum general relativity did exist as a math- 
ematically consistent theory, there is no a priori reason to assume that it would 
be the ‘final’ theory of all known physics. In particular, as is the case with classi- 
cal general relativity, while requirements of background independence and general 
covariance do restrict the form of interactions between gravity and matter fields 
and among matter fields themselves, the theory would not have a built-in principle 
which determines these interactions. Put differently, such a theory would not be a 
satisfactory candidate for unification of all known forces. However, just as general 
relativity has had powerful implications in spite of this limitation in the classi- 
cal domain, quantum general relativity should have qualitatively new predictions, 
pushing further the existing frontiers of physics. 

2.2 Quantum Geometry 

Although there is no natural unification of dynamics of all interactions in loop 
quantum gravity, it does provide a kinematical unification. More precisely, in this 
approach one begins by formulating general relativity in the mathematical lan- 
guage of connections, the basic variables of gauge theories of electro- weak and 
strong interactions. Thus, now the configuration variables are not metrics as in 
Wheeler’s geometrodynamics, but certain spin connections; the emphasis is shifted 
from distances and geodesics to holonomies and Wilson loops [1]. Consequently, 
the basic kinematical structures are the same as those used in gauge theories. A 
key difference, however, is that while a background space-time metric is available 
and crucially used in gauge theories, there are no background fields whatsoever 
now. This absence is forced on us by the requirement of diffeomorphism invariance 
(or ‘general covariance’ ). 

This is a key difference and it causes a host of conceptual as well as technical 
difficulties in the passage to quantum theory. For, most of the techniques used in 
the familiar, Minkowskian quantum theories are deeply rooted in the availability 
of a flat back- ground metric. It is this structure that enables one to single out 
the vacuum state, perform Fourier transforms to decompose fields canonically in 
to creation and annihilation parts, define masses and spins of particles and carry 
out regularizations of products of operators. Already when one passes to quantum 
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field theory in curved space-times, extra work is needed to construct mathematical 
structures that are adequate for physics. In our case, the situation is much more 
drastic: there is no background metric what so ever. Therefore new physical ideas 
and mathematical tools are now necessary. Fortunately, they were constructed 
by a number of researchers in the mid-nineties and have given rise to a detailed 
quantum theory of geometry [3, 4, 5, 6]. 

Because the situation is conceptually so novel and because there are no di- 
rect experiments to guide us, reliable results require a high degree of mathematical 
precision to ensure that there are no hidden infinities. Achieving this precision has 
been a high priority in the program. Thus, while one is inevitably motivated by 
heuristic, physical ideas and formal manipulations, the final results are mathemat- 
ically rigorous. In particular, due care is taken in constructing function spaces, 
defining measures and functional integrals, regularizing products of field opera- 
tor, and calculating eigenvectors and eigenvalues of geometric operators. The final 
results are all free of divergences, well-defined, and respect the background inde- 
pendence and diffeomorphism invariance. 

Let me now turn to specifics. Our basic configuration variable is an SU(2)- 
connection, on a 3-manifold S representing ‘space’ and, as in gauge theories, 
the momenta are the ‘electric fields’ Ef. However, in the present gravitational con- 
text, the acquire a geometrical significance: they can be naturally interpreted 
as orthonormal triads (with density weight 1) and determine the dynamical, Rie- 
mannian geometry of E. Thus, in contrast to Wheeler’s geometrodynamics, the 
Riemannian structures, including the positive-definite metric on E, is now built 
from momentum variables. The basic kinematic objects are holonomies of 
which dictate how spinors are parallel transported along curves, and the triads 
E^, which determine the geometry of E. (Matter couplings to gravity have also 
been studied extensively [1].) 

In the quantum theory, the fundamental excitations of geometry are most 
conveniently expressed in terms of holonomies [2, 3]. They are thus one-dimen- 
sional, polymer- like and, in analogy with gauge theories, can be thought of as 
‘fiux lines of the electric field’. More precisely, they turn out to be fiux lines of 
areas: an elementary flux line deposits a quantum of area on any 2-surface S it 
intersects. Thus, if quantum geometry were to be excited along just a few flux 
lines, most surfaces would have zero area and the quantum state would not at all 
resemble a classical geometry. This state would be analogous, in Maxwell theory, 
to a ‘genuinely quantum mechanical state’ with just a few photons. In the Maxwell 
case, one must superpose photons coherently to obtain a semi-clctssical state that 
can be approximated by a classical electromagnetic field. Similarly, here, semi- 
classical geometries can result only if a huge number of these elementary excitations 
are superposed in suitable dense configurations. The state of quantum geometry 
around you, for example, must have so many elementary excitations that ~ 10^^ 
of them intersect the sheet of paper you are reading. Even in such states, the 
geometry is still distributional, concentrated on the underlying elementary flux 
lines; but if suitably coarse-grained, it can be approximated by a smooth metric. 
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Thus, the continuum picture is only an approximation that arises from coarse 
graining of semi-classical states. 

These quantum states span a specific Hilbert space H consisting of functions 
on the space of connections which are square integrable with respect to a natural, 
diffeomorphism invariant (regular, Borel) measure [3]. This space is very large. 
However, it can be conveniently decomposed in to a family of orthonormal, finite 
dimensional sub-spaces 7Y = 0^ j, labeled by graphs 7 each edge of which 
itself is labeled by a spin (i.e., half-integer) j [4]. One can think of 7 as a ‘floating 
lattice’ in S - ‘floating’ because its edges are arbitrary, rather than ‘rectangular’. 
(Indeed, since there is no background metric on E, a rectangular lattice has no 
invariant meaning.) Mathematically, j can be regarded as the Hilbert space of 
a spin-system. These spaces are extremely simple to work with; this is why very 
explicit calculations are feasible. Elements of j are referred to as spin-network 
states [4]. 

As one would expect from the structure of the classical theory, the basic 
quantum operators are the holonomies hp along paths p in E and the triads 
[5]. (Both sets are densely defined and self-adjoint on H. Furthermore, a striking 
result is that all eigenvalues of the triad operators are discrete. This key property 
is, in essence, the origin of the fundamental discreteness of quantum geometry. 
For, just as the classical Riemannian geometry of E is determined by the triads 
Ef, all Riemannian geometry operators - such as the area operator As associated 
with a 2-surface S or the volume operator Vr associated with a region R - are 
constructed from Ef. However, since even the classical quantities As and Vr are 
non-polynomial functionals of the triads, the construction of the corresponding As 
and Vr is quite subtle and requires a great deal of care. But their final expressions 
are rather simple [5]. 

In this regularization, the underlying background independence turns out to 
be a blessing. For, diffeomorphism invariance constrains the possible forms of the 
final expressions severely and the detailed calculations then serve essentially to fix 
numerical coefficients and other details. Let us illustrate this point with the ex- 
ample of the area operators As . Since they are associated with 2-surfaces S while 
the states are 1-dimensional excitations, the diffeomorphism covariance requires 
that the action of As on a state j must be concentrated at the intersections of 

S with 7. The detailed expression bears out this fact: the action of As on j is 
dictated simply by the spin labels ji attached to those edges of 7 which intersect 
S. For all surfaces S and 3-dimensional regions in E, As and Vr are densely 
defined, self-adjoint operators. All their eigenvalues are discrete [5]. Naively, one 
might expect that the eigenvalues would be uniformly spaced, given by, e.g., in- 
tegral multiples of the Planck area or volume. This turns out not to be the case; 
the distribution of eigenvalues is quite subtle. In particular, the eigenvalues crowd 
rapidly as areas and volumes increase. In the case of area operators, the complete 
spectrum is known in a closed form, and the first several hundred eigenvalues 
have been explicitly computed numerically. For a large eigenvalue a^, the sépara- 
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tion Attn = an+i — Cin between consecutive eigenvalues decreases exponentially: 
Aun < ipi exp Because of such strong crowding, the continuum ap- 

proximation becomes excellent quite rapidly just a few orders of magnitude above 
the Planck scale. At the Planck scale, however, there is a precise and very specific 
replacement. This is the arena of quantum geometry. The premise is that the stan- 
dard perturbation theory fails because it ignores this fundamental discreteness. 

There is however a key mathematical subtlety [1, 6]. This non-perturbative 
quantization has a one parameter family of ambiguities labeled by 7 > 0. This 7 
is called the Barbero-Immirzi parameter and is rather similar to the well-known 
^-parameter of QCD. In QCD, a single classical theory gives rise to inequivalent 
sectors of quantum theory, labeled by 6 . Similarly, 7 is classically irrelevant but 
different values of 7 correspond to unitarily inequivalent representations of the 
algebra of geometric operators. The overall mathematical structure of all these 
sectors is very similar; the only difference is that the eigenvalues of all geometric 
operators scale with 7. For example, the simplest eigenvalues of the area operator 
As in the 7 quantum sector is given by ^ 



«{j} = 87T74 i + 1) (1) 

/ 

where / = 1, ... AT for some N and each ji is a half-integer. Since the representa- 
tions are unitarily inequivalent, as usual, one must rely on Nature to resolve this 
ambiguity: Just as Nature must select a specific value of ^ in QCD, it must select a 
specific value of 7 in loop quantum gravity. With one judicious experiment - e.g., 
measurement of the lowest eigenvalue of the area operator As for a 2-surface S of 
any given topology - we could determine the value of 7 and fix the theory. Unfor- 
tunately, such experiments are hard to perform! However, we will see in Section 
4 that the Bekenstein-Hawking formula of black hole entropy provides an indirect 
measurement of this lowest eigenvalue of area for the 2-sphere topology and can 
therefore be used to fix the value of 7. 

3 Applications of quantum geometry: Big Bang 

Over the last two years, quantum geometry has led to some striking results of direct 
physical interest. The first of these concerns the fate of the big-bang singularity. 

Traditionally, in quantum cosmology one has proceeded by first imposing spa- 
tial symmetries such as homogeneity and isotropy - to freeze out all but a finite 

^In particular, the lowest non-zero eigenvalue of area operators is proportional to 7. This fact 
has led to a misunderstanding: in circles outside loop quantum gravity, 7 is sometimes thought 
of as a regulator responsible for discreteness of quantum geometry. As explained above, this is 
not the case; 7 is analogous to the QCD 9 and quantum geometry is discrete in every permissible 
7-sector. Note also that, at the classical level, the theory is equivalent to general relativity only 
if 7 is positive] if one sets 7 = 0 by hand, one can not recover even the kinematics of general 
relativity. Similarly, at the quantum level, setting 7 = 0 would lead to a meaningless theory in 
which all eigenvalues of geometric operators vanish identically. 
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number of degrees of freedom already at the classical level and then quantizing the 
reduced system in a standard fashion. In the simplest case, the basic variables of 
the reduced classical system are the scale factor a and matter fields (f). The sym- 
metries imply that space-time curvature goes as ~ 1/a^ and Einstein’s equations 
predict a big-bang, where the scale factor goes to zero and the curvature blows 
up. The key question is: Do these ‘pathologies’ disappear if we re-examine the sit- 
uation in the context of an appropriate quantum theory? In traditional quantum 
cosmologies, without an additional input, they do not. That is, typically, to resolve 
the singularity one either has to introduce matter with unphysical properties or 
introduce boundary conditions, e.g., by invoking new principles. 

In a series of seminal papers [7], Bojowald has shown that the situation 
in loop quantum cosmology is quite different: the underlying quantum geometry 
makes a qualitative difference very near the big-bang. In the standard procedure 
summarized above, the symmetry reduction removes all traces of the fundamental 
discreteness. The key idea in Bojowald’s analysis is to retain the essential features 
of quantum geometry. As a result, the scale factor operator à has discrete eigen- 
values. The continuum limit is reached rapidly. For example, the gap between an 
eigenvalue of â of ~ 1cm and the next one is less than 10“^^ ^pi! Nonetheless, 
near a ~ ^pi there are surprises and predictions of loop quantum cosmology are 
very different from those of traditional quantum cosmology. 

The first surprise occurs already at the kinematical level. Recall that, in 
the classical theory curvature is essentially given by 1/a^, and blows up at the 
big-bang. What is the situation in quantum theory? Denote the Hilbert space 
of spatially homogeneous, isotropic kinematical quantum states by Tim- A self- 
adjoint operator cufv corresponding to curvature can be constructed on Tim and 
turns out to be hounded from above. This is very surprising because Tim admits an 
eigenstate of the scale factor operator à with a discrete, zero eigenvalue! At first, 
it may appear that this could happen only by an artificial trick in the construction 
of c'urv and that this quantization can not possibly be right because it seems to 
represent a huge departure from the classical relation (curv) o? = 1. However, these 
concerns turn out to be misplaced. The procedure for constructing c'urv is natural 
and, furthermore, descends from the full theory. Let us examine the properties of 
c'urv. Its upper bound i/curv is finite but absolutely huge: 

256 1 _ 256 1 

or, about 10^^ times the curvature at the horizon of a solar mass black hole. The 
functional form of the upper bound is also illuminating. Recall that, in the case 
of an hydrogen atom, energy is unbounded from below classically but, thanks to 
h, we obtain a finite value, Eq = me^l2h^, in quantum theory. Similarly, Ucurv is 
finite because h is non-zero and tends to the classical answer as h tends to zero. At 
curvatures as large as i^curv, it is natural to expect large departures from claissical 
relations such as (curv) = 1. But is this relation recovered in the semi-classical 



( 2 ) 
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regime? The answer is in the affirmative. In fact it is somewhat surprising how 
quickly this happens. As one would expect, one can simultaneously diagonalize â 
and curv. If we denote their eigenvalues by and bn respectively, then ün-bn — 1 is 
of the order 10“^ at n = 100 and decreases rapidly as n increases. These properties 
show that, in spite of the initial surprise, the quantization procedure is viable. 
Furthermore, one can apply it also to more familiar systems such as a particle 
moving on a circle and obtain results which at first seem surprising but are in 
complete agreement with the standard quantum theory of these systems. 

Since the curvature is bounded above in the entire Hilbert space, one might 
hope that the quantum evolution may be well-defined right through the big-bang 
singularity. Is this in fact the case? The second surprise is that although the quan- 
tum evolution is close to that of the Wheeler- De Witt equation of standard quan- 
tum cosmology for large a, there are dramatic differences near the big-bang which 
makes it well defined even at the big-bang, without any additional input. As one 
might expect, the ‘evolution’ is dictated by the quantum scalar constraint opera- 
tor. To obtain this operator, Bojowald again follows, step by step, the procedure 
in the full theory introduced by Thiemann. Let us expand out the full quantum 
state as | ^ >= I where | n> are the eigenstates of the scale factor 

operator and (j) denotes matter fields. Then, the scalar constraint takes the form: 

^71*072+8 (0) (0) (0) -h/n'0n — 4 (0) 8 (0) ~ -^c/)0n(0) (^) 

where c^, . - • are fixed numerical coefficients, 7 the Barbero-Immirzi parameter 
and is the (well-defined) matter Hamiltonian. Primarily, being a constraint 
equation, (3) constrains the physically permissible However, if we choose 

to interpret the scale factor (more precisely, the square of the scale factor times 
the determinant of the triad) as a time variable, (3) can be interpreted as an 
‘evolution equation’ which evolves the state through discrete time steps. In a (large) 
neighborhood of the big- bang singularity, this ‘deparametrization’ is viable. For the 
choice of factor ordering used in the Thiemann regularization, one can evolve in the 
past through n = 0, i.e. right through the classical singularity. Thus, the infinities 
predicted by the classical theory at the big-bang are artifacts of assuming that the 
classical, continuum space-time approximation is valid right up to the big-bang. In 
the quantum theory, the state can be evolved through the big-bang without any 
difficulty. However, the classical space-time description fails near the big-bang; 
quantum evolution is well-defined but the classical space-time ‘dissolves’. 

The ‘evolution’ equation (3) has other interesting features. To begin with, 
the space of solutions is 16 dimensional. Can we single out a preferred solution by 
imposing a physical condition? One possibility is to impose pre-classicality, i.e., to 
require that the quantum state not oscillate rapidly from one step to the next at 
late times when we know our universe behaves classically. Although this is an extra 
input, it is not a theoretical prejudice about what should happen at (or near) the 
big-bang but an observationally motivated condition that is clearly satisfied by our 
universe. The coefficients • - Qn of (3) are such that this condition singles out 
a non-trivial solution uniquely. Another interesting feature is that the standard 
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Wheeler- De Witt equation is recovered if we take the limit 7^0 and n ^ 00 such 
that the eigenvalues of à take on continuous values. This is completely parallel to 
the limit we often take to coarse grain the quantum description of a rotor to ‘wash 
out’ discreteness in angular momentum eigenvalues and arrive at the classical de- 
scription. Prom this perspective, then, one is led to say that the most striking 
of the consequences of loop quantum gravity are not seen in standard quantum 
cosmology because it ‘washes out’ the fundamental discreteness of quantum geom- 
etry. Finally, very recently Bojowald has shown that the quantum geometry also 
affects the kinetic energy term in the matter Hamiltonian, modifying the ‘effective’ 
field equations in just the right manner to give rise an inflationary phase with a 
natural ‘graceful’ exit. This is a potentially important development whose physical 
ramifications are being worked out. 

The detailed calculations have revealed another surprising feature. Prior to 
these calculations, it was not clear how soon after the big-bang one can start 
trusting semi-classical notions and calculations. It would not have been surprising 
if we had to wait till the radius of the universe became, say, a few million times the 
Planck length. These calculations strongly suggest that a few tens of Planck lengths 
should suffice. This is fortunate because it is now feasible to develop quantum 
numerical relativity; with computational resources commonly available, grids with 
( 10 ^)^ points are hopelessly large but one with ( 100 )^ points are readily available. 

Remark: It is known that in non-isotropic case, very near the big-bang singu- 
larity the classical solution has the so-called BKL behavior which has features of 
chaos of dynamical systems. So, one might ask - as Thibaud Damour did during 
the conference - whether this has ramifications on the quantum theory. Prom the 
well understood chaotic systems, one knows that quantization is generally straight- 
forward and signatures of classical chaos appear only in the finer structures (such 
as the detailed properties of spectra of operators.) In this sense, the classical, intri- 
cate BKL behavior does not have much of an impact the basic, qualitative features 
of the quantum theory. (See A. Ashtekar and J. Pullin, Ann. Israel Phys. Soc., Vol 
9, pp 74-75 (1990).) 

4 Applications of Quantum Geometry: Black-holes 

Loop quantum cosmology illuminates dynamical ramifications of quantum geome- 
try but within the context of mini-superspaces. In this sub-section, I will discuss a 
complementary application where one considers the full theory but probes conse- 
quences of quantum geometry - the application of the framework to the problem 
of black hole entropy. This discussion is based on joint work with Baez, Corichi 
and Krasnov [ 8 ] which itself was motivated by earlier work of Krasnov, Rovelli and 
others. 

As explained in the Introduction, since mid-seventies, a key question in the 
subject has been: What is the statistical mechanical origin of the black hole en- 
tropy 5 bh = cihor/4^Pi? What are the microscopic degrees of freedom that ac- 
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count for this entropy? This relation implies that a solar mass black hole must 
have (exp 10^^) quantum states, a number that is huge even by the standards 
of statistical mechanics. Where do all these states reside? To answer these ques- 
tions, in the second Sakharov conference, Wheeler suggested the following heuristic 
picture, which he christened Tt from Bit’. Divide the black hole horizon in to el- 
ementary cells, each with one Planck unit of area, and assign to each cell two 
microstates, or one ‘bit’. Then the total number of states M is given hy M = 2^ 
where n = (uhor/^Pi) number of elementary cells, whence entropy is given by 
S = InA? ~ ahor- Thus, apart from a numerical coefficient, the entropy (‘It’) is ac- 
counted for by assigning two states (‘Bit’) to each elementary cell. This qualitative 
picture is simple and attractive. But the key open issue was: can these heuristic 
ideas be supported by a systematic analysis from first principles? Where do the 
‘elementary cells’ come from and why are why does each one carry just two states? 
Quantum geometry has supplied the required analysis.^ 

A systematic approach requires that we first specify the class of black holes 
of interest. Since the entropy formula is expected to hold unambiguously for black 
holes in equilibrium, most analyses were confined to stationary, eternal black holes 
(i.e., in 4-dimensional general relativity, to the Kerr-Newman family). From a 
physical viewpoint however, this assumption seems overly restrictive. After all, in 
statistical mechanical calculations of entropy of ordinary systems, one only has to 
assume that the given system is in equilibrium, not the whole world. Therefore, 
it should suffice for us to assume that the black hole itself is in equilibrium; the 
exterior geometry should not be forced to be time-independent. Furthermore, the 
analysis should also account for entropy of black holes which may be distorted 
or carry (Yang- Mills and other) hair. Finally, it has been known since the mid- 
seventies that the thermodynamical considerations apply not only to black holes 
but also to cosmological horizons. A natural question is: Can these diverse situa- 
tions be treated in a single stroke? Within the quantum geometry approach, the 
answer is in the affirmative. The entropy calculations have been carried out in the 
recently developed framework of ‘isolated horizons’ which encompasses all these 
situations. Isolated horizons serve as ‘internal boundaries’ whose intrinsic geome- 
tries (and matter fields) are time-independent, although space-time geometry as 
well as matter fields in the external space-time region can be fully dynamical. The 
zeroth and first laws of black hole mechanics have been extended to isolated hori- 
zons. Entropy associated with an isolated horizon refers to the family of observers 
in the exterior for whom the isolated horizon is a physical boundary that separates 
the region which is accessible to them from the one which is not. This point is es- 
pecially important for cosmological horizons where, without reference to observers, 
one can not even define horizons. States which contribute to this entropy are the 
ones which can interact with the states in the exterior; in this sense, they ‘reside’ 
on the horizon. 

^However, I should add that this ax:count does not follow chronology. Black hole entropy was 
computed in quantum geometry quite independently and the realization that the ‘It from Bit’ 
picture works so well was somewhat of a surprise. 
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In the detailed analysis, one considers space-times admitting an isolated hori- 
zon as inner boundary and carries out a systematic quantization. The quantum 
geometry framework can be naturally extended to this case. The isolated horizon 
boundary conditions imply that the intrinsic geometry of the quantum horizon is 
described by the so called U(l) Chern-Simons theory on the horizon. This is a 
well-developed, topological field theory. A deeply satisfying feature of the analysis 
is that there is a seamless matching of three otherwise independent structures: the 
isolated horizon boundary conditions, the quantum geometry in the bulk, and the 
Chern-Simons theory on the horizon. In particular, one can calculate eigenvalues 
of certain physically interesting operators using purely bulk quantum geometry 
without any knowledge of the Chern-Simons theory, or using the Chern-Simons 
theory without any knowledge of the bulk quantum geometry. The two theories 
have never heard of each other. Yet, thanks to the isolated horizon boundary 
conditions, the two infinite sets of numbers match exactly, providing a coherent 
description of the quantum horizon. 

In this description, the polymer excitations of the bulk geometry, each la- 
beled by a spin j/, pierce the horizon, endowing it an elementary area ajj given 
by (1). The sum Y^jdjj adds up to the total horizon area Uhor- The intrinsic ge- 
ometry of the horizon is flat except at these puncture, but at each puncture there 
is a quantized deficit angle. These add up to endow the horizon with a 2 -sphere 
topology. For a solar mass black hole, a typical horizon state would have 10 ^^ punc- 
tures, each contributing a tiny deficit angle. So, although the quantum geometry 
is distributional, it can be well approximated by a smooth metric. 

The counting of states can be carried out as follows. First one constructs a 
micro-canonical ensemble by restricting oneself only to those states for which the 
total area, angular momentum, and charges lie in small intervals around fixed val- 
ues Uhor, -Ihor, Qhor- usual in statistical mechanics, the leading contribution 

to the entropy is independent of the precise choice of these small intervals.) For 
each set of punctures, one can compute the dimension of the surface Hilbert space, 
consisting of Chern-Simons states compatible with that set. One allows all possible 
sets of punctures (by varying both the spin labels and the number of punctures), 
subject to the constraint that the total area ahor be fixed, and adds up the di- 
mensions of the corresponding surface Hilbert spaces to obtain the number Af of 
permissible surface states. One finds that the horizon entropy Shov is given by 

5hor ;= InV = — ^ + e>( — where 7„ = ( 4 ) 

7 «P1 ûhor VStt 

Thus, for large black holes, entropy is indeed proportional to the horizon area. This 
is a non- trivial result; for examples, early calculations often led to proportionality 
to the square- root of the area. However, even for large black holes, one obtains 
agreement with the Hawking- Bekenstein formula only in the sector of quantum 
geometry in which the Barbero-Immirzi parameter 7 takes the value 7 = 7^. Thus, 
while all 7 sectors are equivalent classically, the standard quantum field theory in 
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curved space-times is recovered in the semi-classical theory only in the jo sector 
of quantum geometry. It is quite remarkable that thermodynamic considerations 
involving large black holes can be used to fix the quantization ambiguity which 
dictates such Planck scale properties as eigenvalues of geometric operators. Note 
however that the value of 7 can be fixed by demanding agreement with the semi- 
classical result just in one case - e.g., a spherical horizon with zero charge, or a 

cosmological horizon in the de Sitter space-time, or, Once the value of 7 is 

fixed, the theory is completely fixed and we can ask: Does this theory yield the 
Hawking-Bekenstein value of entropy of all isolated horizons, irrespective of the 
values of charges, angular momentum, and cosmological constant, the amount of 
distortion, or hair. The answer is in the affirmative. Thus, the agreement with 
quantum field theory in curved space-times holds in all these diverse cases. 

Why does 7^ not depend on other quantities such as charges? This impor- 
tant property can be traced back to a key consequence of the isolated horizon 
boundary conditions: detailed calculations show that only the gravitational part 
of the symplectic structure has a surface term at the horizon; the matter sym- 
plectic structures have only volume terms. (Furthermore, the gravitational surface 
term is insensitive to the value of the cosmological constant.) Consequently, there 
are no independent surface quantum states associated with matter. This provides 
a natural explanation of the fact that the Hawking-Bekenstein entropy depends 
only on the horizon geometry and is independent of electro-magnetic (or other) 
charges. 

Finally, let us return to Wheeler’s Tt from Bit’. One can ask: what are the 
states that dominate the counting? Perhaps not surprisingly, they turn out to 
be the ones which assign to each puncture the smallest quantum of area (i.e., 
spin value j = ^), thereby maximizing the number of punctures. In these states, 
each puncture defines Wheeler’s ‘elementary cell’ and his two states correspond to 
whether the deficit angle is positive or negative. 

Remark: If one allows non- minimally coupled scalar fields, semi-classical con- 
siderations show that entropy of large black holes is no longer Uhor/4^Pi but has a 
contribution also from the scalar field. Recently, this case was analyzed in detail in 
collaboration with Alejandro Corichi. The non-minimal coupling changes the def- 
inition of momenta conjugate to the gravitational connection, causing interesting 
modifications in the expressions of the quantum geometry operators. Nonetheless, 
the delicate interplay between the Chern-Simons theory and bulk quantum geome- 
try is preserved and the quantum geometry calculation reproduces the modification 
in the entropy formula correctly. This is yet another indication of the robustness 
of the underlying framework. 

To summarize, quantum geometry naturally provides the micro-states re- 
sponsible for the huge entropy associated with horizons. In this analysis, all black 
holes and cosmological horizons are treated in an unified fashion; there is no re- 
striction, e.g., to near-extremal black holes. The sub- leading term has also been 
calculated and shown to be proportional to Inuhor- 
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5 Conclusion 

In the last three sections, I have summarized recent advances which have answered 
some of the long standing questions of quantum gravity raised in the Introduc- 
tion. Throughout the development of loop quantum gravity, unforeseen simplifica- 
tions have arisen regularly, leading to surprising solutions to seemingly impossible 
difficulties. Progress could occur because some of the obstinate problems which 
had slowed developments in background independent approaches, sometimes for 
decades, evaporated when ‘right’ perspectives were found. I will conclude with a 
few examples. 

• Up until the early nineties, it was widely believed that spaces of connections 
do not admit non-trivial diffeomorphism invariant measures. This would have made 
it impossible to develop a background independent approach. Quite surprisingly, 
such a measure could be found by looking at connections in a slightly more gen- 
eral perspective. It is simple, natural, and has just the right structure to support 
quantum geometry. This geometry, in turn, supplied some missing links, e.g., by 
providing just the right expressions that Ponzano-Regge had to postulate without 
justification in their celebrated, early work. 

• Fundamental discreteness first appeared in a startling fashion in the con- 
struction of the so-called weave states, which approximate a classical 3-geometry. 
In this construction, the polymer excitations were introduced as a starting point 
with the goal of taking the standard continuum limit. It came as a major surprise 
that, if one wants to recover a given classical geometry on large scales, one can 
not take this limit, i.e., one can not pack the polymer excitations arbitrarily close 
together; there is an in-built discreteness. 

• At a heuristic level, it was found that the Wilson loop functionals of a 
suitably defined connection around a smooth loop solve the notoriously difficult 
quantum scalar constraint automatically. No one expected to find such simple and 
natural solutions even heuristically. This calculation suggested that the action of 
the constraint operator is concentrated at ‘nodes’, i.e., intersections, which in turn 
led to strategies for its regularization. 

• As I indicated in some detail, unforeseen insights arose in the well-studied 
subject of quantum cosmology essentially by taking an adequate account of the 
quantum nature of geometry, i.e., by respecting the fundamental discreteness of 
the eigenvalues of the scale factor operator. Similarly, in the case of black holes, 
three quite distinct structures - the isolated horizon boundary conditions, the 
bulk quantum geometry and the surface Chern-Simons theory - blended together 
unexpectedly to provide a coherent theory of quantum horizons. 

Repeated occurrence of such ‘unreasonable’ simplifications suggest that the 
ideas underlying loop quantum gravity may have captured an essential germ of 
truth. 
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CP Violation and the CKM Matrix 

Martin Beneke 



Abstract. This lecture provides a general overview of CP violation, emphasizing CP 
violation in flavour-violating interactions, such as due to the Kobayashi-Maskawa 
mechanism. 



1 Introduction 

The breaking of CP symmetry (“CP violation”), the composition of parity and 
charge conjugation, is an interesting phenomenon for several reasons: 

a. CP violation together with CPT symmetry implies non- invariance of the 
microscopic equations of motion under motion-reversal. CP violation rather than C 
violation implies different physical properties of matter and antimatter. These two 
facts are of fundamental importance for our understanding of the laws of Nature, 
and they were perceived as revolutionary, when CP violation was discovered in 
1964 [1]. They were also important in the development of the fundamental theory 
of particles, since the observation of CP violation motivated some early extensions 
of the Standard Model as it was known at the time (1973), either by extending 
the Higgs sector [2] or by adding a third generation of quarks and leptons [3]. Na- 
ture has opted for the second possibility for certain and the Kobayashi-Maskawa 
mechanism of CP violation has become part of today’s Standard Model. Prom 
today’s perspective motion-reversal non- invariance and the distinction of matter 
and antimatter, though fundamental, appear no longer surprising and even “natu- 
ral” . What remains surprising, however, is the peculiar way in which CP violation 
occurs or rather does not occur in the Standard Model and its possible extensions. 

b. Assuming that the evolution of the universe began from a matter- 
antimatter symmetric state, CP violation is necessary [4] to generate the matter- 
antimatter asymmetric universe that one observes today. At the electroweak phase 
transition the Standard Model satisfies all the other necessary criteria for baryoge- 
nesis (baryon number violation, departure from thermal equilibrium) [5], but CP 
violation in the Standard Model is too weak to explain the observed baryon-to- 
photon ratio. With the above assumption on the initial condition of the cosmic 
evolution, our own existence provides evidence for a source of CP violation beyond 
the Standard Model. 

Electroweak baryogenesis has the attractive feature that it couples the re- 
quired new mechanisms of CP violation to the electroweak scale, therefore making 
them testable also in particle collider experiments. Nevertheless it now appears 
more likely that the matter-antimatter asymmetry is not related to the sources of 
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CP violation that one may observe at colliders. Two facts have contributed to this 
change of perspective: first, the lower limit on the masses of Higgs bosons has been 
increasing. A heavier Higgs boson implies a weaker (first-order) electroweak phase 
transition. As a consequence electroweak baryogenesis is already too weak over 
most of the parameter space of even the minimal supersymmetric extension of the 
Standard Model. Second, the observation of small neutrino masses through neu- 
trino oscillations is explained most naturally by invoking the seesaw mechanism, 
which in turn is most naturally realized by postulating massive neutrinos, which 
are singlets under the Standard Model gauge group. All three necessary conditions 
for the generation of lepton number are naturally realized in the decay of the mas- 
sive neutrino(s). Lepton number is then partially converted into baryon number 
via B+L- violating (but B-L conserving) sphaleron transitions [6]. While the lep- 
togenesis scenario is very appealing, the new sources of CP violation related to 
the Yukawa couplings of the heavy neutrinos occur at scales of order of the heavy 
neutrino mass, Mr ~ (10^^ — 10^®) GeV (needed to explain the small left-handed 
neutrino masses), and are not directly testable with collider experiments in the 
near future. For this reason, CP violation in the context of baryogenesis will not 
be discussed further in this talk. 

c. CP violation in the Standard Model is essentially an electroweak phe- 
nomenon originating from the Yukawa couplings of the quarks to the Higgs boson. 
This implies that probes of CP violation are indirect probes of the electroweak 
scale or TeV scale, complementary to direct probes such as the observation of 
Higgs bosons. This is probably the most important reason for the current interest 
in CP symmetry breaking: in addition to testing the Kobayashi-Maskawa mecha- 
nism of CP violation in the Standard Model, experiments directed at CP violation 
limit the construction of extensions of the Standard Model at the TeV scale. There 
is an analogy between CP symmetry and electroweak symmetry breaking. Both 
occur at the electroweak scale and for both the Standard Model provides a simple 
mechanism. However, neither of the two symmetry breaking mechanisms has been 
sufficiently tested up to now. Such tests may or may not confirm the Standard 
Model mechanisms but they may also provide answers to questions that concern 
the structure of the Standard Model in its entirety, such as the origin of the elec- 
troweak scale and the origin of flavour and CP violation. 

d. Leaving aside the matter-antimatter asymmetry in the universe as evidence 
for CP violation since this depends on a further assumption, CP violation has 
now been observed in the weak interactions of quarks in three different ways: 
in the mixing of the neutral kaon flavour eigenstates (e, 1964) [1]; in the decay 
amplitudes of neutral kaons (e'/e, 1999) [7, 8]; in the mixing of the neutral Bd 
meson flavour eigenstates (sin(2/^), 2001) [9, 10]. It will be seen below that these 
pieces of data together with others not directly related to CP violation suggest that 
the Kobayashi-Maskawa mechanism of CP violation is most likely the dominant 
source of CP violation at the electroweak scale. The latest piece of evidence also 
rules out that CP symmetry is an approximate symmetry. A consequence of this is 
that generic extensions of the Standard Model at the TeV scale, needed to explain 
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the stability of the electroweak scale, suffer from a CP fine-tuning problem since 
any such extension implies the existence of many new CP-violating parameters 
which have no generic reason to be small. Despite the apparent success of the 
standard theory of CP violation, the problem of CP and flavor violation therefore 
remains as mysterious as before. 

The three quantities listed above establish CP violation unambiguously, but 
since the observables depend on decays of mesons at low energy, interpreting these 
quantities in terms of CP-violating fundamental parameters of the theory often 
involves a very difficult strong interaction problem. Chiral perturbation theory 
and the heavy quark expansion provide analytic tools to address this problem for 
kaon and B meson decays, respectively. In addition, lattice QCD can contribute 
substantially to making the theoretical prediction for many (but not all) quantities 
relevant to CP violation more precise. 

2 CP violation in the Standard Model 

CP violation can occur in the Standard Model in three different ways: 

2.1 The 9 term 

The strong interactions could be CP-violating [11, 12, 13]. The topology of gauge 
fields implies that the correct vacuum is given by a superposition |^) = e^^^\n) 

of the degenerate vacua |n) in which pure gauge fields have winding number n. Cor- 
relation functions in the ^-vacuum can be computed by adding to the Lagrangian 
the term 

= ( 1 ) 

where 9 now represents a parameter of the theory. Physical observables can depend 
on 9 only through the combination det Ad, where M is the quark mass matrix. 
A non- zero value of 

9 = 9 + arg det M (2) 

violates CP symmetry. It also implies an electric dipole moment of the neutron 
of order 10~^^ 9 ecm. The non-observation of any such electric dipole moment 
constrains 9 < 10“^^ and causes what is known as the strong CP problem, since the 
Standard Model provides no mechanism that would require 9 to vanish naturally. 
The strong CP problem has become more severe with the observation of large CP 
violation in B meson decays since one now knows with more confidence that the 
quark mass matrix has no reason to be real a priori. 

There exist mechanisms that render ^ = 0 exactly or very small through 
renormalization effects. None of these mechanisms is convincing enough to provide 
a default solution to the problem. What makes the strong CP problem so difficult 
to solve is that one does not have a clue at what energy scale the solution should be 
sought. Strong CP violation is not discussed further in this talk (see the discussion 
in [14]). 




S74 



M. Beneke Ann. Henri Poincaré 



2.2 The neutrino mass matrix 

The Standard Model is an effective theory defined by its gauge symmetries and its 
particle content. CP violation appears in the lepton sector if neutrinos are massive. 
The leading operator in the effective Lagrangian is [15] 

^ (3) 

After electroweak symmetry breaking this generates a Majorana neutrino mass 
matrix with three CP-violating phases. One of these phases could be observed 
in neutrino oscillations, the other two phases only in observables sensitive to the 
Majorana nature of neutrinos. 

Unless the fij are extremely small, the scale A must be large to account for 
small neutrinos masses, which suggests that leptonic CP violation is related to very 
large scales. For example, the standard see-saw mechanism makes the fij depen- 
dent on the CP-violating phases in the heavy gauge-singlet neutrino mass matrix. 
As a consequence one may have interesting model-dependent relations between lep- 
togenesis, CP violation in lepton-flavour violating processes and neutrino physics, 
but since the observations are all indirect through low-energy experiments, one 
may at best hope for accumulating enough evidence to make a particular model 
particularly plausible. Such experiments seem to be possible, but not in the near 
future and for this reason leptonic CP violation is not discussed further here. It 
should be noted that there is in general no connection between CP violation in 
the quark and lepton sector except in grand unification models in which the two 
relevant Yukawa matrices are related. Even then further assumptions are necessary 
for a quantitative relation. 



2.3 The CKM matrix 



CP violation can appear in the quark sector of the Standard Model at the level of 
renormalizable interactions [3] . The quark Yukawa interactions read 

Cy = -yfjQ'iHd'^ - + h.c., (4) 

with Q' the left-handed quark SU(2)-doublets, u' and d! the right-handed SU(2) 
singlets and i = 1,2,3 the generation index. The complex mass matrices that 
arise after electroweak symmetry breaking are diagonalized by separate unitary 
transformations of the left- and right-handed up- and down-type fields. Only 
the combination 



VCKM = Ul^Ut = 



^ud ^us U/6 \ 

Ucd Uc. Uc6 , (5) 

Utd U. / 



referred to as the CKM matrix, is observable, since the charged current interactions 
now read 



\/2 sin 0\ 



-Ui'y>^[VcKM\ijdjWj + h.c.. 



(6) 
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Flavour and CP violation in the quark sector can occur in the Standard Model only 
through charged current interactions (assuming ^ = 0). With three generations 
of quarks, the CKM matrix contains one physical CP-violating pha^e. Any CP- 
violating observable in flavour-violating processes must be related to this single 
phase. The verification or, perhaps rather, falsification of this highly constrained 
scenario is the primary goal of many current B- and K-physics experiments. This 
type of CP violation is therefore discussed in some detail in later sections. 

For reasons not understood the CKM matrix has a hierarchical structure as 
regards transitions between generations. It is therefore often represented in the 
approximate form [16] 



/ 1 - AV2 



y AA ^(1 — p — irj) 



A AX^{p — irf) ^ 
1 - AV2 AA2 

-AA2 1 y 



(7) 



where A 0.224 and A, p, 77 are counted as order unity and corrections are of 
order A^. It is the great achievement of heavy quark theory of the 1990s to have 
determined \Vcb\^ ke. A, to the accuracy of a few percent, whereas determining 
p and 77 with this accuracy remains a challenge for this decade. The unitarity of 
the CKM matrix leads to a number of relations between rows and columns of the 
matrix. The one which is most useful for :B-physics is obtained by multiplying the 
first column by the complex conjugate of the third: 



Vudv:, + + Vidv:, = 0 . ( 8 ) 

If 77 7 ^ 0 (which implies CP violation) this relation can be represented as a triangle 
in the complex plane, called the unitarity triangle. See Figure 1, which also in- 
troduces some notation for the angles of the triangle that will be referred to later 



The hierarchy of the CKM matrix implies that CP violation is a small effect 
in the Standard Model. More precisely, CP-violating observables are either small 
numbers, or else they are constructed out of small numbers such as small branch- 
ing fractions of rare decays. The hierarchy of quark masses and mixing angles 
represents a puzzle, sometimes called the flavour problem, which will also not be 
discussed further in this talk. Typical attempts to solve the flavour problem focus 
on broken generation symmetries. 



3 Constraints on the Unitarity Triangle 

In the following I review the current constraints on (p, 77 ), the apex of the unitarity 
triangle, and the prospects for improving these constraints. (The definitions p/ p — 
fjjr] = 1 — A^/2 render the location of the apex accurate to order A^ [17] and will 
be used in the following.) It is not the purpose of this talk to go into the details of 
the theoretical calculations that contribute to these constraints. Recent summaries 
of the relevant lattice calculations can be found in [18, 19, 20]. 




S76 



M. Beneke 



Ann. Henri Poincaré 



(P,il) 




Figure 1: The unitarity triangle. 



3.1 CP-conserving observables 

The lengths of the three sides of the triangle are determined from CP-conserving 
observables. 

Semileptonic decays. \Vcb\ = 0.041 ± 0.002 sets the scale of the sides of the 
triangle and is determined from exclusive [21] and inclusive semileptonic B decays 
[22, 23]. Both methods rely on the heavy quark expansion. The current error on 
\Vcb\ is not a limiting factor in the determination of (p, 77), but it may become 
important for rare kaon decays, which depend on A to a high power. The inclusive 
method has probably reached its intrinsic limits [24]. Further improvement then 
depends on how well the B form factors can be computed with lattice 

QCD. 

The determination of | Vub \ uses semileptonic b ^ u decays and gives 



^ p2 = 



1 - AV2 

A 



Vcb ■ 



( 9 ) 



However, ~ 0.085 is currently known only within an error of about ±20%. 

\Vub\ can also be determined from inclusive or exclusive decays. The inclusive 
treatment would parallel that of \ Vcb\ if not the background from b ^ c transitions 
had to be suppressed. Distributions in various kinematic variables (lepton energy, 
hadronic invariant mass) have been considered at the price of a more complicated 
and uncertain theory. A cut on the leptonic and hadronic invariant mass avoids the 
kinematic region, where the heavy quark expansion (in local operators) is invalid, 
but the scale of the expansion is now around 2GeV rather than mb [25, 26, 27]. 
While the ultimate accuracy of this method is not known, it should be possible 
to halve the error on \Vub\- The exclusive determination of \Vub\ from B Mlu 
must rely on lattice QCD for the B M form factor. \Vub\ is extracted by 

comparing the lepton invariant mass spectrum at large q‘^ with the form factor in 
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this region, where it is computed most reliably at present. The exclusive method 
is not yet competitive, but it will certainly play an important role in an accurate 
determination of \Vub\ in the future. 

The 1 146 1 -constraint on (p, fÿ) is particularly important, since it is the only 
constraint that is based on a tree decay and therefore arguably insensitive to non- 
standard Model interactions. It constrains the parameters of the CKM matrix even 
in the presence of new physics and helps to define the Standard Model reference 
point. The error on |146l is also a major component of the error in the indirect 
determination of sin 20 from the global fit to (p, fj) discussed below. Its reduction 
would therefore sharpen the consistency test with the direct measurement of sin 20. 

BB mixing. In the Standard Model the BB mass difference is dominated by 
the top quark box diagram, proportional to \VtqV*0‘^ {q = d, s). This determines 
the length of the remaining side of unitarity triangle. The large maiss of the top 
quark implies that the mass difference can be calculated up to the matrix element 
of the local operator {qb)y-A{qb)y-A, conventionally parameterized by 
For AMsa one obtains 



= (0.83 ± 0.03) X (10) 

1 /2 

The use of this result is limited by an error of about ±15% on the quantity . 

Only lattice QCD can possibly improve upon this error. 

Since Vts is already determined by the unitarity of the CKM matrix, 
alone does not constrain the unitarity triangle further. However, the ratio 
(AMb^/AMbJ^/^ also determines ((1 — p)^ ± and involves only the ra- 

tio ^ = ÎBsB^b‘^ / fsdB^Bd ^ which is believed to be known from lattice QCD with 
an error (± 6 %) smaller than the error on each of the hadronic parameters individ- 
ually. The current lower limit on AM^^ then provides an important upper limit on 
the length of the relevant side of the unitarity triangle, which turns into the most 
important restriction on the upper limit for the angle 7 in the combined (p, p) fit. 

3.2 CP violation in kaon decays 

CP violation in mixing (indirect, 1964.) ■ Due to CP violation in KK mixing, the 
neutral kaon mass eigenstates are superpositions of CP-even and CP-odd com- 
ponents. The long-lived kaon state, Ki K 2 ± cKi, is predominantly CP-odd, 
but decays into two pions through its small CP-even component K\. The decay 
Kl ^ TTTT constituted the first observation of CP violation ever [1]. The quan- 
tity |e| = 2.27 • 10“^ (equal to |ë| in the standard phase convention to very good 
accuracy) has now been measured in many different ways. 

KK mixing is dominated by top and charm box diagrams. The long-distance 
contributions are encoded in the matrix element of a four- fermion operator similar 
to BB mixing, conventionally parameterized by Bk and computed in lattice QCD. 
There is currently an uncertainty of at least ±15% on this parameter. |e| determines 
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^ (1.3 ± 0.05 — p). This is the fourth and last constraint that enters the “standard” 
unitarity triangle fit. 

CP violation in decay (direct, 1999). CP- violating effects can also be seen in 
the interference of two decay amplitudes with different CKM phases. The double 
ratio 



T{Kl 7T^7T^)r{Ks — > 7r~^7T ) 

T{Ks 7r^7T^)r{KL ^ TT+TT-) 



1 — 6Re 




( 11 ) 



if different from unity, implies such an effect, since both ratios would equal |c|, 
if CP violation occurred only in mixing. The existence of this effect has been 
conclusively demonstrated by two experiments in 1999, following the first hints of 
a non- vanishing e'/c in 1992 [28]. The new results of 2001 have further clarified 
the situation, which is now summarized by [29, 30] 



_ f (15.3 ± 2.6) • 10-4 NA48 (97-99) 

€ “ \ (20.7 ± 2.8) • 10-4 KTeV (96/97). 



(12) 



The theory of e'/e is more complicated than that of any other quantity 
discussed so far. The short-distance contributions are many-fold, but have been 
worked out to next-to-leading order [31, 32]. The following, approximate represen- 
tation of the result, 



- == 16- lO""* 



1.2 - 10-4 



/ llOMeV V 
Vm,(2GeV) ) 



î(l/2) 



(1 — VIib) — 0.4 



^ mt y 
.165 GeV/ 



B. 



( 3 / 2 ) 



QCD penguin EW penguin 



(13) 



illustrates the difficulty that arises from a cancellation between strong and elec- 
troweak penguin contributions and the need to know the hadronic matrix elements 
Bi (X {7r7r\Oi\K), which involve a two-pion final state, accurately. Before 1999 it 
was commonly, though not universally, assumed that Be, 8 ~ 1 near their vacuum 
saturation value, and with the isospin breaking factor fl/e ~ 0.25, this gives only 
about 6 • 10-4. The experimental result has triggered a large theoretical activ- 
ity directed towards understanding better the hadronic matrix elements. Different 
approaches continue to disagree by large factors, but it appears now certain that 
serious matrix element calculations must in one way or another account for final 
state interactions of the two pions. Chiral perturbation theory combined with a 
large- A'c matching of the non-leptonic operators can probably go furthest towards 
this goal with analytic methods. The calculation reported in [33] finds en- 

hanced by a factor 1.55 through rescattering and this, together with a réévaluation 
of isospin-breaking, may account for the experimental result within theoretical un- 
certainties. 

It has also been demonstrated that, in principle, lattice QCD can settle the 
matrix element issue definitively, since K ^ tttt matrix elements computed in 
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a lattice of finite volume, can be matched to continuum, infinite-volume matrix 
elements including all information on rescattering [34, 35]. Due to the potential 
cancellations the matrix elements are needed with high precision. They are also 
needed soon, since further CP-violating observables that will be measured in the 
future will diminish the importance of ^'/e. 

Rare kaon decays (future). There exist several proposals to measure the very 
rare decays 'k^vv and ^ Tr^iyiÿ with expected branching fractions 

of about 7 • 10“^^ and 3 • 10“ respectively. The first of these decays is CP- 
conserving and constrains (p, fj) to lie on a certain ellipse in the (p, 7ÿ)-plane. The 
second decay is CP-violating and determines fj. The branching fractions are pre- 
dicted theoretically with high precision, so that these two kaon modes alone can 
in principle fix the shape of the unitarity triangle, or uncover inconsistencies with 
other constraints [36]. Two 7V+ — > events have in fact been observed [37] 

resulting in a branching fraction somewhat larger than expected but consistent 
with expectations within the experimental error. 

3.3 Summing up 

The four quantities \Vub/Vcb\, and \e\ are usually combined into a global fit 

of (p, fj). Different groups use different statistical methods, but since the dominant 
errors of all input quantities are theoretical, no sophisticated procedure can conceal 
the fact that there is a difficulty in quantifying such errors objectively. Currently 
the various procedures appear to give similar results when the same inputs are 
used. Figure 2 shows the result of one such global fit [38] . 

The four quantities are in remarkable agreement. This results in an indirect 
determination of the angles of the unitarity triangle, in particular 

sin(2/3) = 0.68 ±0.21, (14) 

7 = (58±24)± (15) 

As discussed above the precision of the indirect fit relies on the accuracy to which 
a few hadronic matrix elements are known. It is therefore clear that the future 
of the standard unitarity triangle fit (based on the four quantities above) is now 
entirely in the hands of lattice QCD (up to, perhaps, |I46|)- 

3.4 sin(2^) 

In 2001 CP violation has been observed also in B meson decays, more precisely 
in the interference of mixing and decay. Assume that both, and B^, can decay 
into a CP eigenstate /, call the amplitude of the former decay A, the latter À and 
define A = with 2/3 the phase of the BB mixing amplitude (standard 

phase convention). A B meson identified as B^ at time t = 0 can decay into / at 
a later time t either directly or indirectly through its B^ component acquired by 
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Figure 2: Summary of unitarity triangle constraints (excluding the direct measure- 
ment of sin(2/3) which is overlaid) [38]. 



mixing. If there is CP violation, the amplitude for the CP conjugate process will 
be different, resulting in a time-dependent asymmetry 



Acp{t) = 






T{B%t) 
2ImA 



1 + |A|" 



sm(AMBt) — 



f)^T{B%t) 
l-|Ap 



1 -h |A|^ 



-/) 

cos{AMBt). 



(16) 



In the special case that A is dominated by a single weak phase, A = (so 

that |A| = 1), the asymmetry is proportional to ±sin2{P-\-0w), the sign depending 
on the CP eigenvalue of /. 

The final state J/ipKs (and related ones) satisfies this special condition to 
the accuracy of a percent. Furthermore ôw ~ 0 for 6 ^ ccs. Hence the mixing- 
induced CP asymmetry in B ^ J/ij^K decay determines the BB mixing phase 
(relative to h ^ ccs), or sin(2/^) in the Standard Model, with little theoretical 
uncertainty [39, 40]. It determines the BB mixing phase also beyond the Standard 
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Model, since it is unlikely that the CKM-favoured b — > ccs transition acquires a 
large CP- violating phase from new flavour-changing interactions. 

The asymmetry is now precisely measured by the two B factories. The central 
values reported by both experiments have been increasing over the past year as 
the statistics of the experiments improved and now reads [9, 10] 



sin(2/5) = 



0.59 ±0.15 
0.99 ±0.15 



BaBar 

Belle, 



(17) 



yielding the world average sin(2;^) = 0.79 ± 0.10. The fact that this asymmetry is 
large and in agreement with the indirect determination of the angle (3 leads to two 
important conclusions on the nature of CP violation: 

• CP is not an approximate symmetry of nature (as could have been if CP 
violation in kaon decays were caused by some non-standard interactions). 

• the Kobayashi-Maskawa mechanism of CP violation is most likely the dom- 
inant source of CP violation at the electroweak scale. 



New flavour- changing interactions certainly could have affected BB mixing and 
could have been revealed first by the direct sin(2/3) measurement. If the CKM 
matrix were the only source of flavour-changing processes also in an extension of 
the Standard Model, then the new interactions that modify BB mixing also affect 
KK mixing. One then finds (if one adds an additional assumption that there are 
no new operators in the low-energy effective weak Hamiltonian) that only small 
modifications of the BB mixing phase (still related to the phase of Vtd in such 
models of “minimal flavour violation”) could have been possible, in particular 
sin(2/3) > 0.42 with a preferred range from 0.5 to 0.8 [41, 42, 43, 44]. Vice versa 
the observation of a very small or very large sin(2/^) would have implied a new 
mechanism of flavour violation with (probably) new CP phases. As discussed below 
in a more general context, this would have led to a CP problem. 



4 More on CP violation in B meson decays 

The search for CP violation will continue in many ways (B decays, D decays, K 
decays, electric dipole moments), but the Kobayashi-Maskawa mechanism predicts 
large effects only in B decays and very rare K decays. The primary focus of the 
coming years will be to verify relations between different observables predicted in 
the Kobayashi-Maskawa scenario and to search for small deviations. 

An example of this type is Ëa (f)K due to the penguin b sss transition 
at the quark level. In the Standard Model the time-dependent CP asymmetry 
of this decay is also proportional to sin (2/3) to reasonable (though not as good) 
precision. However, new interactions are more likely to affect the loop-induced 
penguin transition than the tree decay b ccs and may be revealed if the time- 
dependent asymmetry in Bd (j)K turns out to be different from that in Èd 
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J/il;K. However, if the difference is small, its interpretation requires that one 
controls the strong interaction effects connected with the presence of a small up- 
quark penguin amplitude with a different weak phase. This difficulty is of a very 
general nature in B decays. 

4.1 CP violation in decay 

The need to control strong interaction effects is closely related to the possibility of 
observing CP violation in the decay amplitude. The decay amplitude has to have 
at least two components with different weak phases, 

A{B f) = (18) 

If the strong interaction phases are also different, the partial width of the decay 
differs from that of its CP-conjugate, T{B f) j^T{B /). Many rare B decays 
are expected to exhibit CP violation in decay due to the interference of a tree and 
a sizable or even dominant penguin amplitude. The weak phase difference can only 
be determined, however, if the strong interaction amplitudes are known. This is 
also necessary for mixing- induced CP cisymmetries, if the decay amplitude is not 
dominated by a single term. 

There exist two complementary approaches to obtain the strong interaction 
amplitudes. The first employs a general parameterization of the decay amplitudes 
of a set of related decays, implementing SU(2)-isospin relations. The remaining 
strong interaction parameters are then determined from data (often also using 
SU(3) flavour symmetry and “little” further assumptions on the magnitudes of 
some amplitudes). Very often this needs difficult measurements. The second ap- 
proach attempts to calculate the strong interaction amplitudes directly from QCD 
with factorization methods also used in high-energy strong interaction processes. 
This approach makes essential use of the fact that the b quark mass is large. There 
is currently no theoretical framework that also covers 1 /m^,-corrections system- 
atically, so there is an intrinsic limitation to the accuracy that one can expect 
from this approach. Nonetheless, the additional information on the dynamics of 
the decay provided by this approach is important as long as data is sparse and 
will continue to be useful later on. 

4.2 The angle 7 

In the Standard Model the angle f3 is obtained accurately from the time-dependent 
CP asymmetry in Bd It remains to determine directly the angle 7, the 

phase of 

The preferred methods rely on decays with interference of 6 cüD (no phase) 
and b ucD (phase 7) transitions and their conjugates {D = d, s). These de- 
cays receive no penguin contributions and are arguably insensitive to new flavour- 
changing interactions. 7 can be extracted from either of the following decay classes, 
Bd{t) ^ D±7 tT [45], Bait) ^ DKs [46], K^Dcp [47], Bs{t) DfK^ 
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[48], since every one of them provides sufficiently many observables to eliminate 
all strong interaction parameters, thus providing nice illustrations of the first of 
the two approaches mentioned above. None of these strategies is simple to carry 
out experimentally, however, since they involve either small CP asymmetries, or 
small branching fractions, or disparate amplitudes, or rapid Bg oscillations. 

The possibility to determine 7 from decays with interference of 6 uUD 
(tree, phase 7) and h Dqq (penguin, phase 0 {D = s)^ j3 {D = d)) transitions 
has therefore been thoroughly investigated recently, in particular the decays B 
ttK. The branching fractions for these modes are of order 10~^ and have already 
measured with an error of ±(10 — 20)%, including first measurements of direct 
CP asymmetries (all compatible with zero). The drawback of these and related 
modes is that the amplitudes contain more strong interaction parameters than 
there are observables. SU(3) symmetry and the structure of the weak effective 
Hamiltonian allow one to construct a number of interesting bounds on 7 [49, 50], 
but a full understanding of these modes requires a calculation of the penguin- 
to-tree amplitude ratio including its strong rescattering phase. In the following I 
describe very briefly one such method. 

4.3 QCD factorization 

In the heavy quark limit the b quark decays into very energetic quarks (and glu- 
ons), which must recombine to form two mesons. Methods from the heavy quark 
expansion and soft-collinear factorization (“colour-transparency”) can the be ar- 
gued to imply a factorized form of the amplitude of a decay into two light mesons 
[51, 52]. Schematically, 

A{B M 1 M 2 ) = [ duT\u)^MA^) 

Jo 

+ Jd^dudvT“{i,U,v)^B{mMAv)<^MAu), (19) 

where is a form factor, denote light-cone distribution amplitudes and 

TUI 

are perturbative hard-scattering kernels, which also contain the strong rescat- 
tering phases. This result is valid up to 1/rrib corrections, some of which can be 
large. The extent to which the QCD factorization formalism can be of quantita- 
tive use is not yet fully known. The approach has been successful in explaining 
the universality of strong-interaction effects in class-I B D-\- light meson decays 
and understanding the non-universality in the corresponding class-II decays [52]. 
It also appears to account naturally for the magnitude of the ttK branching frac- 
tions, sometimes considered as unexpectedly large, but there is currently no test 
that would allow one to conclude that the computation of strong interaction phases 
which are either of order «s or 1 / rrn, is reliable in the case of penguin-dominated 
final states [53]. Such tests will be possible soon and the non-observation of di- 
rect CP violation at the current level of sensitivity already supports the idea that 
strong rescattering effects are suppressed. 
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Figure 3: 95% (solid), 90% (dashed) and 68% (short-dashed) confidence level con- 
tours in the (p, fj) plane obtained from a global fit to the CP averaged B ttK, tttt 
branching fractions, using the scanning method [38]. The darker dot shows the 
overall best fit, whereas the light dot indicates the best fit for the default param- 
eter set. The light-shaded region indicates the region preferred by the standard 
global fit, excluding the direct measurement of sin (2/3). 



Figure 3 shows the result of a global fit of (p, fj) to CP- averaged B ttK, tttt 
branching fractions with a QCD factorization computation used as an input [53]. 
The result is consistent with the standard fit based on meson mixing and Vub^ 
but shows a preference for larger 7 or smaller Vub^ If the estimate of the theory 
uncertainty (included in the curves in the Figure) is correct, non-leptonic decays 
together with \Vub\ from semileptonic decays already imply the existence of a CP- 
violating phase of Vub- 



4.4 Resume 

The B factories are already providing data on dozens of rare B decay modes. QCD 
calculations - though probably not very precise - will be necessary to interpret 
these data beyond “simple” quantities like the mixing- induced CP asymmetry in 
Bd J/il^K. The immediate future should be very interesting since the measure- 
ments of direct CP asymmetries at the few percent level and the mixing- induced 
CP asymmetry in Bd tt+tt" decay provide tests of the theoretical framework 
and further information on CP violation. Subsequent second generation B physics 
experiments will probably supply enough data to rely more and more on measure- 
ments and symmetries. Altogether the Kobayashi-Maskawa mechanism will be 
decisively and precisely tested, but on the way one can expect many discussions 
on hadronic physics, imagined and, perhaps, true new physics signals. 

Lattice calculations will continue to play an important role by making more 
precise the standard unitarity triangle fit. They could also provide some of the 
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non-perturbative quantities (form factors, light-cone-distribution amplitudes) that 
are needed in factorization-based calculations of non-leptonic decay amplitudes. It 
will be much harder for lattice QCD to make an impact on non-leptonic, exclusive 
decays directly, since inelastic rescattering dominates final-state interactions and 
there is currently no method that would allow one to compute this on the lattice. 

5 CP violation in extensions of the Standard Model 

The emerging success of the Kobayashi-Maskawa mechanism of CP violation is 
sometimes accompanied by a sentiment of disappointment that the Standard 
Model has not finally given way to a more fundamental theory. However, returning 
to the perspective of the year 1973, when the mechanism was conceived, one can 
hardly feel this way. After all, the Kobayashi-Maskawa mechanism predicted a new 
generation of particles on the basis of the tiny and obscure effect of CP violation 
m KK mixing. It then predicted relations between CP- violating quantities in K, 
D, .B-physics which a priori might be very different. The fact that it has taken 
nearly 30 years to assemble the experimental tools to test this framework does not 
diminish the spectacular fact that once again Nature has realized a structure that 
was concepted from pure reasoning. 

Nevertheless several arguments make it plausible that the Kobayashi- 
Maskawa mechanism is not the final word on CP violation. The strong and cosmo- 
logical CP problem (baryogenesis) continue to call for an explanation, probably 
related to high energy scales. There may be an aesthetic appeal to realizing the 
full Poincaré group as a symmetry of the Lagrangian, in which case CP and P 
symmetry breaking must be spontaneous. One of the strongest arguments is, how- 
ever, that the elect roweak hierarchy problem seems to require an extension of 
the Standard Model at the TeV scale. Generic extensions have more sources of 
CP violation than the CKM matrix. These have not (yet) been seen, suggesting 
that there is some unknown principle that singles out the CKM matrix as the 
dominant source of flavour and CP violation. In the following I give a rather col- 
loquial overview of CP violation in generic TeV scale extensions of the Standard 
Model. This is perhaps an academic catalogue, but it illustrates how restrictive 
the Kobayashi-Maskawa framework is. 

5.1 Extended Higgs sector 

Extending the Higgs sector by just a second doublet opens many new possibilities. 
The Higgs potential may now contain complex couplings, leading to Higgs bosons 
without definite CP parity, to CP violation in charged Higgs interactions, flavour- 
changing neutral currents, and CP violation in flavour-conserving interactions such 
as ttH and electric dipole moments. The Lagrangian could also be CP-conserving 
with CP violation occurring spontaneously through a relative phase of the two 
Higgs vacuum expectations values [2] . 
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Both scenarios without further restrictions already cause too much CP vio- 
lation and flavour-changing neutral currents, so that either the Higgs bosons must 
be very heavy or some special structure imposed. For example, discrete symme- 
tries may imply that up-type and down-type quarks couple to only one Higgs 
doublet, a restriction known as “natural flavour conservation” [54, 55], since it for- 
bids flavour- changing neutral currents (and also makes spontaneous CP violation 
impossible with only two doublets). With flavour conservation imposed, CP and 
flavour violation occurs through the CKM matrix, but in addition to the usual 
charged currents also in charged Higgs interactions. This is usually considered in 
the context of supersymmetry, since extended Higgs models suffer from the same 
hierarchy problem as the Standard Model. 

5.2 Extended gauge sector (left-right symmetry) 

Left-right-symmetric theories with gauge group SU(2 )l xSU(2)rxU(1)b-l are at- 
tractive [56] , because parity and CP symmetry can be broken spontaneously. The 
minimal model requires already an elaborate Higgs sector (with triplets in addi- 
tion to doublets) and suffers from the hierarchy problem. CP violation in the quark 
sector now occurs through left- and right-handed charged currents with their re- 
spective CKM matrices. But since all CP violation arises through a single phase 
in a Higgs vacuum expectation value there is now a conflict between suppressing 
flavour- changing currents and having this phase large enough to generate the CP 
violating phenomena already observed. The minimal left-right symmetric model 
with spontaneous CP violation is therefore no longer viable [57, 58]. 

5.3 Extended fermion sector 

The Standard model can be extended by an extra d-type quark with electric charge 
— 1/3 [59]. This quark should be a weak singlet in order not to conflict electroweak 
precision tests. After electroweak symmetry breaking, the down-quark mass ma- 
trix must be diagonalized by a unitary 4x4 matrix. The motivation for such 
an extension of the Standard Model may be less clear, in particular as there is 
no symmetry principle that would make the extra singlet quark naturally light. 
However, this theory provides an example in which the unitarity triangle does no 
longer close to a triangle, but is extended to a quadrangle: 

VudV:b + VcdV:t^VtdV,l + ^ =0 ( 20 ) 

« 8 • 10 “^ < 

The unitarity triangle “deflcit” Udb also determines the strength of tree- level 
flavour-changing Z boson couplings and is currently constrained hy BB mixing 
and rare decays to about a tenth of the length of a side of the triangle. (The 
corresponding coupling Uds is constrained much more tightly in the kaon system.) 
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This model could in principle still give large modifications of BB mixing and 
non-leptonic B decays, including CP asymmetries [60]. 

5.4 Supersymmetry 

The minimal supersymmetric standard model [61, 62] is arguably the most natural 
solution to the electroweak hierarchy problem, but it is not particularly natural 
in its most general form from the point of view of CP violation. The Lagrangian 
including the most general Lagrangian that breaks supersymmetry softly contains 
44 CP- violating constants of nature. (R-parity conservation is assumed.) One of 
them is the usual CKM phase which appears in charged current and chargino inter- 
actions. Three phases appear in flavour-conserving CP observables, 27 in flavour- 
and CP-violating quark-squark-gluino interactions (squark mass matrices and A- 
terms) and 13 in the (s)lepton sector. 

The flavour-conserving phases must be small to comply with the non- 
observation of electric dipole moments. An intriguing feature of supersymmetry is 
the existence of CP and flavour violation in strong interactions (gluinos, squarks). 
These interactions can be much stronger than the Standard Model weak interac- 
tions and to suppress them to a phenomenologically acceptable level, one has to 
assume that either (some of) the masses of superparticles are rather large, or that 
the squark mass matrices are diagonal in the same basis that also diagonalizes 
the quark mass matrices (alignment) or that the squarks have degenerate masses, 
in which case a generalization of the GIM mechanism suppresses flavour-changing 
couplings [63, 64]. Since almost all CP-violating phases of the minimal supersym- 
metric standard model originate from supersymmetry breaking terms, one must 
understand supersymmetry breaking to answer the question why CP and flavour 
violation are so strongly suppressed. There exist mechanisms which can naturally 
realize one or the other of the conditions listed above (for example supersymmetry 
breaking through gauge interactions [65]), but none of the mechanisms is somehow 
singled out. 

There is currently much activity aiming at constraining the flavour- and CP- 
violating couplings from the many pieces of data that become now available. In 
fact these couplings are so many-fold that the CP-violating effects observed in 
kaon and B meson decays can all be ascribed to them at the price of making the 
consistency of the Kobayashi-Maskawa mechanism appear accidental. The hope 
could be that eventually some pattern of restrictions on these small couplings 
is seen that could give a hint on the origin of supersymmetry breaking. It is 
also plausible to assume that strong flavour and CP violation is absent (or too 
small to observe) in supersymmetry. Neglecting also the flavour- conserving CP- 
violating effects, the CKM matrix is then the only effect of interest. The presence 
of additional particles with CKM couplings still implies modifications of meson 
mixing and rare decays, but these modifications are now much smaller and, in 
general (but excepting rare radiative decays), precise theoretical results are needed 
to disentangle them from hadronic uncertainties. 
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Whatever the outcome of the search for new CP violation may be, it will 
restrict the options for model building severely. The current data point towards a 
privileged standing of the CKM matrix. However, a theoretical rationale for this 
privileged standing is yet to be discovered. 

6 Conclusions 

I. The (expected) observation of large CP violation in B decays together with e'/e 
and the consistency of indirect determinations of the unitarity triangle imply that; 

- CP is not an approximate symmetry of nature - rather CP violation is rare 
in the Standard Model because of small flavour mixing. 

- the Kobayashi-Maskawa mechanism of CP violation in charged currents is 
probably the dominant source of CP violation at the electroweak scale. 

II. CP and electroweak symmetry breaking provide complementary motiva- 
tions to search for extensions of the Standard Model, but: 

- on the one hand, there exists no favoured candidate model for CP viola- 
tion beyond the Standard Model - rather there is a CP problem in many 
conventional extensions. 

- on the other hand, baryogenesis requires CP violation beyond the Standard 
Model, probably decoupled from CP violation observable at accelerators. 

III. The study of CP violation is at a turning point with many new exper- 
imental capabilities and new theoretical methods to interpret non-leptonic decay 
data. Perhaps the most important result of the near future, however, will be to 
find (or not find) the Bs mass difference AMb^ ~ 17.5 ps~^, confirming once more 
the Standard Model paradigm (or to put it into serious difficulty). 

Bibliographical Note 

This write-up has been originally prepared in January 2002 after a talk given 
at the 19th International Symposium on Lattice Field Theory (Lattice 2001) in 
Berlin, Germany, 19-24 Aug 2001, but did not appear in time for the proceedings. 
The review presented at TH2002 deviated slightly but not substantially from the 
contents of this write-up. The experimental data quoted here are changing and 
the numbers given reflect the status at the end of 2001. 
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Extended Abstract 

It is known that an arbitrary {m — 1) -dimensional Einstein space Xm-i possesses 
a Ricci flat metric on its m-dimensional cone C{Xm-i) of the form 

ds^ = dr^ + r^ds\^_^ , ( 1 ) 

where r is the radial coordinate, and the special holonomies on C{Xm-i) origi- 
nate from the “weak special holonomies” on Xm-i- To be more precise, the SU (n), 
5p(n), G 2 and Spin{7) holonomies on the cone C(Xm-i) are in one to one cor- 
respondence with the Sasaki- Einstein (m = 2n) , tri-Sasakian (m = 4n) , nearly 
Kdhler (m = 7) and weak G 2 (m = 8) structures on Xm-i, respectively. This fact 
is very useful to systematically construct special holonomy manifolds with coni- 
cal singularities, because the Einstein homogeneous spaces Xm-i = Gf H endowed 
with these geometrical structures are well known from the old days of Kaluza-Klein 
supergravity (SUGRA). 

On the other hand, there also exists extensive literature on the worldsheet 
approaches to these conical backgrounds in string theory. Early literature is for 
the conifold and A3-singularity and more recent studies are in the case of SU{n)- 
holonomies, emphasizing the role oiM — 2 Liouville theory and the holographically 
dual descriptions based on the (wrapped) NS5-brane geometry. All of these cases 
possess the worldsheet H = 2 superconformai symmetry and have been discussed 
from the viewpoints of the “non-compact extensions” of Gepner models. There are 
also several related results from the spacetime view points, but with the RR-flux 
at infinity. 

While these constructions in the case of the N — 2 supersymmetry have 
been rather successful, it is difficult to construct string vacua on the conical back- 
grounds with Spin{7) and G 2 holonomies, which possess at most the J\f = 1 
worldsheet SUSY. Partial attempts to construct the string vacua of Spin{7) and 
G 2 holonomies with conical singularities have been given in [1, 2]. General struc- 
ture of string theory on manifolds with Spin(7) and G 2 was discussed in [3] in 
particular from the point of view of the existence of extended chiral algebras. 
There are also several results [4] for the GET constructions of compact G 2 and 
Spin{7) manifolds based on the geometrical method of Joyce. 




S94 



Tohru Eguchi, Yuji Sugawara and Satoshi Yamaguchi Ann. Henri Poincaré 



The main purpose of this paper is to give a systematic way of constructing 
special holonomy manifolds with conical singularities based on the solvable J\f = 1 
SCFT’s, which may be regarded as a natural generalization of the construction in 
the Àf = 2 category mentioned above. Our strategy is quite simple: We formally 
replace an Einstein homogeneous space X = G/H by an M = \ supercoset CFT 
Ai = (G X SO{n)) /H, (n = dim G — d\mH) based on the affine Lie algebra of 
G and H . SO{n) stands for n free fermions. We then add the M — \ Liouville 
sector in place of the radial degrees of freedom. We may call our construction 
as the “CFT cone” as opposed to the original geometrical cone construction. Of 
course, one should keep in mind that the coset CFT (WZW model) of G/H is 
not identical to the non-linear cr- model with the target manifold G/H because of 
the presence of NS B field in WZW models. Nevertheless, as we will see in the 
following, taking the supercoset CFT associated with the Einstein homogeneous 
space provides a very good anzats for the superstring vacua of special holonomy. 
We completely classify these coset constructions (at least for the cosets G/H with 
compact simple groups G), which include the models found in [1, 2] as well as 
many of the vacua in the M — 2 category presented in [5, 6, 7, 8, 9, 10]. Among 
other things, we will find that our CFT cone approach leads to the right amount of 
worldsheet and spacetime SUSY’s as expected from geometrical grounds in many 
examples. 
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Kaon Decays and the Flavour Problem 



Gino Isidori 



Abstract. After a brief introduction to the so-called flavour problem, we discuss 
the role of rare K decays in probing the mechanism of quark-flavour mixing. Par- 
ticular attention is devoted to the formulation of the Minimal Flavour Violation 
hypothesis, as a general and natural solution to the flavour problem, and to the 
fundamental role oî K —>■ decays in testing this scenario. 



1 Introduction: the flavour problem 

Despite the Standard Model (SM) provides a successful description of particle 
interactions, it is natural to consider it only as the low-energy limit of a more 
general theory, or as the renormalizable part of an effective field theory valid up to 
some still undetermined cut-off scale A. Since the SM is renormalizable, we have 
no direct indications about the value of A. However, theoretical arguments based 
on a natural solution of the hierarchy problem suggest that A should not exceed 
a few TeV. 

One of the strategies to obtain additional clues about the value of A is to 
constrain (or find evidences) of the eflFective non-renormalizable interactions, sup- 
pressed by inverse powers of A, which encode the presence of new degrees of free- 
dom at high energies. These operators should naturally induce large effects in 
processes which are not mediated by tree- level SM amplitudes, such as AF = 1 
and AF = 2 flavour-changing neutral current (FCNC) transitions. Up to now there 
is no evidence of these effects and this implies severe bounds on the effective scale 
of dimension-six FCNC operators. For instance the good agreement between SM 
expectations and experimental determinations of mixing leads to bounds 

above 10^ TeV for the effective scale of A5 = 2 operators, i.e. well above the few 
TeV range suggested by the Higgs sector. 

The apparent contradiction between these two determinations of A is a mani- 
festation of what in many specific frameworks (supersymmetry, techincolour, etc.) 
goes under the name of flavour problem: if we insist with the theoretical prejudice 
that new physics has to emerge in the TeV region, we have to conclude that the 
new theory possesses a highly non-generic flavour structure. Interestingly enough, 
this structure has not been clearly identified yet, mainly because the SM, i.e. 
the low-energy limit of the new theory, doesn’t possess an exact flavour symmetry. 
Then we should learn this structure from data, using the experimental information 
on FCNCs to constrain its form. 
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Figure 1: Allowed regions for the reduced Wolfenstein parameters [2] p and fj 
(68% and 95% contours) compared with the uncertainty bands for IKibl/IKbl, 
6k, the limit on and acp{B J/'^Ks) (from Ref. [3]). 



Recently the flavour problem has been considerably exacerbated by the new 
precise data of .B-factories, which show no sizable deviations from SM expectations 
also in Bd Bd mixing and in a clean AB = 1 FCNC processes such as B ^ 

One could therefore doubt about the need for new tests of the SM in the sector of 
(quark) flavour physics. However, there are at least two arguments why the present 
status cannot be considered conclusive and a deeper study of FCNCs is still very 
useful: 

• The information used at present to test the CKM mechanism [1] and, in 
particular, to constrain the unitary triangle, is obtained only from charged 
currents (i.e. from tree-level amplitudes) and AB = 2 loop-induced processes 
(see Fig. 1). In principle, rare K and B decays mediated by AF = 1 FCNCs 
could also be used to extract indirect information on the unitary triangle, 
or to constrain new-physics effects. However, with the exception of the B ^ 
Xs7 rate, the quality of this information is very poor at present, either 
because of experimental difficulties or because of theoretical problems. Since 
new physics could affect in a very different way AF = 2 and AF = 1 loop- 
induced amplitudes [e.g. with 0(100%) effects in the former and 0(10%) in 
the latter], it is mandatory to improve the quality of the AF = 1 information. 

• The most reasonable (but also most pessimistic) solution to the flavour prob- 
lem is the so-called Minimal Flavour Violation (MFV) hypothesis. Within 
this framework, which will be discussed in detail in the next section, flavour- 
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and CP- violating interactions are linked to the known structure of Yukawa 
couplings also beyond the SM. This implies that deviations from the SM in 
FCNC amplitudes rarely exceed the 0(10%) level, or the level of irreducible 
theoretical errors in most of the presently available observables. Moreover, 
theoretically clean quantities such as acp(^ ^ J/'^Ks) and 
which measure only ratios of FCNC amplitudes, turn out to be insensitive to 
new-physics effects. Within this framework the need for additional clean and 
precise information on FCNC transitions is therefore even more important. 
As we shall discuss in the following, the measurements of T{K ttvp) would 
offer a unique opportunity in this respect (see Ref. [4] and references therein 
for a more extensive discussion). 

2 The Minimal Flavour Violation hypothesis 

The pure gauge sector of the SM is invariant under a large symmetry group 
of flavour transformations: Gp = SU(3)^ C) SU(3)| C) where SU(3)g = 

SU(3)q^ O SU(3)t/^ (g) SU(3)i)^, SU(3)^ = SU(3 )l^ SU(3)^^ and three of the 

five U{1) charges can be identified with baryon number, lepton number and hy- 
percharge [5]. This large group and, particularly the SU(3) subgroups controlling 
flavour-changing transitions, is explicitly broken by the Yukawa interaction 

Cy = QlYdDrH + QlYuUrH, + LlYeErH + h.c. (1) 

Since Cy? is broken already within the SM it would not be consistent to impose it as 
an exact symmetry of the additional degrees of freedom present in SM extensions: 
even if absent a the tree-level, the breaking of Cf would reappear at the quantum 
level because of the Yukawa interaction. The most restrictive hypothesis we can 
make to protect the breaking of Cf in a consistent way, is to assume that Yp 
and Ye are the only source of G F-breaking also beyond the SM. 

To implement and interpret this hypothesis in a natural way, we can assume 
that Cf is indeed a good symmetry, promoting the Y to be dynamical fields with 
non-trivial transformation properties under Gp- 

Yf/ ~ (3,3, 1 )su(3)F Uj ~ (3, 1,3)su(3)F ~ (3, 3)su(3)F 

If the breaking of Cf occurs at very high energy scales - well above the TeV region 
where the new degrees of freedom necessary to stabilize the Higgs sector appear - 
at low-energies we would only be sensitive to the background values of the Y, i.e. 
to the ordinary SM Yukawa couplings. Employing the effective-theory language, 
we then define that an effective theory satisfies the criterion of Minimal Flavour 
Violation if all higher-dimensional operators, constructed from SM and Y fields, 
are invariant under CP and (formally) under the flavour group Gp [5, 6]. 

According to this criterion one should in principle consider operators with 
arbitrary powers of the (adimensional) Yukawa fields. However, a strong simplifi- 
cation arises by the observation that all the eigenvalues of the Yukawa matrices are 
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Minimally flavour violating 
dimension six operator 


main 

observables 


A [TeV] 

- + 




€k, AniB^ 


6.4 5.0 


eH^ {D ij,uQ l) 




5.2 6.9 




B (x)ee, K TTj/p, {w)(£ 


3.1 2.7 * 


{QlXfc^^Ql)(HUD^H) 


B {X)£i, K TTW, (7t)« 


1.6 1.6 * 



Table 1: 99% CL bounds on the scale of representative dimension-six operators in 
the MFV scenario [6]. The constraints are obtained on the single operator, with 
coefficient drl/A^ (-f or — denote constructive or destructive interference with the 
SM amplitude). The * signals the cases where a significant increase in sensitivity 
can be achieved by future measurements of rare decays. 



small, but for the top one, and that the off-diagonal elements of the CKM matrix 
(Vij) are very suppressed. It is then easy to realize that, similarly to the pure SM 
case, the leading coupling ruling all FCNC transitions with external down- type 
quarks is 



(AFc)y = 







i=j. 



(3) 



As a result, within this framework the bounds on the scale of dimension-six FCNC 
effective operators turn out to be much less severe than in the general case (see 
table 1). 

The idea that the CKM matrix rules the strength of FCNC transitions also 
beyond the SM has become a very popular concept in the recent literature and 
has been implemented and discussed in several works (see e.g. Refs. [7]). However, 
it is worth to stress that the CKM matrix represent only one part of the prob- 
lem: a key role in determining the structure of FCNCs is also played by quark 
masses, or by the Yukawa eigenvalues. In this respect the above MFV criterion 
provides the maximal protection of FCNCs (or the minimal violation of flavour 
symmetry), since the full structure of Yukawa matrices is preserved. We finally 
emphasize that, contrary to other approaches, the above MFV criterion is based 
on a renormalization-group-invariant symmetry argument and is completely inde- 
pendent of specific new-physics framework. 



3 K ^ TTuu decays 

The s dvD transition is one of the rare examples of weak processes whose lead- 
ing contribution starts at 0{G‘p). At the one- loop level it receives contributions 
only from Z-penguin and IT-box diagrams, as shown in Fig. 2, or from pure quan- 
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Figure 2: One- loop diagrams contributing to the s duiy transition. 



turn electroweak effects. Separating the contributions to the one-loop amplitude 
according to the intermediate up- type quark running inside the loop, we can write 

A(s^di^i>)= ~ I OiXmp +iO{\^mi) (4) 

q=u,c,t C?(AAqç 0 ) (q=u) 

where Vij denote the elements of the CKM matrix. The hierarchy of these ele- 
ments would favour up- and charm-quark contributions; however, the hard GIM 
mechanism of the perturbative calculation implies Aq ~ leading to a 

completely different scenario. As shown on the r.h.s. of (4), where we have em- 
ployed the standard CKM phase convention {^Vus = ^Vud = 0) and expanded 
the Vij in powers of the Cabibbo angle (A = 0.22), the top-quark contribution 
dominates both real and imaginary parts. This structure implies several interest- 
ing consequences for A{s — > duû): it is dominated by short-distance dynamics, 
therefore its QCD corrections are small and calculable in perturbation theory; it 
is very sensitive to Vtd, which is one of the less constrained CKM matrix elements; 
it is likely to have a large CP-violating phase; it is very suppressed within the SM 
and thus very sensitive to possible new sources of quark-flavour mixing. 

Short-distance contributions to ^ di^9), both within the SM and within 
MFV models, can efficiently be described by means of a single effective dimension- 
6 operator: = si^f^dL . Within the SM both next-to- leading-order 

(NLO) QCD corrections [8, 9] and 0{G^pm^) electroweak corrections [10] to the 
Wilson coefficient of have been calculated. The simple structure of leads 
to two important properties of K — ^ tti/û decays: 

• The relation between partonic and hadronic amplitudes is exceptionally ac- 
curate, since hadronic matrix elements of the s^^d current between a kaon 
and a pion can be derived by isospin symmetry from the measured Kis rates. 



• The lepton pair is produced in a state of definite CP and angular momentum, 
implying that the leading SM contribution to Kl is CP-violating. 
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The dominant theoretical error in estimating the 'k^vv rate is due to 

the subleading, but non-negligible charm contribution. Perturbative NNLO cor- 
rections in the charm sector have been estimated to induce an error in the total 
rate of around 10% [8], which can be translated into a 5% error in the determi- 
nation of \Vtd\ from B[K^ Non- perturbative effects introduced by the 

integration over charmed degrees of freedom have been discussed in Ref. [11]: a 
precise estimate of these contributions is not possible, but they are expected to 
be within the error of NNLO terms. Genuine long-distance effects associated to 
light-quark loops have been shown to be much smaller [12]. 

The case oîK^ ^ is even cleaner from the theoretical point of view [13]. 
Because of the CP structure, only the imaginary parts in (4) -where the charm 
contribution is absolutely negligible- contribute to A{K 2 Thus the dom- 

inant direct-CP- violating component of A{Kl tt^z/z/) is completely saturated by 
the top contribution, where QCD corrections are suppressed and rapidly conver- 
gent. Intermediate and long-distance effects in this process are confined only to the 
indirect- CP- violating contribution [14] and to the CP-conserving one [15], which 
are both extremely small. Taking into account the isospin- breaking corrections to 
the hadronic matrix element [16], we can write an expression for the ^ tt^z/z/ 
rate in terms of short-distance parameters, namely 



B{Kl 



= 4.16 X 10 



10 ^ 


■ mt{mt) ■ 


2.30 






167 GeV 




A5 



(5) 



which has a theoretical error below 3%. 

The high accuracy of the theoretical predictions of B{K^ and 

B{Kl 'K^vv) in terms of modulus and phase of Xt = Vts^td clearly offers the 
possibility of very interesting tests of flavour dynamics. Within the SM, a mea- 
surement of both channels would provide, two independent pieces of information 
on the unitary triangle, or a complete determination of p and fj from AS = 1 
transitions. As illustrated in Fig. 2, the high sensitivity of these two modes to 
short-distance dynamics makes them extremely efficient probes of MFV scenarios. 

At present the SM predictions of the two K iivv rates are not extremely 
precise owing to the limited knowledge of A^. Taking into account all the indirect 
constraints in a global Gaussian fit, the allowed range reads [17, 18] 

^ = (0-72 ± 0.21) X 10~^° (6) 

B{Kl = (0.28 ± 0.10) X 10"^^ (7) 



The search for processes with missing energy and branching ratios below 
10“^^ is definitely a very difficult challenge, but has been proved not to be im- 
possible: two 7T+z/z> candidate events have been observed by the BNL-E787 

experiment [19]. The branching ratio inferred from this result. 



B{K^ 7T+Z/P) - (1.57 ^ x 10“^° 



(8) 
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Figure 3: Comparison of the effectiveness of different rare modes in setting future 
bounds on the scale of the representative operator {QL^FCjfxQL){LLjijLL) within 
MFV models [6] . The vertical axis indicates the relative precision of an hypothetic 
measurement of the rate, with central value equal to the SM expectation. The 
curves in the two panels are obtained assuming an uncertainty of 10% (left) or 1% 
(right) on the corresponding overall CKM factor. 



is consistent with SM expectations, although the error does not allow precision 
tests of the model yet. In a few years this result should be substantially improved 
by the BNL-E949 experiment, whose goal is to collect about 10 events (at the SM 
rate). In the longer term, a high-precision result on this mode will arise from the 
CKM experiment at Fermilab, which aims at a measurement of B{K^ 
at the- 10% level [20]l 

Uhforbunately the progress concerning the neutral mode is much slower. No 
dedicated Experiment has started yet (contrary to the case) and the best direct 
limit is more than four orders of magnitude above the SM expectation [21]. An 
indirect model-independent upper bound on T{Kl can be obtained by 

the isospin relation [22] 

T{K~^ 'k'^vv) — T{Kl TT^i^ü) + r(AT5 ^ (9) 

which is valid for any s dvv local operator of dimension < 8 (up to small 
isospin- breaking corrections). Using the BNL-E787 result (8), this implies B{Kl 
TT^ uü) < 1.7 X 10“^ (90% CL). Any experimental information below this figure can 
be translated into a non-trivial constraint on possible new-physics contributions 
to the s dvv amplitude. In a few years this goals should be reached by E931a at 
KEK: the first Kl dedicated experiment. The only approved experiment 

that could reach the SM sensitivity on Kl is KOPIO at BNL, whose goal 

is a SES of 10“^^, or the observation of about 50 signal events (at the SM rate) 
with signal/ background ~ 2 [20]. 
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4 Other Rare K decays 

As far ais theoretical cleanliness is concerned, K nuü have essentially no com- 
petitors. FCNC transitions where the neutrino pair is replaced by a charged- lepton 
pair have a very similar short-distance structure. However, in these processes the 
size of long-distance contributions is usually much larger because of electromag- 
netic interactions. Only in few cases (mainly in CP-violating observables) we can 
extract the interesting short-distance information with reasonable accuracy (see 
Ref. [20, 23] and references therein). 

K — ^ The GIM mechanism of the s d'y* amplitude is only logarithmic 

[24]. As a result, the K — > 7T7* — 7r£~^£~ amplitude is completely dominated 
by long-distance dynamics and provides a large contribution to the CP-allowed 
transitions — » 7t'^£'^£~ and Ks tt^£'^£~ [25]. Rate and form factor of the 

charged mode have been measured with high accuracy by BNL-E865 [26]. However, 
this information is not sufficient to predict the rate of the neutral mode. The latter 
is usually parameterized as B{Ks vr^e^e”) = 5 x 10“^ x where as is a low- 
energy free parameter expected to be 0(1) [27]. The present experimental bound 
B(Ks — > TT^e^e~) < 1.4 x 10~^ [28] is still one order of magnitude above the most 
optimistic expectations, but a measurement or a very stringent bound on \as\ will 
soon arise from the A'5-dedicated run of NA48 and/or from the KLOE experiment 
at Frascati. 

Apart from its intrinsic interest, the determination of B(Ks Tr^e'^e") has 
important consequences on the Kl 'n^e^e~ mode. Here the long-distance part 
of the single-photon exchange amplitude is forbidden by CP invariance and the 
sensitivity to short-distance dynamics in enhanced. The direct-CP-violating part 
of the Kl ^ tt^£'^£~ amplitude is conceptually similar to the one oi Kl 
it is calculable with high precision, being dominated by the top-quark contribution 

[29] , and is highly sensitive to non-standard dynamics. This amplitude interfere 
with the indirect- CP-violating contribution induced by Kl~Ks mixing, leading to 
[27] 

B{Kl 7r°e+e-)cpv = 10“^^ x 

where the ± depends on the relative sign between short- and long-distance contri- 
butions, and cannot be determined in a model-independent way. Given the present 
uncertainty on B(Ks 7T^e'^e~), at the moment we can only set a rough upper 
limit of 5.4 x 10“^® on the sum of all the CP-violating contributions to this mode, 
to be compared with the direct limit of 5.6 x 10”^^ obtained by KTeV at Fermilab 

[30] , 

An additional contribution to 7 t^£'^£ decays is generated by the CP- 

conserving long-distance processes Kl 7 t^ 7*7* — >■ tt^£'^£~ [31]. This amplitude 
does not interfere with the CP-violating one, and recent NA48 data on Kl 7 t ^77 






( 10 ) 
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(at small dilepton invariant mass) indicate that it is very suppressed, with an 
impact on B{Kl — ^ 7r^e^e“) below the 10“^^ level [32]. 

At the moment there exist no definite plans to improve the KTeV bound on 
B{Kl 7T^e+e~). The future information on B{Ks — >■ 7T^e‘^e“) will play a crucial 
role in this respect: if as were in the range that maximizes the interference effect 
in (10), we believe it would be worthwhile to start a dedicated program to reach 
sensitivities of 10“^^. 



Kl Both Kl e~^e~ decays are dominated by the 

two- photon long-distance amplitude Kl 7*7* . The absorptive part of 

the latter is determined to good accuracy by the two-photon discontinuity and is 
calculable with high precision in terms of the A/, 77 rate. On the other hand, 

the dispersive contribution of the two-photon amplitude is a source of considerable 
theoretical uncertainties. 

In the Kl ^ mode the dispersive integral is dominated by a large 

infrared logarithm [~ ln(m|^/mg)], the coupling of which can be determined in a 
model-independent way from V{Kl 77). As a result, T{Kl e‘^e“) can be esti- 
mated with good accuracy but is almost insensitive to short-distance dynamics [33] . 

The Kl ^ mode is certainly more interesting from the short-distance 

point of view. Here the two-photon long-distance amplitude is not enhanced by 
large logs and is almost comparable in size with the short-distance one, sensitive 
to ^\t [8] . Actually short- and long-distance dispersive parts cancel each other to 
a good extent, since the total Kl rate (measured with high precision by 

BNL-E871 [34]) is almost saturated by the absorptive two- photon contribution. 





ccntrn] 

range 



Cl. mtciA-alK 



‘ It' VVli t limil 



cumruJ va I Ilf 



Figure 4: Left: present constraints in the p-ff plane from rare K decays only (the 
small dark region close to the origin denotes the combined result of B-physics 
observables). Right: p-fj constraints excluding observables sensitive to Bd~Bd mix- 
ing [17]. 

The accuracy on which we can bound the two-photon dispersive integral 
determines the accuracy of possible bounds on A partial control of the Kl 
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7 * 7 * form factor, which rules the dispersive integral, can be obtained by means 
of Kl and Kl ^ e^e~ spectra; additional constraints can also 

be obtained from model-dependent hadronic ansatze and/or perturbative QCD 
[35, 36]. Combining these inputs, significant upper bounds on ‘RXt (or lower bounds 
on p) have recently been obtained [34, 37]. The reliability of these bounds has still 
to be fully investigated, but some progress can be expected in the near future. In 
particular, the extrapolation of the form factor in the high-energy region, which 
so far requires model-dependent assumptions, could possibly be limited by means 
of lattice calculations. 

5 Beyond the MFV hypothesis 

To conclude this discussion about kaon physics, we summarize in Fig. 4 (left) the 
present impact of rare K decays in constraining the p-p plane. As can be noted, the 
bounds from these modes are substantially less precise than those from 5-physics. 
As a result, we are still far from precision tests of the main MFV prediction: the 
universality of non-standard effects in 6 ^ d, 6 ^ s and s d FCNC transitions 
[6]. On the other hand, the comparison is already quite non-trivial concerning non- 
MFV scenarios. In particular, present rare-A"-decay constrains put severe bounds 
on realistic scenarios with large new sources of flavour mixing in s ^ d transitions 
(see e.g. Ref. [38]). 

Interestingly enough, non-MFV models with 0{1) effects in s ^ d, b ^ 
s, and even b ^ d FCNC transitions are still far from being excluded. As an 
example, in Fig. 4 (right) we show the result of a fit allowing arbitrary new- 
physics contributions to Bd~Bd mixing. As can be noted, all remaining constraints 
are in good agreement; however, the large central value of tends 

to flavour a CKM structure rather different from the standard case [17]. This 
indication is not statistically significant yet, but it provides a good illustration 
of the main points of this discussion: there is still a lot to learn about FCNC 
transitions and the measurements of K ^ niyi? rates provide a unique opportunity 
in this respect. 
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We explain how the string spectrum in flat space and plane waves arises from 
the large N limit of U(N) J\f = i super Yang Mills. We reproduce the spectrum by 
summing a subset of the planar Feynman diagrams. We also describe some other 
aspects of string propagation on plane wave backgrounds. This talk is based on 
[40]. 

1 Introduction 

The fact that large N gauge theories have a string theory description was believed 
for a long time [4] . These strings live in more than four dimensions [5] . One 
of the surprising aspects of the AdS/CFT correspondence [8, 7, 9, 6] is the fact 
that for A/” = 4 super Yang Mills these strings move in ten dimensions and are 
the usual strings of type IIB string theory. The radius of curvature of the ten 
dimensional space goes as R/lg ~ The spectrum of strings on AdS^ x 

corresponds to the spectrum of single trace operators in the Yang Mills theory. 
The perturbative string spectrum is not known exactly for general values of the ’t 
Hooft coupling, but it is certainly known for large values of the ’t Hooft coupling 
where we have the string spectrum in flat space. In these notes we will explain 
how to reproduce this spectrum from the gauge theory point of view. In fact we 
will be able to do slightly better than reproducing the flat space spectrum. We 
will reproduce the spectrum on a plane wave. These plane waves incorporate, in a 
precise sense, the first correction to the flat space result for certain states. 

The basic idea is the following. We consider chiral primary operators such as 
Tr[Z'^] with large J. This state corresponds to a graviton with large momentum 
. Then we consider replacing some of the Zs in this operator by other fields, 
such as (/), one of the other transverse scalars. The position of (j) inside the operator 
will matter since we are in the planar limit. When we include interactions (p can 
start shifting position inside the operator. This motion of (p among the Zs is 
described by a field in 1+1 dimensions. We then identify this field with the field 
corresponding to one of the transverse scalars of a string in light cone gauge. This 
can be shown by summing a subset of the Yang Mills Feynman diagrams. We will 
present a heuristic argument for why other diagrams are not important. 
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Since these results amount to a “derivation” of the string spectrum at large 
’t Hooft coupling from the gauge theory, it is quite plausible that by thinking 
along the lines sketched in this paper one could find the string theory for other 
cases, most interestingly cases where the string dual is not known (such as pure 
non-supersymmetric Yang Mills). 

These notes are organized as follows. In section two we describe various as- 
pects of plane waves. We discuss particle and string propagation on a plane wave 
as well as their symmetries. In section three we describe how plane waves arise 
from Penrose limits of various spacetimes, concentrating mostly on AdS^ x 5^. In 
section 4 we describe the computation of the spectrum from the J\f = 4 Yang Mills 
point of view. 



2 Plane waves 



We will be interested in the following plane wave solution of IIB supergravity [19] 
ds^ = —4:dx^dx~ — y‘^{dx^)‘^ + dy^dy'^ . (1) 

We also have a constant field strength 

F = dx* {dyidy2dy3dy4, + dy^dyedyjdyg) (2) 



String propagation on this background can be solved exactly, by choosing light 
cone gauge in the Green-Schwarz action [2, 30]. The lightcone action becomes 



S = 



27Ta' 



dt 

J Jo 



7ra'\p- I 



da 



- ^2/'^ - + ixI)S 



(3) 



where I = and S' is a Major ana spinor on the worldsheet and a positive chi- 

rality SO (8) spinor under rotations in the eight transverse directions. We quantize 
this action by expanding all fields in Fourier modes on the circle labeled by a. For 
each Fourier mode we get a harmonic oscillator (bosonic or fermionic depending 
on the field). Then the light cone Hamiltonian is 



2p = = Hu 



+ OD 

E 



Nr, 



n— — oo 



yr + 



(q'Ip- 1/2)2 



(4) 



Here n is the label of the fourier mode, n > 0 label left movers and n < 0 right 
movers. 7V„ denotes the total occupation number of that mode, including bosons 
and fermions. Note that the ground state energy of bosonic oscillators is canceled 
by that of the fermionic oscillators. The constraint on the momentum in the sigma 
direction reads 

oo 

P= Y, niV„ = 0 



n= — oo 



(5) 
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In the limit that /i is very small, fia'\p-\ 1, we recover the flat space 
spectrum. It is also interesting to consider the opposite limit, where 

liap^ > 1 . (6) 

This limit corresponds to strong tidal forces on the strings. It corresponds to strong 
curvatures. In this limit all the low lying string oscillator modes have almost the 
same energy. This limit corresponds to a highly curved background with RR fields. 
In fact we will later see that the appearance of a large number of light modes is 
expected from the Yang-Mills theory. 

3 Plane waves as Penrose limits. 

Penrose showed that plane waves can be obtained as limits of various backgrounds 
[21] . Here we first consider a specific case and then we will say something about 
the general case. 

3.1 Type IIB plane wave from AdS^ x 

In this subsection we obtain the maximally supersymmetric 
IIB string theory as a limit of AdS^ x . 

The idea is to consider the trajectory of a particle that 
along the and to focus on the geometry that this particle 
the AdS^ X metric written as 

ds^ = \^dt^ cosh^ p -h dp^ -h sinh^ pd^l\ -h d'lp^ cos^ 0 H- dO^ + sin^ (7) 

We want to consider a particle moving along the ijj direction and sitting at p = 0 
and ^ = 0. We will focus on the geometry near this trajectory. We can do this 
systematically by introducing coordinates and then performing the 

rescaling 

, x~ = R?x~ , 0 = ^ , R-^ 00 ( 8 ) 

R R 

In this limit the metric (7) becomes 

ds^ = —4:dx^dx~ — (r ^ + y ^){dx^Ÿ -\- dy^ -\- dr ^ (9) 

where y and f parametrize points on R"^ . We can also see that only the compo- 
nents of F with a plus index survive the limit. The mass parameter p can be intro- 
duced by rescaling (8) x~ x~ j p and x^ px^ . These solutions where studied 
in [19]. 

It will be convenient for us to understand how the energy and angular mo- 
mentum along scale in the limit (8). The energy in global coordinates in AdS 



plane wave of type 

is moving very fast 
sees. We start with 
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is given hy E = idt and the angular momentum by J = This angular mo- 

mentum generator can be thought of as the generator that rotates the 56 plane 
of R^. In terms of the dual CFT these are the energy and R-charge of a state of 
the field theory on x R where the has unit radius. Alternatively, we can say 
that E — A is the conformal dimension of an operator on We find that 



+ 

1 

II 

1 

CM 


= id^+ = id^+ = i(dt + d^) = A - J 




(10) 


+ 

II 

1 

1 




A + J 


(11) 


R? 



Configurations with fixed non zero p_ in the limit (8) correspond to states in 
AdS with large angular momentum J ^ R^ It is useful also to rewrite 

(4) in terms of the Yang Mills parameters. Then we find that the contribution of 
each oscillator to A — J is 

(A - = .... = + (12) 

Notice that gN / J‘^ remains fixed in the gN oo limit that we are taking. 

When we perform the rescalings (8) we can perform the limit in two ways. If 
we want to get the plane wave with finite string coupling then we take the N ^ oo 
limit keeping the string coupling g fixed and we focus on operators with J ~ 
and A — J fixed. 

On the other hand we could first take the ’t Hooft limit ^ > 0, gN = fixed, 

and then after taking this limit, we take the limit of large ’t Hooft coupling keeping 
J/y/gN fixed and A — J fixed. Taking the limit in this fashion gives us a plane 
wave background with zero string coupling. Since we will be interested in these 
notes in the free string spectrum of the theory it will be more convenient for us to 
take this second limit. 

From this point of view it is clear that the full supersymmetry algebra of the 
metric (7) is a contraction of that of AdS^ x [19]. This algebra implies that 

> 0 . 

4 Strings from A/^ = 4 Super Yang Mills 

After taking the ’t Hooft limit, we are interested in the limit of large ’t Hooft 
coupling gN oo. We want to consider states which carry parametrically large 
R charge J ~ y/gN. ^ This R charge generator, J, is the SO (2) generator rotating 
two of the six scalar fields. We want to find the spectrum of states with A — J 
finite in this limit. We are interested in single trace states of the Yang Mills theory 
on X R^ or equivalently, the spectrum of dimensions of single trace operators 
of the euclidean theory on R"^. We will often go back and forth between the states 
and the corresponding operators. 

^ Since we first took the ’t Hooft limit then giant gravitons are not important. 
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Let us first start by understanding the operator with lowest value of A— J == 0. 
There is a unique single trace operator with A — J = 0, namely Tr[Z‘^], where 
Z = (f)^ + i(j)^ and the trace is over the N color indices. We are taking J to be 
the SO (2) generator rotating the plane 56. At weak coupling the dimension of 
this operator is J since each Z field has dimension one. This operator is a chiral 
primary and hence its dimension is protected by supersymmetry. It is associated 
to the vacuum state in light cone gauge, which is the unique state with zero light 
cone hamiltonian. In other words we have the correspondence 



1 



TrlZ-’] 



|0,P+)(.c. 



(13) 



We have normalized the operator as follows. When we compute 



{Tr{Z^]{x)Tr[Z^]{0)) 



we have J possibilities for the contraction of the first Z but then planarity implies 
that we contract the second Z with a Z that is next to the first one we contracted 
and so on. Each of these contraction gives a factor of N. Normalizing this two 
point function to one we get the normalization factor in (13).^ 

Now we can consider other operators that we can build in the free theory. We 
can add other fields, or we can add derivatives of fields like , where 

we only take the traceless combinations since the traces can be eliminated via the 
equations of motion. The order in which these operators are inserted in the trace is 
important. All operators are all “words” constructed by these fields up to the cyclic 
symmetry, these were discussed and counted in [5] . We will find it convenient to 
divide all fields, and derivatives of fields, that appear in the free theory according 
to their A — J eigenvalue. There is only one mode that has A — J = 0, which is 
the mode used in (13). There are eight bosonic and eight fermionic modes with 
A — J = 1. They arise as follows. First we have the four scalars in the directions 
not rotated by J, i.e. <j)\ i = 1,2, 3, 4. Then we have derivatives of the field Z, 
DiZ = diZ + [Ai,Z], where i = 1,2, 3, 4 are four directions in Finally there 
are eight fermionic operators x^t-i which are the eight components with J = h 

of the sixteen component gaugino x (fhe other eight components have J = ~|)- 
These eight components transform in the positive chirality spinor representation 
of 5'0(4) X *90(4). We will focus first on operators built out of these fields and 
then we will discuss what happens when we include other fields, with A - J > 1, 
such as Z. 

The state (13) describes a particular mode of ten dimensional supergravity 
in a particular wavefunction [9] . Let us now discuss how to generate all other 
massless supergravity modes. On the string theory side we construct all these states 
by applying the zero momentum oscillators Uq, i — 1, . . . , 8 and Sq, b = 1, ... 8 

^In general in the free theory any contraction of a single trace operator with its complex 
conjugate one will give us a factor of N'^ , where n is the number of fields appearing in the 
operator. 
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on the light cone vacuum |0,p+)/.c.. Since the modes on the string are massive 
all these zero momentum oscillators are harmonic oscillators, they all have the 
same light cone energy. So the total light cone energy is equal to the total number 
of oscillators that are acting on the light cone ground state. We know that in 
AdSs X all gravity modes are in the same supermultiplet as the state of the form 
(13) M. Gunaydin and N, Marcus, Class, Quant, Grav. 2 , Lll (1985). H. J. Kim, 
L. J. Romans and P. van Nieuwenhuizen, Phys. Rev, D 32 , 389 (1985). The same 
is clearly true in the limit that we are considering. More precisely, the action of all 
supersymmetries and bosonic symmetries of the plane wave background (which are 
intimately related to the AdS^ x symmetries) generate all other ten dimensional 
massless modes with given p_. For example, by acting by some of the rotations of 
that do not commute with the SO (2) symmetry that we singled out we create 
states of the form 












jYJ/2+1/2 






(14) 



where (jf , r = 1,2, 3, 4 is one of the scalars neutral under J. In (14) we used the 
cyclicity of the trace. Note that we have normalized the states appropriately in 
the planar limit. We can act any number of times by these generators and we 
get operators roughly of the form ^Tr[- • • z(t)'^ z • • • z(f)^]. where the sum is over 
all the possible orderings of the </>s. We can repeat this discussion with the other 
A — J = 1 fields. Each time we insert a new operator we sum over all possible 
locations where we can insert it. Here we are neglecting possible extra terms that 
we need when two A — J = 1 fields are at the same position, these are subleading in 
a 1/J expansion and can be neglected in the large J limit that we are considering. 
We are also ignoring the fact that J typically decreases when we act with these 
operators. In other words, when we act with the symmetries that do not leave Z 
invariant we will change one of the Zs in (13) to a field with A — J = 1, when we 
act again with one of the symmetries we can change one of the Zs that was left 
unchanged in the first step or we can act on the field that was already changed 
in the first step. This second possibility is of lower order in a 1/J expansion and 
we neglect it. We will always work in a “dilute gas” approximation where most of 
the fields in the operator are Zs and there are a few other fields sprinkled in the 
operator. 

For example, a state with two excitations will be of the form 



r\j 



]^J/2+ 



1=0 



(15) 



where we used the cyclicity of the trace to put the (f)^ operator at the beginning of 
the expression. We associate (15) to the string state a}^Sl ^|0,p+)/.c.. Note that for 
planar diagrams it is very important to keep track of the position of the operators. 
For example, two operators of the form Tr[<fA Z^cfP Z'^~^] with different values of I 
are orthogonal to each other in the planar limit (in the free theory). 
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The conclusion is that there is a precise correspondence between the super- 
gravity modes and the operators. This is of course well known [7, 9, 6]. Indeed, we 
see from (4) that their A — J = — is indeed what we compute at weak coupling, 
as we expect from the BPS argument. 

In order to understand non-supergravity modes in the bulk it is clear that 
what we need to understand the Yang Mills description of the states obtained by 
the action of the string oscillators which have n ^ Q. Let us consider first one of 
the string oscillators which creates a bosonic mode along one of the four directions 
that came from the 5^, let’s say We already understood that the action of 
Uq ^ corresponds to insertions of an operator on all possible positions along 
the “string of Z’s”. By a “string of Zs” we just mean a sequence of Z fields one 
next to the other such as we have in (13). We propose that corresponds to the 
insertion of the same field but now with a position dependent phase 



1 

7j 



j 



E 



1 



y^jV-^/2+1/2 



Tr[Z‘ 



I J^4 rvj — h 



(16) 



In fact the state (16) vanishes by cyclicity of the trace. This corresponds to the 
fact that we have the constraint that the total momentum along the string should 
vanish (5), so that we cannot insert only one oscillator. So we should insert 
more than one oscillator so that the total momentum is zero. For example we can 
consider the string state obtained by acting with the ^ and which has zero 
total momentum along the string. We propose that this state should be identified 
with 



4®a-n|0>P+)(.c. < ► ^E 









(17) 



where we used the cyclicity of the trace to simplify the expression. The general 
rule is pretty clear, for each oscillator mode along the string we associate one of 
the A — J = 1 fields of the Yang-Mills theory and we sum over the insertion 
of this field at all possible positions with a phase proportional to the momentum. 
States whose total momentum is not zero along the string lead to operators that are 
automatically zero by cyclicity of the trace. In this way we enforce the Lq — Lq = 0 
constraint (5) on the string spectrum. 

In summary, each string oscillator corresponds to the insertion of a A — J = 1 
field, summing over all positions with an n dependent phase, according to the rule 



at* 


-^DiZ 


for 2 = 1, • • 


■A 


(18) 


ott 




for j = 5, • • 


•,8 


(19) 


ga 


'^—2 




(20) 



In order to show that this identification makes sense we want to compute the 
conformal dimension, or more precisely A — J, of these operators at large ’t Hooft 
coupling and show that it matches (4). First note that if we set ^ 0 in (12) we 
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find that all modes, independently of n have the same energy, namely one. This is 
what we find at weak ’t Hooft coupling where all operators of the form (17) have 
the same energy, independently of n. Expanding the string theory result (12) we 
find that the first correction is of the form 



= + ( 21 ) 

This looks like a first order correction in the ’t Hooft coupling and we can 
wonder if we can reproduce it by a a simple perturbative computation. 

In order to compute the corrections it is useful to view the A7 = 4 theory as 
an jV = 1 theory. As an A7 = 1 theory we have a Yang Mills theory plus three 
chiral multiplets in the adjoint representation. We denote these multiplets as 
where i — 1, 2, 3. We will often set Z — and W = W^. The theory also has a 
superpotential 

w ~ gYMTr{W*W^W'^)eijk ( 22 ) 

The potential for the Yang Mills theory is the sum of two terms, V = Vf + Vd, 
one coming from F terms and the other from D- terms. The one coming from F 
terms arises from the superpotential and has the form 

Vf ~ (23) 

On the other hand the one coming from D terms has the form 

Vd ~ ^Tr([PCS W%W^, W^\) (24) 




Figure 1: Diagrams that come from F terms. The two diagrams have a relative 
minus sign. The F terms propagator is a delta function so that we could replace 
the three point vertex by a four point vertex coming from Figure 1 . If there are 
no phases in the operator these contributions vanish. 

We will concentrate in computing the contribution to the conformal dimen- 
sion of an operator which contains a W insertion along the string of Zs. There are 
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various types of diagrams. There are diagrams that come from D terms, as well as 
from photons or self energy corrections. There are also diagrams that come from 
F terms. The diagrams that come from F terms can exchange the W with the Z. 
The F term contributions cancel in the case that there are no phases, see Figure 
1. This means that all other diagrams should also cancel, since in the case that 
there are no phases we have a BPS object which recieves no corrections. All other 
one loop diagrams that do not come from F terms do not exchange the position of 
IT, this means that they vanish also in the case that there are phases since they 
will be insensitive to the presence of phases. In the presence of phases the only 
diagrams that will not cancel are then the diagrams that come from the F terms. 
These are the only diagrams that give a momentum, n, dependent contribution. 

In the free theory, once a W operator is inserted at one position along the 
string it will stay there, states with IT’s at different positions are orthogonal to 
each other in the planar limit (up to the cyclicity of the trace). We can think of 
the string of Zs in (13) as defining a lattice, when we insert an operator W at 
different positions along the string of Zs we are exciting an oscillator b\ at the site 
I on the lattice, / = 1, • • • J. The interaction term (23) can take an excitation from 
one site in the lattice to the neighboring site. So we see that the effects of (23) will 
be sensitive to the momentum n. In fact one can precisely reproduce (21) from 
(23) including the precise numerical coefficient. Below we give some more details 
on the computation. 

We will write the square of the Yang-Mills coupling in terms of what in AdS 
is the string coupling that transforms as p — > 1/g under S-duality. The trace is 
just the usual trace of an A" x A' matrix. 

We define Z = and similarly for W. Then the propagator is 

normalized as 

(25) 

In (23) there is an interaction term of the form the form ^ f d"^xTr([Z, W][Z, W]), 
where W is one of the (complex) transverse scalars, let’s say W = VFb The 
contribution from the F terms shown in (23) give 



<O(x)O*(0) >= 



A/* 

^|2A 



27T77 

1 + A(47t^)(— 2 + 2 cos -j-)I{x) 



where A/" is a normalization factor and I{x) is the integral 



I{x] 



(4tI-2)2 J 



d y 



1 



1 



y'^(x — yY dTT^ 



log |:r|A -h finite 



(26) 



(27) 



We extracted the log divergent piece of the integral since it is the one that reflects 
the change in the conformal dimension of the operator. 
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In conclusion we find that for large J and N the first correction to the 
correlator is 



< O(:r)O*(0) >= 

which implies that the contribution of the operator W inserted in the “string of 
Zs” with momentum n gives a contribution to the anomalous dimension 






( 28 ) 



= 1 + P9) 

which agrees precisely with the first order term computed from (21). 

There are similar computations we could do for insertions of W or the 
fermions Xj=i/ 2 - fermions the important interaction term will 

be a Yukawa coupling of the form 'ÿ^z[Z'x\ + xF^lZ, x]. We have not done these 
computations explicitly since the 16 supersymmetries preserved by the state (13) 
relate them to the computation we did above for the insertion of a W operator. 

The full square root in (12) was recently obtained in the beautiful paper 
by Santambrogio and Zanon, A. Santambrogio and D. Zanon, “Exact anoma- 
lous dimensions of N = 4 Yang-Mills operators with large R charge,” arXiv:hep- 
th/0206079. 

In summary, the “string of Zs” becomes the physical string and each Z carries 
one unit of J which is one unit of — . Locality along the worldsheet of the string 
comes from the fact that planar diagrams allow only contractions of neighboring 
operators. So the Yang Mills theory gives a string bit model (see [39]) where each 
bit is a Z operator. Each bit carries one unit of J which is one unit of — p_ . 

The reader might, correctly, be thinking that all this seems too good to be 
true. In fact, we have neglected many other diagrams and many other operators 
which, at weak ’t Hooft coupling also have small A — J. In particular, we considered 
operators which arise by inserting the fields with A — J = 1 but we did not 
consider the possibility of inserting fields corresponding toA — J = 2,3,..., such 
as Z, dk(j>^ ^ ^(/^/c)Z, etc. The diagrams of the type we considered above would 
give rise to other 1-hl dimensional fields for each of these modes. These are present 
at weak ’t Hooft coupling but they should not be present at strong coupling, since 
we do not see them in the string spectrum. We believe that what happens is that 
these fields get a large mass in the N oo limit. In other words, the operators 
get a large conformal dimension. One can compute the first correction to the 
energy (the conformal weight) of the of the state that results from inserting Z 
with some “momentum” n. In contrast to our previous computation for A — J = 
1 fields we find that besides an effective kinetic term as in (21) there is an n 
independent contribution that goes âs g N with no extra powers of 1/ [40] . This 

is an indication that these excitations become very massive in the large gN limit. 
In addition, we can compute the decay amplitude of Z into a pair of cf) insertions. 
This is also very large, of order gN. 
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Though we have not done a similar computation for other fields with A — J > 
1, we believe that the same will be true for the other fields. In general we expect 
to find many terms in the effective Lagrangian with coefficients that are of order 
gN with no inverse powers of J to suppress them. In other words, the lagrangian 
of Yang-Mills on acting on a state which contains a large number of Zs gives 
a lagrangian on a discretized spatial circle with an infinite number of KK modes. 
The coefficients of this effective lagrangian are factors of gN^ so all fields will 
generically get very large masses. 

The only fields that will not get a large mass are those whose mass is protected 
for some reason. The fields with A — J — 1 correspond to Goldstone bosons and 
fermions of the symmetries broken by the state (13). Note that despite the fact 
that they morally are Goldstone bosons and fermions, their mass is non-zero, due 
to the fact that the symmetries that are broken do not commute with the 
light cone Hamiltonian. The point is that their masses are determined, and hence 
protected, by the (super)symmetry algebra. 

Having described how the single string Hilbert space arises it is natural to 
ask whether we can incorporate properly the string interactions. Clearly string 
interactions come when we include non-planar diagrams [4] . 

Some of the arguments used in this section look very reminiscent of the 
DLCQ description of matrix strings [35, 36] . It would be interesting to see if one 
can establish a connection between them. Notice that the DLCQ description of 
ten dimensional HB theory is in terms of the M2 brane field theory. Since here 
we are extracting also a light cone description of HB string theory we expect that 
there should be a direct connection. 

It would also be nice to see if using any of these ideas we can get a better han- 
dle on other large N Yang Mills theories, particularly non-supersymmetric ones. 
The mechanism by which strings appear in this paper is somewhat reminiscent of 

[41]. 



4.1 Light font kinematics 

If we have a two dimensional metric of the form 



ds^ == —Adx'^dx = —dt^ -h dx^ (30) 

we can define the corresponding momenta p± which generate translations in the 
x^ coordinates. We find that these momenta are related to the energy and P by 

= (31) 

We see that for massive particles p± < 0 and for massless particles one of them can 
be zero. Massless particles with nonzero and p^ — 0 are moving along the -\-x 
direction. Their wavefunctions depend only on x” , so we can form a wave packet 
localized in the x~ direction. Note that the waves given by (35) move in the —x 
direction. 
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4.2 Particle propagation on plane waves 

In order to understand the physics of these backgrounds let us consider particle 
propagation on these plane waves. Starting with the particle action 

5 = ^ y — em^) (32) 

where X is contracted with the spacetime metric. We now set = r and we 
insert the plane wave metric in (32)^. Since the metric is independent of x~ we 
can eliminate x~ using the fact that the corresponding momentum, which is p_ = 
— 2e“^ is conserved^. In this way we find an action where the einbein e is eliminated 
and which depends only on the y* coordinates 

s = j - Aij{x+)y'^y^) - m^/|p_| (33) 



The equations of motion for y'^ are 

(34) 

The light cone hamiltonian which implements translations in x'^ is equal to — 



-P+ = H 



M 

b- 



+ 



b-l 



Ajy^'y^ + 



\p-\ 



(35) 



We see that for A = 0 we recover the usual dispersion relation. If A is independent 
of we get a set of harmonic oscillators for the coordinates, some of which 
could have negative vo^ if A has negative eigenvalues. If A depends on time we 
essentially get harmonic oscillators with time dependent frequencies. Note that 
y = 0 is a solution which corresponds to a particular geodesic. 

Let us analyze a couple of cases. Let us first assume that the metric is a 
solution of the vacuum Einstein equations. Then we see that A has some negative 
eigenvalues and some positive eigenvalues. This is just the familiar fact that tidal 
forces for geodesics near the y = 0 geodesic will point towards the initial geodesic 
or away from it depending on the direction of the displacement, i.e. they are 
“focusing” in some directions and “defocusing” in others. 

If we have constant field strengths we can consider As with only positive 
eigenvalues. In that case tidal forces are focusing and particles with nonzero p_ feel 
in a gravitational potential well from which they cannot escape. The frequency of 
oscillations in this well is independent of p_ , thought the amplitude of oscillations, 
for fixed p+ is proportional to 1/ y/\p-\. We see that as we take p_ ^ 0 we obtain 
that particles move to y = oc and back in finite x^ time. Note that in the case 
that all eigenvalues of A are positive the x~^ direction is timelike except at y = 0. 



^We call this light cone gauge since p- is lightlike, though in general is not lightlike. 
"^Note that p- is then naturally negative since e non negative. 
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If we have Aij = . Then we have a set of harmonie oscillators of frequency 

fl and their energies are 

dt ^ 

-P+ =Y^f^{ni + (36) 

where rii is the occupation number of the oscillator i. 

Another way to treat the problem would be to solve the wave equation for a 
scalar field in the plane wave background. Fourier expanding the wave equation in 
x~ and (if A is independent) we find the harmonic oscillator equation for 
the coordinates. 



4.3 Symmetries of plane waves 

Let us start by considering a plane wave of the form 

ds^ = —4:dx^dx~ — Aijy^y^ (dx~^)‘^ + dy^dy^ (37) 



Let us first consider the bosonic symmetries. We start with general x~^ dependent 
A. There is an obvious symmetry generated by the killing vector 




(38) 



Though it is not apparent from (37) a generic plane wave has also, in addition, 
2dt killing vectors. To understand the origin of these let us first take the case where 
we have only one transverse dimension so that A is just a single function and not 
a matrix. Let us choose ya{x~^), a = 1,2, two linearly independent solutions of the 
equation (34). Then we can form two Killing vectors of the form 



C = Va{x'*')^ + -ya{x'^)y- 



It can be checked that these are killing vectors for (37). We also find that their 
commutator is 

[r - ^^1 = ^ (2/« w -yave) (40) 

Note that due to the equation of motion (34) the prefactor of p_ in (40) is a 
constant independent of . We see that once we diagonalize p_ these two Killing 
vectors obey a Heisenberg algebra. This algebra is realized on the light cone gauge 
Hilbert space as the Heisenberg algebra of position and momentum for the co- 
ordinate y. We see that the simplicity of the particle action in light cone gauge 
is intimately related to the existence of these Killing vectors. These two Killing 
vectors are realized on the particle light cone Hilbert space as creation and annihi- 
lation operators for the corresponding harmonic oscillator modes. More explicitly, 
if A = yu, independent of x+, we can choose y± ^ then (40) becomes the 

commutation relation of harmonic oscillators. In the case that A is a matrix we 




S124 



J. Maldacena Ann. Henri Poincaré 



can similarly find 2dt Killing vectors by solving (34) and choosing 2dt linearly 
independent solutions, etc. 

If A is independent there is another Killing vector, corresponding to 
translations in This becomes the energy in light cone gauge. Note that in this 
case [—p^,y±] = =F2/±- So we see that the energies of the harmonic oscillators on 
the light cone lagrangian are determined by the spacetime symmetry algebra. 

If (37) is a solution of a supersymmetric lagrangian we can ask whether we 
preserve any supersymmetries. In general we preserve half of the supersymmetries, 
the half given the equation 

r_e = 0 (41) 

on the supersymmetry generating spinor parameter e. We call these supersymme- 
tries Q-a- The a index depends on the theory we consider. These supersymmetries 
obey usual anti commutation relations of the form 



{Q — — ~ P—^ab 



(42) 



If we have a constant field strength of the form (2) we find that the x~^ dependence 
of the susy generating spinor parameter is of the form 



z r 

= e J 



dx'^ 












(43) 



where eo is constant, independent of and obeys (41). In the case of type IIB 
string theory we can choose 

F = dx^Lp , Lp = !i{dy^dy^dy‘^dy'^ + dy^ dy^ dy'^ dy^) (44) 

which would give a self dual ^ In this case we see that 

e - , r_eo = 0 , 1 = (45) 



where we used the self duality condition as well as the positive chirality of e in ten 
dimensions. 

These supersymmetries are very analogous to the bosonic symmetries we had 
above. When we act with Q_ on the light cone Hilbert space we raise or lower the 
energy by ±fi. 

Finally in type IIB super gravity, if Aij = we have sixteen extra super- 
symmetries. Their explicit form can be found in [19]. What we want to note here 
is that they obey an algebra of the form 



{Q+a, Q+b} = -p^Sab + (46) 

where the currents J appearing on the right hand side are the 50(4) x 50(4) 
generators which rotate the first four or the second four coordinates. The field 

^Note that the self duality condition relates the -1- component of F to the + component of F 
due to the off diagonal term in the metric (37). 
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strength (44) breaks the 50(8) symmetry of the metric to 50(4) x 50(4). These 
supersymmetries are fairly standard supersymmetries in light cone gauge. They 
anticommute to the Hamiltonian plus the currents. Similarly if we commute or 
anticommute with Q_ or the killing vectors (39) we obtain some combination 
of (39) and Q_ respectively, see [19]. These plane waves are maximally supersym- 
metric backgrounds that preserve 32 supersymmetries. 
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Solution OÎ M — 2 Theories via Instanton Counting 

Nikita A. Nekrasov 



Abstract. The J\f = 2 gauge theories in four dimensions were solved to a large 
extend by the method of Seiberg and Witten which relies on the constraints of 
special geometry and several natural assumptions. However these constraints are 
not powerful enough to solve any J\f = 2 theory and one would like to have a more 
direct route. We present here such a technique, following [1]. 

1 Introduction 

The dynamics of gauge theories is a long and fascinating subject. The dynamics of 
supersymmetric gauge theories is a subject with shorter history but better under- 
standing [2] yet with rich enough applications both in physics and mathematics. 
In particular, the solution of Seiberg and Witten [3] oi J\f = 2 gauge theory using 
the constraints of special geometry of the moduli space of vacua led to numerous 
achievements in understanding of the strong coupling dynamics of gauge theory 
and as well as string theory backgrounds of which the gauge theories in question 
arise as low energy limits. The low energy effective Wilsonian action for the mass- 
less vector multiplets (a/) is governed by the prepotential .F(a; A), which receives 
one-loop perturbative and instanton non-perturbative corrections (here A is the 
dynamically generated scale): 

J^(a;A) = + (1.1) 

In spite of the fact that these instanton corrections were calculated in many indirect 
ways, their gauge theory calculation is lacking beyond two instantons[4][5]. The 
problem is that the instanton measure seems to get very complicated with the 
growth of the instanton charge, and the integrals are hard to evaluate. 

The present paper attempts to solve this problem via the localization tech- 
nique, proposed long time ago in [6, 7, 8]. 

Throughout the paper G denotes U (N), T = U{1)^ is maximal torus, g = Lie{G) 
its Lie algebra, t = Lie{T) its Cartan subalgebra, W = N{T)/T denote its Weyl 
group, U = (t(K>C)/W denotes the complexified space of conjugacy classes in g. We 
consider the moduli space Mk{G) of framed G-instantons: the anti-self-dual gauge 
fields A, = 0, in the principal G-bundle V over the 4-sphere = IR^ Uoo with 

k = -~ [ Tr Fa A Fa (1.2) 

87T J^4 
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considered up to the gauge transformations g : g ^Ag-\-g ^dg, s.t. ^'(oo) = 1. 

We also consider several compactifications of the space Mk{G): the Uhlenbeck 
compactification Mk{G) and the Gieseker compactification In the formula 
(1.2) we use the trace in the fundamental representation. 

We calculate vacuum expectation value of certain observables (defined more 
precisely in the main body of the paper): 

Z(a, 6 i, 62;A) = ^exp lj A Tr {(pF 'ip'ip) - H{x) TrF A ^ (1.3) 

where a labels vacua, and A is the dynamically generated low-energy scale. The 
observables in (1.3) are annihilated by a combination of the supercharges, and 
their expectation value is not sensitive to various parameters, the energy scale in 
particular. Hence, one can do the calculation in the UV, where the theory is weakly 
coupled and the instantons dominate. Or, one can do the calculation in the IR, 
and relate the answer to the prepotential of the effective low-energy theory. By 
equating these two calculations we obtain the desired formula for the latter . 

Our first claim is 



Z(a,ei,C2; A) = exp 



.F-^Ha,ei, 62 ;A) 



Ci€2 



(1.4) 



where the function is analytic in ei, 62 near ei = €2 = 0. 

We also have the following explicit expression for Z in the case ei = —€2 = h for 
G = U{N) : 






a-in + (j - i) 



(1.5) 



Here a;„ = ai — an, the sum is over all coloured partitions: k = (ki, . . . ,kjv), 
k/ = > A:;, 2 > . . . > ki^m + 1 = = . . . = 0}, 

|k| = ki,i , 

and the product is over 1 < /, n < AT, and i,j > 1. 

The formulae (1.5) were checked against the Seiberg-Witten solution [9]. Namely, 
we claim that 



F^^^^{a,ei,e2)\ei=e2=o = fhe instanton part of the prepotential 

of the low-energy effective theory of the N=2 
gauge theory with the gauge group G 
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The latter statement means that is related to periods of a family of 

curves. More precisely, consider the following family of curves 

A 27V ^ 

w + = P(A) = J]^(A a/). (1.6) 

1=1 



The base of the 
set = 0). 

In we can 
given by: 



family (1.6) is the space ^=C^“^3nof the polynomials P (we 
Consider the region C U where |o;/|, |ck/ - an\ > |A|, |m/|. 

pass from the local coordinates (o/) to the local coordinates (a/) 




(1.7) 



where the cycle Ai can be described cts encircling the cut on the A plane connecting 
the points af = a/ + o(A) which solve the equations: 



±2A"^Q2(af=) = P(ckf=) 



( 1 . 8 ) 



The sum Ai vanishes in the homology of E^^, therefore we get TV — 1 independent 
coordinates, as we should have. Now, define the dual coordinates 



a 



D 

i 



7 -/ 

2tti w 



(1.9) 



where Bi encircles the cut connecting a'l~ and ^'çi^i^rnodN' Then, one can show 
[3, 9] that 

dai A daf = 0 

i 

on and as a consequence, there exists a (locally defined) function, called pre- 
potential, T{a]m,A) such that 



afdai = d^(a) (1-10) 

i 

In the region the prepotential has the expansion: 

J^(a; A) = 

= i («; - an)' log (“) (1.11) 

where is a power series in A. Our claim is that (1.11) 



^instjefined by the formula (1.11 coincides with^^’^^^(a, ei, e 2 )|ei=e 2 =o . 



We have checked this claim by an explicit calculation for up to five instantons, 
against the formulae in [10]. 
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The paper is organized as follows. In the next section we describe the physical idea 
of our calculation. We define the observable of interest, and sketch two calculations 
of its expectation value - in the weak coupling regime in the ultraviolet, and the 
infrared low-energy effective theory calculation. In the section 3 we discuss our 
results from the M-theory viewpoint, consider some generalizations, present our 
conjectures and describe future directions. 

Acknowledgments. Research was supported in part by grant 01-01-00549 

and by the grant 00-15-96557 for the support of scientific schools. I am grateful to 
Rutgers University for hospitality during the preparation of the manuscript. 

2 Field theory construction 

In this section we explain our approach in the field theory language. We exploit 
the fact that the supersymmetric gauge theory on flat space has a large collection 
of observables whose correlation functions are saturated by instanton contribution 
in the limit of weak coupling. In addition, in the presence of the adjoint scalar 
vev these instantons tend to shrink to zero size. Moreover, the observables we 
choose have the property that the instantons which contribute to their expectation 
values are localized in space. This solves the problem of the runaway of point-like 
instantons, pointed out in [6]. 

2.1 Supersymmetries and twisted supersymmetries 

The M = 2 theory has eight conserved supercharges, Qa,Ql, which transform 
under the global symmetry group SU{2)l x SU{2)r x SU{2)f of which the first 
two factors belong to the group of spatial rotations and the last one is the R- 
symmetry group. The indices a,à,i are the doublets of these respective SU (2) 
factors. The basic multiplet of the gauge theory is the vector multiplet. Here is 
the spin content of its members: 

Field SU{2)l SU{2)r SU{2)i 



^ è 0 

- 0 - 

0 - - 

^Oi ^2 2 

0 0 0 



It is useful to work in the notations which make only SU{2)l x SU{2)d part of 
the global symmetry group manifest. Here SU{2)d is the diagonal subgroup of 
SU{2)r X SU{2)i. If we call this subgroup a “Lorentz group”, then the super- 
charges, superspace, and the fermionic fields of the theory split as follows: 
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Fermions: 

Superspace: 

Superfield: $ = 0 + + . . . 

Supercharges: Q, G'yu- 

The supercharge Q is a scalar with respect to the “Lorentz group” and is usually 
considered as a BRST charge in the topological quantum field theory version of 
the susy gauge theory. It is conserved on any four-manifold. Our idea is to use 
other supercharges Gf^ as well. Their conservation is tied up with the isometries 
of the four- manifold on which one studies the gauge theory. Of course, the idea 
to regularize the supersymmetric theory by subjecting it to the twisted boundary 
conditions is very common both in physics [11, 12], and in mathematics [13] [14] 
[15] [16], 

At this point we should mention that the idea to apply localization tech- 
niques to the instanton integrals has been recently applied in [17] in the one- and 
two-instanton cases. Without T^-localization this is still rather complicated, yet 
simpler calculation then the direct evaluation [5]. We refer the interested reader 
to the beautiful review [18] for more details. 

2.2 Good observables: UV 

In the applications of the susy gauge theory to Donaldson theory, where one works 
with the standard topological supercharge Q, the observables one is usually inter- 
ested in are the gauge invariant polynomials = P{(j){x)) in the adjoint scalar 

0, evaluated at space-time point x, and its descendants: where 

S is a k-cycle. Unfortunately for /c > 0 all such cycles are homologically trivial on 
and no non-trivial observables are constructed in such a way. One construct an 
equivalent set of dual observables by integration over IR'^ of a product of a closed 
/c-form cj = 4 “ /c-form part of P($): 

0‘^= J éxée lü{x,9) P($) (2.1) 

Again, most of these observables are Q-exact, as any closed /c-form on IR^ is exact 
for /c > 0. 

However, if we employ the rotational symmetries of IR^ and work equivari- 
antly, we find new observables. 

Namely, consider the fermionic charge 

Q = Q + ( 2 . 2 ) 

Here ior a = 1 ... 6 are the vector fields generating SO{A) rotations, 

and E e Lie (5(9 (4)) is a formal parameter. 
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With respect to the charge Q the observables are no longer invariant^. How- 
ever, the observables ( 2 . 1 ) can be generalized to the new setup, producing a priori 
nontrivial Q-cohomology classes. Namely, let us take any 50(4)-equivariant form 
on IR^. That is, take an inhomogeneous differential form ft{E) on IR^ which de- 
pends also on an auxiliary variable E E Lie{SO{4)) which has the property that 
for any g E SO{4): 

g*n{E) = Ü(g-^Eg) (2.3) 

where we take pullback defined with the help of the action of 5'0(4) on IR^ by 
rotations. Such £^-dependent forms are called equivariant forms. On the space of 
equivariant forms acts the so-called equivariant differential. 



D — d-\- tv(E) (2-4) 

where V{E) is the vector field on IR'^ representing the infinitesimal rotation gen- 
erated by E. For equivariantly closed (i.e. D-closed) form Ü{E) the observable: 

= [ Cl{E) A F($) (2.5) 



is Q-closed. 

Any 50 (4) invariant polynomial in E is of course an example of the O-closed 
equivariant form. Such a polynomial is characterized by its restriction onto the 
Cartan subalgebra of 50(4), where it must be Weyl-invariant. The Cartan subal- 
gebra of 50(4) is two-dimensional. Let us denote the basis in this subalgebra 
corresponding to the decomposition IR^ = IR^ 0 IR^ into a orthogonal direct 
sum of two dimensional planes, by (€ 1 , 62 ). Under the identification Lze(50(4)) 
Lie{SU (2)) 0 Lie[SU(2)) these map to (ci + e 2 ,€i — € 2 ). The Weyl {= IL 2 x IL 2 ) 
invariant polynomials are polynomials in a — €2 and x = ^ 1 ^ 2 - As 50(4) does 

not preserve any forms except for constants we should relax the 50(4) symmetry 
to get interesting observables. 

Let us fix a translationally invariant symplectic form u on IR^. Its choice 
breaks 50(4) down to U{2) - the holonomy group of a Kahler manifold. Let us 
fix this U (2) subgroup. Then we have a moment map: 

^ : IR^ — > Lie{U{2))\ dg.{E) = tv'(B)W, E € Lie{U{2)) (2.6) 

And therefore, the form u — fi{E) is O-closed. One can find such euclidean coor- 
dinates u = 1 , 2 , 3, 4 that the form uj reads as follows: 

u = dx^ A dx^ + dx^ A dx"^ (2.7) 

The Lie algebra of U{2) splits as a direct sum of one-dimensional abelian Lie 
algebra of 17(1) and the Lie algebra of 517(2). Accordingly, the moment map /i 



^except for OpQ where 0 E IR"^ is the origin, left fixed by the rotations 
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splits as {h^ ( 1 ^ , fj? , In the coordinates 

h = = \t}1^x^x‘', ( 2 . 8 ) 

where is the anti-self-dual ’t Hooft symbol. 

Finally, the choice of u also defines a complex structure on IR'^, thus identifying 
it with with complex coordinates z\, Z 2 given by: z\ — ix^, Z 2 = x^ ix^. 
For E in the Cartan subalgebra H = /j>{E) is given by the simple formula: 

H = ci\zi\‘^ C2\z2\‘^ (2.9) 

After all these preparations we can define the correlation function of our interest: 

Z{a, £) = - iî TV (F A F)) ^ (2.10) 

where we have indicated that the vacuum expectation value is calculated in the 
vacuum with the expectation value of the scalar 0 in the vector multiplet given by 
a E t. More precisely, a will be the central charge of M — 2 algebra corresponding 
to the FF-boson states (cf.[3]) . 

Remarks. 

1) Note that the observable in (2.10) makes the widely separated instantons 
suppressed. More precisely, if the instantons form clusters around points 
ri, . . . , r/ then they contribute exp— H{fm) to the correlation function. 

2) One can expand (2.10) as a sum over different instanton sectors: 

oo 

Z{a,e) = '^q'^Zk{a,€) 
k=0 

where q ~ is the dynamically generated scale - for us - simply the 
generating parameter. 

3) The observable (2.10) is formally cohomologous to the identity, as 

j {üj + H) Tr#2 = Q j éxd*e A{x,e) , 

where A{x, 6) = We cannot eliminate it without having to perform 

the full path integral, however, as it serves as a supersymmetric regulator. 
On the other hand, in the presence of this observable the path integral can be 
drastically simplified, the fact we shall exploit below. The analogous manip- 
ulation in the context of two dimensional supersymmetric Landau- Ginzburg 
models was done in the first reference in [11]. 
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The supersymmetry guarantees that (2.10) is saturated by instantons. Moreover, 
the superspace of instanton zero modes is acted on by a finite dimensional version 
of the supercharge Q which becomes an equivariant differential on the moduli 
space of framed instantons. Localization with respect to this supercharge reduces 
the computation to the counting of the isolated fixed points and the weights of the 
action of the symmetry groups (a copy of gauge group and U (2) of rotations) on 
the tangent spaces. This localization can be understood as a particular case of the 
Duistermaat- Heckman formula [19], as (2.10) calculates essentially the integral of 
the exponent of the Hamiltonian of a torus action (Cartan of G times T^) against 
the symplectic measure. 

The counting of fixed points can be nicely summarized by a contour integral (see 
below). This contour integral also can be obtained by transforming the integral 
over the ADHM moduli space of the observable (2.10) evaluated on the instanton 
configuration, by adding Q-exact terms, as in [8] [7]. It also can be derived from 
Bott’s formula [20]. 



2.3 Reduction to contour integrals 

After the manipulations as in [8, 7] we end up with the following integral [1]: 



Zk{a,ei,e2) = 

1 — TT (2 11 ) 

k\ (27Tzei£2)'' J P(cf>i)P{<pi + e) A-l-i</<J<fc {4>jj - e\){4i]j - el) 

where: 

N 

P(x) = - a/). (2-12) 

1=1 

<l)ij denotes </>/ — (l)j^ and e = ei + C2. 

The integrals (2.11) should be viewed as contour integrals. As explained in 
[7] the poles at </)/j = ei , €2 should be avoided by shifting ei ,2 ^ ^ 1,2 + ^0, those at 
0/ = a I similarly by a/ a/ + zO (this case was not considered in [7] but actually 
was considered (implicitly) in [8]). The interested reader should consult [21] for 
more mathematically sound explanations of the contour deformations arising in 
the similar context in the study of symplectic quotients. 

Perhaps a more illuminating way of understanding the contour integral (2.11) 
proceeds via the Duistermaat - Heckman formula [19]: 



L 






E 

f-V^(f)=a 



nr = i «^ 4 ^](/) 



(2.13) 



Here is a symplectic manifold (in the original DH setup it should be 

compact, but the formula hold in more general situation which extends to our 
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case) with a Hamiltonian action of a torus T^, /i : X ^ t* is the moment map, 
Ç G t = Lze(T) is the generator, G Vect{X) is the vector field on X representing 
the T action, generated by / G X runs through the set of fixed points, and 
Wi[^]{f) are the weights of the T action on the tangent space to the fixed point. 

In our case X = Xik is the moduli space of instantons of charge /c, T is 
the product of the Cartan torus of G and the torus C 50(4), ^ = (a, € 1 , 62 ) • 
The fixed points / will be described in the next section. However, already without 
performing the detailed analysis of fixed points one can understand the meaning 
of ( 2 . 11 ): 

Suppose / is a fixed point. It corresponds to some quadruple {Bi, B 2 , 1 ^ J) 
such that the T-transformed quadruple belongs to the same f/(A:)-orbit. Working 
infinitesimally we derive that there must exist cj) G Lie{U{k)), such that: 



[^1,0] [^ 2 , (/)] = 62 H 2 

— (f)I H“ ICL = 0, — ClJ -\- J (f) — — {c\ “h ^2)J 

We can go to the bases in the spaces F, W where 0 and a are diagonal. Then the 
equations (2.14) will read as follows: 

{<t>i - 4>J + ^i) Bijj = 0 

(0/ - + 62 ) 52,/./ — 0 (2.15) 

(0/ - cLi) — 0 

(0/ + Cl + C2 — ai) Jij = 0 

For (2.15) to have a non-trivial solution some of the combinations 

4>ij + € 1 , 2 , 4>i - </>/ + + ^2 - ai (2.16) 

must vanish. This is where the poles of the integrand (2.11) are located. The 
equations (2.15) specify (f) uniquely, given a,ci,e 2 and /. Thus, exactly k out of 
2fc^ + 2kN combinations should vanish. Now let us look at the (holomorphic) 
tangent space TjMk- It is spanned by the quadruples (SBi,0B2,SI,SJ) obeying 
the linearized ADHM equations, and considered up to the linearized U {k) trans- 
formations, that is, it can be viewed as first cohomology group of the following 
“complex” : 

C® = End{V) — » = 

End{V) ® L © Hom{V, W) © Hom{W, V) — > = End{V) ® A^L (2.17) 

where the first arrow is given by the infinitesimal gauge transformation, while the 
second is the linearized ADHM equation: d/Jc- To calculate Ilz ^ 4^5 ^ 2 ](/) it is 

convenient to compute first the “Chern” character: 

Ch{TjMk) = E 



(2.18) 
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which is given by the alternating sum of the Chern characters of the terms in 
(2.17): 

Ch{C^)-Ch{C^)-Ch{C‘^) = (e‘‘ - 1) {1 - 

I,J I, I 

(2.19) 

Upon the standard conversion 

a a 

we arrive at (2.11). It remains to explain why not every possible A;-tuple out of the 
combinations (2.16) contribute, only those which are picked by the +^0 prescrip- 
tion. This will be done in [22]. 

2.4 Good observables: IR 

The nice feature of the correlator (2.10) is it simple relation to the prepotential of 
the low-energy effective theory. In order to derive it let us think of the observable 
(2.10) as of a slow varying changing of the microscopic coupling constant. If we 
could completely neglect the fact that H is not constant, then its addition would 
simply renormalize the effective low-energy scale A ^ Ae~^ . 

However, we should remember that H is not constant, and regard this renor- 
malization as valid up to terms in the effective action containing derivatives of H. 
Moreover, H is really a bosonic part of the function 7i{x^0) on the (chiral part 
of) superspace (in [6] such superspace- dependent deformations of the theory on 
curved four-manifolds were considered): 

Together these terms add up to the making the standard Seiberg- Witten effective 
action determined by the prepotential .F(a; A) to the one with the superspace- 
dependent prepotential 

Ae“^^^’^^) = ^(u; Ae~^) + uj Adf^T(a] Ae~^) -h (A^a)^ Ae~^) 

( 2 . 20 ) 

This prepotential is then integrated over the superspace (together with the conju- 
gate terms) to produce the effective action. 

Now, let us go to the extreme infrared, that is let us scale the metric on 
IR" by a very large factor t (keeping cu intact). On flat IR^ the only term which 
may contribute to the correlation function in question in the limit ^ ^ oo is the 
last term in (2.10) as the rest will (after integration over superspace) necessarily 
contain couplings to the gauge fields which will require some loop diagrams to get 
non- trivial contractions, which all will be suppressed by inverse powers of t. The 
last term, on the other hand, gives: 
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where we used the fact that the derivatives of H are proportional to ei^ 2 - Recalling 
(2.7) (2.8) the integral in (2.21) reduces to: 



Z{a;ei,e 2 ) = exp 



j^inst(^a-A) + 0{e) 



€ i 62 



( 2 . 22 ) 



where 



nOO 

djjJ^{a-Ae-^) HàH 

Jo 

thereby explaining our claim about the analytic properties of Z and T. 



2.5 Classification of the residues 

The poles which with non- vanishing contributions to the integral must have (j)ij ^ 
0, for I ^ J, otherwise the numerator vanishes. This observation simplifies the 
classification of the poles. They are labelled as follows. Let k = k\ k 2 + . . . 
be a partition of the instanton charge in N summands which have to be non- 
negative (but may vanish), ki > 0. In turn, for all / such that ki > 0 let Yi denote 
a partition of ki : 

ki = /c/,1 + . . . kiy,, , ki^i > /c/,2 > . . . > > 0 

Let > . . . > 0 denote the dual partition. Pictorially one represents 

these partitions by the Young diagram with ki^i rows of the lengths z/^’^ , . . . V 
This diagram has z/^’^ columns of the lengths . . . ,kiy,i. Sometimes we also 
use the notation rii = z/^’^, and we find it useful to extend the sequence ki^i all the 
way to infinity by zeroes: k/ = {/c/q > ki ^2 > • • • = ki^m +2 = . . . = 0}. 

In total we have k boxes distributed among N Young tableaux (some of which 
could be empty, i.e. contain zero boxes). Let us label these boxes somehow (the 
ordering is not important as it is cancelled in the end by the factor /c! in (2.11)). Let 
us denote the collection of N Young diagrams by Y = (Yi , . . . , Yv). We denote by 
|Y| = ki the number of boxes in the /’th diagram, and by |Y| = |Y| = |k| = k. 

Then the pole of the integral (2.11) corresponding to Y is at (/>/ with I 
corresponding to the box {a,/3) in the Vth Young tableau (so that 0 < a < 
z/^’^, 0 < /3 < ki^oJ) equal to: 

y ^ 0, = az + ei(a - 1) + t2{P ~ 1) (2.23) 

2.6 Residues aud fixed poiuts 

The poles in the integral (2.11) correspond to the fixed points of the action of the 
groups (7 X on the resolved moduli space Mk.N’ Physically they correspond to 
the U{N) (noncommutative) instantons which split as a sum of U{\) noncommu- 
tative instantons corresponding to N commuting U{1) subgroups of U(N). The 
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instanton charge ki is the charge of the U{1) instanton in the /’th subgroup. More- 
over, these abelian instantons are of special nature - they are fixed by the group 
of space rotations. If they were commutative (and therefore point-like) they had 
to sit on top of each other, and the space of such point-like configurations would 
have been rather singular. Fortunately, upon the noncommutative deformation the 
singularities are resolved. The instantons cannot sit quite on top of each other. In- 
stead, they try to get as close to each other as the uncertainty principle lets them. 
The resulting abelian configurations were classified (in the language of torsion free 
sheaves) by H. Nakajima [23]. 

Now let us fix a configuration Y and consider the corresponding contribution 
to the integral over instanton moduli. It is given by the residue of the integral 
(2.11) corresponding to (2.23): 

p _ TT TT ff ~ 1) + ^2(/? - 1)) 

^ I Ü M + 1 ) - e2h{s)){e2h{s) - ee{s)) 

TT TT TT / (û(m + êl(o! “ _|_ £2(1 — /3)^ (flim + f 1Û + Ê2(^;,a + 1 ~ P)) \ 

i<ma=i ,3=^ \ ^ 2(1 ~ P)) {o-lm + £i(Q “ + €2(fc/,a + 1 — P)) J 

(2.24) 

where we have used the following notations: aim = ai — a„, 



Si(x) 



Q(ai + x) 

nTTi5^/(^ + ^im){x + e + aim) ’ 



Si{x) 



1 ) _ 



(2.25) 



and 

^( 5 ) = fa, a - P, h{s) = ki^oi + - a - P Yl ( 2 . 26 ) 

Now, if we set e\ = ft — —€2 the formula (2.24) can be further simplified and it 
becomes exactly the formula for the term in the sum (1.5), corresponding to the 
partition {fa,i}- 



2.7 Perturbative part 

So far we were calculating the nonperturbative part of the prepotential. It would 
be nice to see the perturbative part somewhere in our setup, so as to combine the 
whole expression into something nice. 

One way i^o calculate carefully the equivariant Chern character of the tan- 
gent bundle to Aik along the lines sketched in the end of the previous section [22]. 
The faster way in the ci + 62 — 0 case is to note that the expression (1.5) is a sum 
over partition with the universal denominator, which is not well-defined without 
the non-universal numerator. Nevertheless, let us try to pull it out of the sum. 
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We get the infinite product (up to an irrelevant constant): 

1 



exp 



nn 

i,j = l l^m 

/•oo 



aim + Hi - j) 



ds 



s — l)(e" 



1) 



(2.27) 



If we regularize this by cutting the integral at 5 ~ ^ 0, we get a finite expression, 

which actually has the form 



exp 






ei€2 



with being analytic in ei,C 2 at zero. In fact 

0, 0) = ^ ^a^^log aim + ambiguous quadratic polynomial i 



m ain 



l^m 



The formula (2.27) is a familiar expression for the Schwinger amplitude of a mass 
aim particle in the electromagnetic field 

F oc €i dx^ A dx^ + C2 dx^ A dx"^ . (2.28) 



Its appearance be explained in the next section. 

Let us now combine and into a single €-deformed prepotential 



T{a, ei , 62) = ei , €2) + (a, €1 , «2) 

where for general ei, €2 we define: 

/*oo J p-saim 

^-‘(a,€l,€2) = ^ / 

Je 



s sinh(^)sinh(fi) 

with the singular in 5 part dropped. We define: 

T{a,ei,e2\q) 



Z(a,€i,€ 2 ;g) = exp- 



€1«2 



(2.29) 



(2.30) 



3 M- and K-theory inspired conjectures 

In this section we suggest a physical interpretation to the e-deformed prepotential 
■^(a,ei,e2;g). 
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3.1 Five dimensional viewpoint 

It was understood long time ago that the instanton corrections to the prepotential 
of AT = 2 gauge theory can be interpreted as one-loop corrections in the five 
dimensional theory compactified on a circle, in the limit of vanishing circle radius 
[24]. 

Consider five dimensional M — 2 gauge theory with the gauge group G, and 
possibly some matter. Consider a path integral in this theory on the space-time 
manifold which is a product xIR^, We shall impose periodic boundary conditions 
on the fermions in the theory (up to a twist described momentarily). We shall 
also consider the vacuum in which the adjoint scalar in the vector multiples hats 
vacuum expectation value a Ç t. More precisely, we should specify the holonomy 
of the gauge field around the circle at infinity of IR^. Together with the vacuum 
expectation value of the scalar (which must commute with the holonomy) they 
define an element a of the complexified Cartan subgroup Tc of the gauge group 

[24]. 

Let us define the following generalized index: 



2(a,ei,e2,«3;/?) = 

— [(ei 4- £2) Jl + (^1 “ ^ 2 ) Jr + ^sJf + /3/f] (3.1) 

We now choose €3 = €2 ^ ei. This is the counterpart of choosing the subgroup 
SU{2)d as we did in the section 2. Here we have used the little group *90(4) spins 
JlGr and the generator Jf of the R-symmetry group SU{2)i. 

The five dimensional theory has two kinds of particles. The perturbative spec- 
trum consists of the gauge bosons and their superpartners (we shall now consider 
the case of minimal susy theory for simplicity). In addition, the theory has soli- 
tons, coming from instanton solutions of four dimensional gauge theory. To find 
the spectrum of these solitons and their bound states one can adopt the standard 
collective coordinate quantization scheme. 

In the limit /3 ^ 0 the infinite-dimensional version of the heat kernel expan- 
sion will reduce the supertrace in (3.1) to a path integral in the four dimensional 
gauge theory. Moreover, the arrangement of the twists is such that the theory will 
possess some supersymmetry, which we identify with Q. One can play with the 
gauge coupling to further reduce the path integral to a finite-dimensional integral 
over the instanton moduli space, of the kind we considered in this paper. 

On the other hand, geometrically, the twists in (3.1) can be realized by re- 
placing the flat five dimensional space-time by a twisted IR^ bundle over the circle 
of the radius /d/27r such that by going around this circle one twists the^ IR^ 
according to (3.1). Let us denote the resulting (locally flat) space by X^. 

Now imagine engineering [25] the five dimensional gauge theory by “compact- 
ifying” M-theory on a non-compact Calabi-Yau given by the appropriate fibration 

^Ifei-t- 62^0 one should also twist the transverse six dimensional space 
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of the ALE singularity over the base By further compact ifying on we get 
a background, which can be now analyzed string-theoretically. 

If we view the base circle of as the M-theory circle, then we end up 
with the II A Melvin-like background, where one has a vev of the RR 1-form field 
strength, which is actually 

F — eidx^ A dx^ + €2dx^ A dx'^ (3.2) 

near x = 0. In fact, by going to the weak string coupling limit {0 0) one can 

make (3.2) to hold arbitrarily far away from the origin. 

The five dimensional particles, going around the circle appear in the four 
remaining dimensions like particles carrying some charge with respect to F. In cal- 
culating their contribution to the supertrace (3.1) we would perform the standard 
Schwinger calculation, as in [26]. 

This identification suggests the interpretation of h. Let us expand F(a, /i, —h) 
as a power series in h: 



1 ^ 

-\ogZ{a, h, -h) = —T{a, h, -h) = ^ (3.3) 

^ 9=0 

(the fact that only the even powers of Ti appear follows from the obvious symme- 
try Cl 62 ). The expansion (3.3) suggests that h has to play a role of the string 
coupling constant. To be more precise, in the setup in which the gauge theory 
is realized as IIA compactification on the ALE singularity fibered over the 
prepotential is essentially calculated by the worldsheet instantons of genus zero. 
The higher genus corrections give rise to the couplings [27], where R 

is the curvature of the four dimensional metric, and F is the graviphoton field 
strength. Collecting all the evidence above we conjecture that the 6 -deformed pre- 
potential captures the graviphoton couplings (even in the case e\ + C 2 7 ^ 0 , where 
the graviphoton field strength is not self-dual - in this case the theory should be 
properly twisted). 

3.2 K- theory viewpoint 

If we don’t take the limit ^ 0 we can still give a finite dimensional expres- 
sion for (1.5). The collective coordinate quantization leads [24] to the minimal 
supersymmetric quant um^niechanics on Mk, whose ground states correspond to 
the harmonic spinors on Mk- The index (3.1) calculates the equivariant index of 
Dirac operator on Aik- The latter has the following Atiyah-Singer expression: 
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where Â{M) is the A-roof genus of the manifold M: 

aroo/ch(TM) = 5]e"- ^ iff(M) = J] (3-5) 

z i e 2 -e 2 



The localization technique (this time in equivariant K-theory [28]) leads to the 
following expression for (3.4): 






sinh (3 {aim + {k^j - kmj + j - i)) 
sinh ^ {aim + ^(j - 0) 



(3.6) 



It would be nice to analyze (3.6) further, relate it to the relativistic Toda chain 
spectral curves [24] , and to the four dimensional analogue of Verlinde formula [29] . 
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Topological Effects in Matrix Models 
Representing Lattice Gauge Theories at Lîirge N 

Herbert Neuberger 



Abstract. Quenched reduction is revisited from the modern viewpoint of field- 
orbifolding. Fermions are included and it is shown how the old problem of pre- 
serving anomalies and field topology after reduction is solved with the help of the 
overlap construction. 



1 Introduction 

The techniques producing numerical values for physical quantities affected by the 
strong interactions have continuously evolved over the last twenty years. They 
have become quite sophisticated and, coupled with major increases in affordable 
computing power, generate steady, incremental progress [1]. However, if the sole 
activity in lattice field theory were that of adding digits of higher accuracy to 
some basic quantities one might get the impression that this subject, while useful, 
is rather dull. Luckily, this is not the case: there are lattice activities which do 
take risks and, at times, produce a breakthrough (e.g. lattice chiral fermions [2]), 
or, show that lattice field theory is good for other things than QCD (e.g. Higgs 
triviality bounds [3] ) . There are people who venture into supersymmetric territory 
[4], tackle “insoluble problems” like QCD with a chemical potential [5], use lattice 
techniques to check scenarios for explaining the baryon number asymmetry [6], 
and are inventive in many other ways. In turn, lattice field theory constructions 
enter mainstream particle physics [7] and even quantum gravity [8], providing 
novel “UV-completions” . 

In this talk I shall present a project whose long term objective is to solve 4D 
planar QCD. The project is in its initial stages and, unlike more typical lattice 
field theory projects, is far from being guaranteed to succeed. This project is a 
collaborative effort with J. Kiskis and R. Narayanan [9]. At present, the focus 
is narrowed down to the calculation of the Q^-dependent meson-meson correla- 
tion functions in 4D. In 2D, where the planar limit is solvable (as shown by ’t 
Hooft [10]), this correlation function is of fundamental importance. Although ’t 
Hooft’s solution was mainly analytical, to actually get numbers also a numerical 
procedure is required, even in two dimensions. There is little chance for a similar 
analytical breakthrough in 4D, But, we hope that a numerical approach - this 
time containing a stochastic element - will be possible using old ideas of large Nc 
reduction and new lattice fermions. 
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’t Hooft’s planar limit [11] in 4D requires three separate limits, one more than 
in ordinary QCD. One needs to take an ultraviolet cutoff, A, (needed to define the 
theory beyond its perturbative series) to infinity, an infrared cutoff (typically the 
volume of an Euclidean torus), V, to infinity and the number of colors, Nc, at 
g'^Nc = A fixed, also to infinity (A is to be traded for a physical scale). The 
natural order of limits is as follows: first F ^ oo, next A oo and last Nc oo. 
One knows how to exchange the first two limits without altering the result even 
when the quarks are massless. Less is known about taking Nc ^ oo earlier than 
last, but we shall do that. Otherwise, we are relegated to solving not only QCD 
first, but also QCD-like theories for different numbers of colors, and then the 
relevance of getting the planar limit numerically becomes mostly academic. The 
result [12] is nevertheless important for our project because such investigations 
tell us, quantitatively, how far Nc = oo is from Nc = 3. For us, “solving” planar 
QCD means that a shortcut to the Nc = oo limit was used, and the method finds a 
numerical solution at A^c = oo with considerable more ease than would be possible 
in real QCD. This ancient hope sounds more realistic today than it did five years 
ago also thanks to progress made by string theorists [13]. To be sure, in general, 
the interchange of the limits Nc oo with A ^ oo is non-trivial [14]. For example, 
the theory may contain a set of degrees of freedom whose number does not increase 
with Nc, but whose impact on the ultraviolet behavior at any finite Nc is large. 
Nor is the interchange of the V oo and Nc ^ oo limits a trivial matter in a 
theory with massless elementary fields [15]. 

2 Large Nc reduction 

We start on a lattice with a “bare” coupling (3 = -^ and at a finite volume V. 

There is very good evidence that the limit Nc oo, with b = ^ fixed, exists. 
It exists for the entire range of positive values of b. At small values of b (strong 
coupling) large Nc reduction [16] tells us that the F oo limit is trivial: there 
is no F-dependence left at leading order in Nc, one can set F = 1 and lattice 
momentum space is generated from the eigenvalues of the gauge field operators. 
Complications arise when one tries to extend this to weak coupling (large b) [17], 
but there are some options available [17]. This is not dissimilar to the situation in 
models that admit a weakly coupled dual string description at large A. 

In their work, Eguchi and Kawai [16] proved that the Schwinger- Dyson equa- 
tions for single-trace operators in the reduced and full models were the same at 
infinite Nc. Later [17] it was realized that this only established reduction for small 
enough 5, and that to extend it to large b an additional ingredient, for example, 
“quenching” [17], was needed. 

In modern language, large Nc reduction works by “field orbifolding” [18]. 
This terminology comes from continuum field theories (actually these theories are 
the low energy sector of some string theories and the term originates from a method 
of constructing special backgrounds for the latter [19]), but the dependence on 




Vol. 4, 2003 Topological Effects in Matrix Models 



S149 



continuous space-time plays little role in the basic combinatorial identity. In our 
lattice application there is no explicit continuum. Field orbifolding can produce 
pairs of theories that admit ’t Hooft expansions and, at least at the level of bare 
diagrams, are simply related by possible rescalings of the coupling constant. The 
fields in one member of the pair are obtained by subjecting the fields in the other 
member to a set of constraints. These constraints reduce the symmetry group 
from U{Nc) x G to x G'. In the context of large Nc reduction one 

starts with a theory that has T = 1, a single site lattice, and one ends up with 
a theory with a larger lattice (here chosen as a four dimensional symmetrical 
torus), V = L^. The original theory has a U{Nc) x U{1)^ and the new theory 
has a symmetry x U{1)^ x Z{L)\ (The = NJL^ and are 

assumed to be integers.) The two theories have the same b and, in the second, the 
factor Z)i=i^L^U{Nc/ L^) is a lattice gauge symmetry group. The original theory is 
the reduced model and the main assumption of our project is the hope that the 
factor isn’t really needed to get to the planar limit. 

Let me first illustrate the main trick in a simpler model, where the field 
integration measure, excepting the action term, is flat and unrestricted. This makes 
it easy to implement the constraint on the original fields by a “decoration” of the 
original planar diagrams. The “decoration” is a discrete gauge-field theory whose 
gauge link variables need to be summed over, living on the planar diagram of the 
original theory. 

The model is known as the “Weingarten model” [20], and it defines a gen- 
erating functional for random surfaces on a hypercubic lattice - a rudimentary 
string theory. 

The partition function of its reduced version [21] is given by 



^Dw(&)=n n [/ 









( 1 ) 



X 6' 






Although the integral does not converge, the formula is meaningful as a com- 
pact definition of an infinite series in b. Similarly, the large N limit of F{b) = 
log{Z^^^{b)) is also meaningful as a series in b. It is given there by the sum of 
all planar l^j'^-type Feynman diagrams with a factor of b attached to each vertex. 

The reduced model has a symmetry SU{N) x f/(l)^, where the SU{N) acts 
by conjugation simultaneously on all of the matrices and the ^7(1) factors in- 
dependently affect the phases of each matrix. There also is a symmetry group 
made out of 2^d\ elements consisting of permutations of the fi indices and conju- 
gations. This “crystallographic group” plays no role in our particular application 
here, and hence will be ignored (it also was ignored a few paragraphs earlier). The 
constrained, or, equivalently, space-time-extended model, is defined on a hypercu- 
bic toroidal lattice containing sites. 
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Its partition functions is: 

^cDw(^) “ n n n 

[1=1 x^=0 

X irA^{x)A^{x+^i)Alix+u)Al{x)-J2^ irA^{x)Al{x) 

(2) 

The symmetry of the extended model is [(8>a:f/(A/'')] x U (1)^^ xZ{L)^. The first factor 
is a local lattice gauge symmetry, the second is a global symmetry and the third 
a space-time symmetry. Again we ignore the crystallographic group. Actually, the 
phases associated with the f/(l)’s go only from zero to ^ - larger phases can be 
reduced by lattice gauge transformations. Again the integral does not converge, 
but the formula is meaningful as a compact definition of an infinite series in b' . 
Similarly, the large N' limit of F'(b') = meaningful as a 

series in b'. It is again given by the sum of all planar j^j'^-type Feynman diagrams 
with a factor of b' attached to each vertex, but, with propagators that are less 
trivial than before. 

The extended model can be viewed as a constrained version of the reduced 
model if we pick N — N' , b — b', write the 1 < i < indices as double indices 
(z',x) with l<i'<N',0<x^ <L — 1 and restrict the to be 

zero when y ^ x -\- ji. When y = x -\- jl there are no additional restrictions. If 
all A^’s in the reduced model of eq. (1) obey this constraint, the portion of the 
trace referring to the x, y components of the indices can be written out explicitly 
and one obtains the extended model of equation (2). An important observation is 
that the constraint can be viewed as enforcing invariance under a subgroup of the 
symmetry group of the reduced model: 

Af^ = %{Aij) = u = l,...,d ( 3 ) 

[ff‘'](î',x),(:)',y) — ^ (4) 

The set of operations 7^ generates a Z{L)^ subgroup of the symmetry group of 
the reduced model. The elements of the group are labeled by [s] where [s] = 
(si,S 2 , . . . ,Sd) and = 0, 1, . . . , L - 1. 

TW(^a.) = (ffW ) = (5) 

U=l 

Note that the action of on differs from its action on ior y ^ v by 
an overall phase factor. This Z{LY intersects both original symmetry group fac- 
tors, SU{N) and Invariance under the action of the four generators % is 

equivalent to invariance under the entire Z{L)^ group. 

Since we integrate freely on the entries of the matrices A^, we can redefine 
the constrained model by keeping full matrices in the quadratic term (where the 
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constrained and unconstrained entries of the A^-matrices do not mix) and only 
matrices satisfying the constraint in the quartic term. The constraint can be put 
in effect by first expanding in the exponential of the quartic factor, and then 
replacing each occurrence of a constrained independently, by T^^l(A^) with 
an unconstrained A^, and averaging each of the many factors over the group 
Z(L)^. Only the parts of A^ that are invariant, making up the matrix A^ constr? 
survive this group averaging, while the other entries are set to zero. We now leave 
this group averaging to the end of the calculation, and, instead, do the Gaussian 
integrals first, producing Feynman diagrams. The propagators are now given by 

< >= ( 6 ) 

Tl*l and appear only in this propagator. Moreover, as we see, there is only 
dependence on [t — s]. Hence, the independent averages over [t] and [s] can be 
replaced by a single average over the combination [^ — s]. It is only this extra 
group averaging procedure that distinguishes the two models. 

The sum over the . . . indices produces a factor of N for each index loop 
in the double line notation for propagators in the unconstrained model. In the 
extended model, because of the constraints, these index sums produce a trace of 
the product of the various factors encountered round the index loop. The 
matrices also form a group, Z{LY, but this group is distinct from the Z{LY made 
out of the operations. Thus, for every index loop we need to calculate the 
trace of a g^^^ that is the parallel transporter round the loop. 

From their definition we see that the obey: 

( 7 ) 

where the delta function forces the group element to be unity. This has the im- 
portant consequence that the phase factors that appear in the definition of the 
operations cancel out any time the contribution from the index loops is non- 
vanishing. (Note that the phase factors are associated with propagators, not index 
lines.) Thus, we can ignore the phase factors and take the propagators as given by 

<TW(A^)„TW(4)M>-5'r'5l7’ (8) 

Group averaging amounts to averaging over all values of the fy] labels. 

More generally, this induces us to think about a situation in which we assign 
to each propagator in the planar diagram a group element g ^ G where G is some 
finite group, g is represented by dun N x N matrix Dij{g) which replaces one of the 
two index lines making up the propagator. Before the “decoration” , this index line 
was a Kronecker delta. The other index line of the same propagator is decorated 
by Dki{g~^)‘ The group element associated with each propagator is independently 
averaged over all of G and the decoration introduces a weighting factor given by 
the product over all index loops (faces) of traces of products of D(g) matrices 
round the loop. The summations over G are now easy to perform - a standard 
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exercise in lattice strong coupling expansion, this time defined on a random two 
dimensional lattice [22]. 

For any h e G, trD{h) is a class function and hence expandable as: 

trD{h) = '^^CrdrXr(h) Cr = ^ Xr{h~^)^trD{h) (9) 

r heG ^ 

The averaging over the p’s associated to the propagators produce a total factor 
given by 

'^44 (10) 

r r 

where / is the number of faces and dr the dimension of the r-irreducible represen- 
tation of G. In the above equation we make use of the planarity of the diagram. 
If e is the number of edges in the diagram and v the number of vertices, we have: 

/ — e + u==2 e = 2v f = 2 v (11) 

In order to be able to absorb the decoration factor into a rescaling of b' we need 
Cr — K for all r. From (9), X = where d(G) is the number of elements in G. 
d{r) is the dimensionality of the irreducible representation r, and d(G) = 

The contribution of the diagram is ^ integer and we see 

that the representation of G furnished by D{h) contains K copies of the regular 
representation. This is a consequence of the requirement to have an overall b'^ 
factor. It is easy to check that in our case, where G = Z{L)^ (with elements 
this condition on the representation holds. 

Turning now to our special case, we set d{G) = N' = and b' = b. 
The weight of the above diagram becomes From the definition of the 

free energy of the reduced and extended model we now conclude that 

F'(b) = ^ log(Z^'Dw(fe)) = m (12) 

The equality of the free energies is the main result of reduction. Relations between 
other correlation functions can be obtained by adding more terms to the action 
and taking derivatives with respect to the new, infinitesimal, couplings. The above 
result was obtained a long time ago [21] by direct inspection of the series and by 
mapping the Schwinger-Dyson equations of single-trace operators from one model 
to the other. 

In its extended version, the Weingarten model will turn into a lattice gauge 
theory if we constrain the A^{x) matrices to be simple- unitary and change the 
integration measure to the SU{N) group Haar measure. The relation between the 
reduced and extended versions holds even after the unitarity constraint has been 
imposed on both models. This was proven a long time ago, first by the Schwinger- 
Dyson equation method [16] and, subsequently, by inspection of lattice strong 
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coupling diagrams [23] and also using stochastic quantization [24]. A proof by 
field orbifolding would require a generalization of the analysis above, but since the 
result is known to hold, there is little doubt that such a generalization exists. Field 
orbifolding admits generalizations involving nonabelian global symmetries, and it 
would be interesting to see whether previously unknown results in planar lattice 
gauge theory could be obtained from this approach. 

3 Prototype model and its extended variants 

The unconstrained, reduced, gauge model consists of 4 matrices G SU{Nc), 

4 unitary matrices e SU{Nf) and two rectangular Grassmann matrices #, 4^, 
where one index is in the fundamental representation of SU(Nc) (color), and the 
other index in the fundamental representation of SU{Nf) (flavor). The fermion 
matrices also have a Dirac index, which we often suppress. The model is defined 
by: 



5 = ^ +/3VJ2 c;^c;t + Tr [^D{U, y) {^)] 

c;, = [u^,u,], c;, = [v^,v,] (i3) 

The linear operator D(U,V) is a reduced version of a lattice Dirac operator, to be 
introduced later on. 

The constrained associated model lives on an lattice [25]. The constraints 
mean that the matrices V^, 4^, ^ are expressible in terms of smaller matrices 
u^(x), t;^(a:), '0(x), '0(rr). The dimensions are instead of Nc and Uf instead of 
Nf where Ucj = Ncj/L^. (ricj, Ncj, L are all integers.) 

~ [^ix{x)]ijSy^x+y^ l^À{ax)Xby) ~ b^fi{x)]ab^y,x-\-y, 

[^](zx),(ay) ~ [^{^)]ia^xy^ [^] (aj/),(ia:) “ [^{^)]ai^yx (14) 

For small lattice gauge coupling the constrained and unconstrained models will be 
equivalent in the limit Ncj oo with ^ held fixed and a simple D{U, V). But, 
for larger coupling the equivalence breaks down and we shall adopt here one of the 
proposals for keeping the equivalence up even at large lattice couplings, namely 
quenching [17]. 

4 Quenching 

Quenching amounts to freezing the eigenvalues of to sets that are uniformly 

distributed round the circle. Other than that the unitary matrices are free. One 
also takes the rescaled coupling, to infinity so as to freeze out the flavor gauge 
group [26]. The eigenvalues play the role of lattice momenta in boson propagator 
lines, while the eigenvalues enter only into the fermionic lines. The operator 
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D{U,V) breaks up into N f blocks of size ANc x 4Nc each. This is the crucial 
point of our approach: The numerical price of handling fermions has a chance to 
simplify tremendously if indeed Nc does not need to be anywhere nearly as big as 
the parameter ricL^ typical to numerical QCD simulations and, nevertheless, the 
planar limit stays well approximated. 

5 Overlap fermions 

We now define D for one of the fiavor blocks. So long we are focusing only on 
fermions we can absorb the fiavor phase into the color matrices (whose eigen- 
values are quenched as before). Define the unitary operators X^: 

X,= '^U,+ ^-^Ul (15) 

The Wilson-Dirac matrix Dw is a given by 

Dw =m + Y,i^- (16) 

We choose the parameter m to be about —1 (it must be between —2 and 0). It is 
easy to see that Hw = is hermitian. 

The overlap construction says that the chiral determinant line bundle is rep- 
resented on the lattice by the inner product of two states, (+|— ). Here (+| is the 
eigen-bra of a system of noninteracting second quantized fermions in a single body 
potential given by Hw with the mass sign switched, while |— ) is the eigen- ket of 
a system of noninteracting second quantized fermions in a single body potential 
given by Hw with the mass chosen as above. One can change the lattice definition 
without affecting the continuum limit so that the mass in Hw is taken to be -hoc 
for the (+|-state. In that case the Hw for the (+|-state can be replaced by 75 [2]. 

The overlap (+|— ) can be written in terms of single particle wave- functions 
as the determinant of a matrix A defined below [2, 27]. Choose the chiral repre- 
sentation for the 7^ matrices, so that 



75 = 





(17) 



Let $ be a simple unitary matrix of eigenvectors of Hw , ordered so that 



sign(ü'vi/)$ "^(^O-l)’ ^ (c£>) 



(18) 



Then 



(+1 — ) — det A 



(19) 
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For a vector-like theory we need | det(A)p, which, unlike det A, is a function, 
not a section of a bundle. Simple linear algebra tells us that 



ldet(yl)|2 =det(>l)det(n) 



( 20 ) 



However, 



Hence, 



75sign(iTv^)$ = 



A -B \ 

-CD) 



det 



'1 + 75sign(//'w/) 



$ 



= det 



1 +75sign(/fn/)' 



K+l-)t 



( 21 ) 



( 22 ) 



We discover that chiral symmetry can be preserved on the lattice if we use the 
overlap Dirac operator [2]; 



^^^l + 75 | g n ff w ( 23 ) 

The operator D is constructed out of with the addition of a small mass 
term for the quarks. 

6 Topology and Anomalies 

Anomalies are expressed by particular Berry-phase factors innate to the |— ) state 
of the overlap [28]. They obstruct smooth gauge invariant sections in cases where 
anomalies do not cancel. 

Topology is expressed by Hw having unequal numbers of positive and nega- 
tive eigenstates (above, where A is taken as a square matrix, we tacitly assumed 
these two numbers to be equal but this need not be so, in which case what we called 
“det A” is identically zero). One can easily exhibit explicit sets of ^7^ matrices for 
which there is non-zero topology. 

Problems arise if Hw has an eigenvalue that is exactly zero. But this will not 
happen as the continuum limit is approached because one can prove the following 
inequality [29]: 



\\[U„ t/.]|| < e ^ > 1 - |1 + m| - 5(e) (24) 

where J(e) goes to zero as e goes to zero. 

At infinite Nc^ for large enough lattice couplings and a standard lattice action, 
the commutators indeed become bounded [9], as required. 
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7 Some history 

The first treatment of fermions in reduced models was carried out by Gross and 
Kitazawa [30] who invented the momentum “force-feeding” prescription that was 
derived above as a remnant of the frozen-out flavor gauge sector [26]. These authors 
also invented a version of reduction that used hermit ian matrices, rather than 

the unitary link matrices one has on a lattice [30] . The Gross-Kitazawa model can 
be obtained from the lattice model by approximating ^ 1 lA^. However, as 
also noted by Gross and Kitazawa, this approximation does away with anomalies 
and topology. For many years we did not known how to cure this problem on 
the lattice, where the link matrices are unitary. The problem was the same as the 
notorious problem of lattice chirality. As we have seen, recent progress on the chiral 
fermion problem [2] makes it now possible to preserve all the necessary fermionic 
properties also in reduced lattice models removing the main obstacle of principle 
to making nontrivial use of reduction in numerical simulations. 
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Aspects of String Motion 
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Abstract. We review various general issues involved in the study of the motion of 
strings in time- dependent backgrounds. We discuss the interplay between the Gen- 
eral Relativity and string points of view. Results and open problems are highlighted. 
The motion in one particular background is studied in some detail. 



1 Introduction 

The properties of supersymmetry on the worldsheet and in space-time play an 
important role in circumventing pitfalls in string theory. Never the less our universe 
is not explicitly supersymmetric. Our universe is also time-dependent. Employing 
the arsenal of familiar methods to treat strings in time- independent backgrounds 
in the cases of time-dependent ones is not straightforward. These are several of 
the important problems that string theorists face with some difficulty. 

In this lectures I will discuss some issues concerning the motion of strings 
in time- dependent backgrounds. The lectures consist of a general review of the 
subject and of a somewhat detailed exposition of a particular case of a string 
propagating in a certain time-dependent background. String theory can be used 
reliably to calculate scattering processes involving gravity only for low energies, 
E, in particular E I/Qs, where Qs is the string coupling. If E <C I /Is as well, 
where Ig is the string length scale, string theory is validated by reproducing the 
known results from General Relativity (GR). When one studies problems which 
are unresolved in GR, such as the propagation through space-like singularities, one 
would actually wish that the string theory analysis will deviate, in a subtle but 
significant manner, from that of GR. In this way string theory will fulfill its duty 
and resolve outstanding problems in GR while adhering to the correspondence 
principle. 

Is such a behavior possible at all? This actually is the case for the propagation 
of strings in the presence of some time-like singularities. Perturbative and non- 
perturbative effects do modify in a subtle way the behavior obtained in a GR 
framework. A story in the Talmud helps exemplify the manner in which extended 
objects modify point particle problems. During a seminar the following question 
came up: Assume one finds a pigeon somewhere, to whom does it belong? The 
rule is, finders- keepers, as long as the pigeon is found further away than some 
determined distance from the entrance to an owned pigeon-hole. This seems rather 




S160 



E. Rabinovici Ann. Henri Poincaré 



well defined for point pigeons. A student raised his hand and asked: “what if the 
pigeon exhibits its extended object nature? that is what if one of its legs is nearer 
than the prescribed distance to the pigeon-hole but the other is further away?” 
The student was actually ejected from the Yeshiva for asking this question. 

We give a short review on some of the recent work on strings propagating 
in time-dependent backgrounds. The talk lecture has the following structure. It 
starts by reviewing various problems rising in a GR study of compact cosmology. 
This follows by suggesting a stringy point of view on several of these problems. 
Exact stringy time-dependent backgrounds are discussed. Special attention is given 
to the algebraic, geometrical and dynamical aspects of coset constructions. The 
results obtained allow a new point of view on the problems and are reviewed. 
The difficulties encountered and some of the open problems are discussed. These 
include a study of singularities; some of the studies in these proceedings are new 
(presented in section 7). 



2 Some Questions regarding Cosmologies in General Relativity 

We start by discussing some of the generic problems encountered by studying a 
universe, compact in space, using GR methods. 

• Constructing Time-Dependent Solutions: 

It is not immediate to construct time-dependent solutions of the equations of 
GR that contain space-like singularities. We will describe how such solutions 
can be generated. We will mainly focus on backgrounds which are in addition 
exact string backgrounds. These will be coset models and orbifolds. 

• Observables: 

One needs to describe what are the appropriate observables to be measured 
in the system. Moreover the universe could even be so small as to disallow 
the installation of a classical measuring device necessary for a quantum mea- 
surement. 

• Cauchy data/ Singularities: 

In GR one studies the evolution of the system as a function of Cauchy data. 
In the presence of singularities it may be unreliable to impose the data and/or 
the boundary conditions at the singularity itself. One would rather impose 
boundary conditions in regions of space in which the couplings and curvature 
are small. In such regions the semi-classical picture may be a useful guide. 
Where are such regions? Given appropriate boundary conditions, it is instruc- 
tive to follow the scattering of a probe from the space-like singularity and to 
calculate the amount reflected and transmitted. One would like to know if the 
question has a precise meaning. Perhaps all the information can be encoded 
in the region up to the singularity. For black holes there are claims that it is 
enough to consider the space-time up to the horizon of the black hole [1]. 
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• Entropy [2]: 

In the absence of a global time-like Killing vector it is not at all clear how 
to define states in GR and how to count them. In de-Sitter space, arguments 
have been presented that the total “number of states” is actually finite [3]. 
This is done by analogy to the classical entropy in a black hole. Is there a 
stringy microscopic estimate of that entropy? 

• (In)Stability of the Cosmology [4, 5]: 

In GR it is known that there are circumstances under which the presence of 
a speck of dust can totally disrupt the global geometry. Does string theory 
have anything to add to this? 

3 Stringy Attempts to Address the Questions 

• Constructing Time-Dependent Solutions: 

Time- dependent backgrounds containing space- like singularities have been 
constructed and studied using several methods. 

1. Exact String Solutions [6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18]: 
Exact conformal field theories which contain space-like singularities 
have been constructed. The advantage of such constructions is that 
they are exact as far as a' corrections are concerned. This allows for 
example the exact calculation of vertex operators using algebraic meth- 
ods without the need to resort to a classical geometrical picture. This 
was done for a class of coset models G/H x M where M is an attendant 
manifold to a time-dependent four dimensional cosmology. This was 
also done for a class of orbifolds of Minkowski space. The results are 
mainly for the tree approximation as far as string perturbation theory 
is concerned. We review in some detail the results of such constructions. 

2. S-Branes[19]: 

These are proposed solutions which are supposed to be in some sense 
Dirichlet branes with a fixed coordinate in the time direction. In a 
manner the idea that such objects may indeed be exact solutions are 
inspired by the presence of the Euclidean instantons. We will not discuss 
them in this lecture. 

3. Holography Inspired Relations [20]: 

A non-perturbative description of the properties of strings in time- 
dependent backgrounds is desirable. Such a description was made pos- 
sible in those systems in which the property of Holography was es- 
sentially shown to exist. If one assumes that topology is valid also in 
some time-dependent background settings, one may wish to claim that 
this holographic description is obtained by simply making the world- 
volume of the holographic dual theory be the appropriate boundary 
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time-dependent background. The non-perturbative description of the 
string theory being essentially a boundary field theory on an appropri- 
ate time- dependent background. The analysis of this field theory is yet 
to be done. 

4. Solutions Obtained by Double Wick Rotating GR Solutions which Con- 
tain Time-Like Singularities [21]: 

These are solutions to Einstein’s equations but are not exact string 
backgrounds. They are useful to study phenomena such as particle cre- 
ation which occurs in time-dependent backgrounds as well as to capture 
important features of the stability of the system. 

5. RG Flow Induced by Unstable Configurations [22]: 

Tachyonic decay in the open string sector of D-branes and in some closed 
string settings can also be formulated as a time-dependent background. 
It has been suggested that this has implications in realistic cosmologies. 
It is a useful setup for studying time independent backgrounds. These 
important studies are not reviewed here. 

• Observables: 

One needs to describe what are the appropriate observables to be measured 
in the system. In the string theory these may be the BRST-invariant opera- 
tors. Moreover in, string theory the S matrix is the observable. In a compact 
universe it is not clear where to place the asymptotic scattering states. This is 
resolved in a class of models in a rather surprising manner. Some exact string 
backgrounds turn out to contain in addition to compact time-dependent cos- 
mologies also static regions which extend all the way to spacial infinity and 
are called “whiskers” (see figure 1)L 




Figure 1: a: A compact cosmology (C) in GR. b: The same compact cosmology 
repeats itself in string theory and comes with additional regions - whiskers (W) . 



^In section 7 we shall discuss instead examples where the static regions are compact and 
singular. 
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Moreover, the system is weakly coupled at the boundary of these regions. The 
accelerators and detectors can thus be placed there to produce and detect 
scattering states. There is a possible fly in the ointment; these regions contain 
closed time-like curves and may contain in some cases time-like singularities 
as well. 

• Cauchy data/Singularities: 

Once a time-independent weakly coupled region exists it is natural to use it 
to enforce boundary conditions and to give the Cauchy data. The behavior 
of the system at the singularity is determined by the behavior at these better 
understood regions. In fact, the reflection coefficient of a given partial wave 
vertex operator can be calculated and is found to be smaller than one [6]. 
The scattering can be shown to be unitary. For the orbifold backgrounds 
also the scattering of two vertex operators has been calculated [16, 17]. The 
results indicated problems; these are discussed later. 

• Entropy: 

In string theory the entropy can be estimated by counting the BRST invari- 
ant states. In a class of models the number is significantly depleted relative 
to other non-cosmological systems [6] . 

• (In) Stability of the Cosmology [23]: 

There are at least two types of stability problems in such systems: short 
and long time scale problems. Consider first the short time scale problem. 
In the presence of a space-like singularity, classically the energy density in- 
creases without bounds as the projectile approaches the singularity which is 
concentrated at zero volume; this would be suggested even if the energy is 
conserved. If it is blue shifted as well as one approaches the singularities the 
situation would only be worsened. In the orbifold models this was pointed 
out from several points of view which had the common feature of closed 
time-like curves and space-like singularities. A prototype of such a universe 
is the Misner Universe. Its metric is given by 

ds^ = —dt^ + t^dx\ + dx 2 + dx\ (1) 

with the identification: 

{t,X\,X2,x:i) = {t,xi +na,X2,X3) (2) 

After a change of coordinates: 



yo = tsinh(xi) 
yi = tcosh(xi) 
2/2 = X2 
2/3 = 



( 3 ) 
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Figure 2: The extended Misner universe; a fundamental domain is marked. 
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where a is a system dependent constant. We will discuss more aspects of 
this singularity. As mentioned, the system is also threatened by a long time 
scale instability. This may occur for systems with a discrete spectrum and 
a finite number of “states” or at least a finite thermal entropy. Compact 
cosmologies may be systems of this nature. For the AdS case the instability 
starts when the contribution of the eternal black hole masterfield, embed- 
ded in AdS, is superceded by that of non-dominant thermal AdS master- 
field [24]. Recall that these two masterfields have different topologies [25]. 
These instabilities were indicated from GR arguments. On the other hand, in 
time-independent string backgrounds we are familiar with two types of insta- 
bilities: perturbative ones - tachyons - which face no barrier in their decay, 
and non- perturbative instabilities (such as the formation of small black holes 
from flat Minkowski space at finite temperature) which do need to overcome 
a barrier. 



4 One Example of an Exact Time-Dependent Background 



Two types of exact time-dependent backgrounds containing space-like singularities 
have been considered. Both have bosonic and fermionic versions with the respective 
central charge c = 26 and c = 15. We discuss here the G jH cosets. In particular 
the coset [6, 15]: 

5L(2,IR)fcx5t/(2)fe 

U{1) X U{1) ^ > 

In this case an identification is done on the group elements of G by an operation 
with group elements of H in the following manner: 

G/H : , {g,g') e SL{2) x SU{2) (9) 



As the original motivation is to construct a compact manifold with space-like 
singularities one is led to examine the semi-classical geometry induced by the 
algebraic coset. This geometry includes time-dependent cosmological regions and 
whiskers, as mentioned. In the cosmological regions the metric, dilaton and B field 
are [6]: 



\ds'^ = -del + d0'^ + 
k 



cot^ 6' 



1 -h tan^ 0i cot^ 



dXi -h 



^1 ,.2 
1 + tan^ $1 cot^ 0 ' 



( 10 ) 



B 



A+,A- — 



1 -h tan^ 6i cot^ 6' 



= $0 - - log(cos^ 0i sin^ 6' -h sin^ 0\ cos^ O') 



(11) 



(12) 
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where X± E [0,27t), and 0i and 0' vary in the interval [0, |]. In the whiskers:^ 



yds‘^ — dOl 4 - d0'‘^ + 



cot^ O' 



1 - tanh^^Q cot^ 6 ' 



dXi - 



tanh^^o 



1 — tanh ^2 cot^ 0' 



-dXt 



^A+,A- 
1 



1 — tanh^^2 cot^ 0 ' 



= $0 — 7 log(cosh ^^2 sin^ O' — sinh ^^2 cos^ O'Ÿ 



(13) 

(14) 

(15) 



where here O 2 E [0,oo), O' E [0, f], X± E [0,27t). 

The geometry is somewhat complex, and follows a list of features which play 
a role in addressing the questions posed in the beginning of the talk: 



• Time-Dependent Comological Regions: 

The geometry 10 indeed describes a compact cosmological region, it is de- 
noted by C in figure 1. In fact, the exact string background contains an 
infinite number of such C regions, (for a partial list of earlier works see [28]). 
Each C region can be viewed as starting from a mild big bang in which only 
one of its spacial coordinate directions degenerates, and ends with a mild 
big crunch where, again, the spacial volume vanishes along a different di- 
rection. The manifold thus is not isotropic. The maximal spacial volume of 
the cosmological region increases with the algebraic level, /c, of the affine Lie 
algebra. The source of the infinite sequence of bangs and crunches is that in 
order to avoid having ab-initio closed time- like curves (CTC) in the manifold, 
one considers the universal cover of iS'L(2,lR) and not just the group itself. 
One may pose the set of questions referring to these compact regions. The 
surprise is the appearance of additional regions: the whisjers, denoted by W 
in figure 1. These regions extend all the way to different spacial infinities. 
Their emergence sheds a new light on the set of questions. 

• The Whiskers Regions and the S Matrix Problem [6]: 

The W regions have infinite extent and moreover their metric is time inde- 
pendent. String theory has thus provided a place to build accelerators so one 
can measure the S matrix. The common wisdom claims one should be able to 
do it in string theory and indeed one can. The whiskers also contain time- like 
submanifolds which are singular and CTCs. Their significance needs to be 
studied. 



^The geometric data obtained is valid in the large k limit. For the bosonic string there are 
known 1/k corrections [14, 29, 27]. The exact background sometimes has a different singularity 
structure. For fermionic strings, the semiclassical background is expected to be a solution to all 
orders in 1/k. The dilaton <ï> is normalized such that the string coupling is Qs = . 
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• Observables and Entropy [6] : 

The BRST exact operators are natural candidates to be the observables. 
Their number could be defined as the entropy of the system. The analysis 
shows that the number of states is infinitely depleted relative to the number 
of states in the AdS^ case. In the latter case the ten-dimensional string 
theory can be represented by a two-dimensional dual conformal field theory. 
The case at hand contains even fewer states [6]. Examples of observables 
that were calculated are the vertex operators in the theory. One of them, U, 
corresponding to a (5 function normalizable operator, is given by [6, 29]: 

V = UDf (16) 



where D is the SU (2) part of the vertex, / is the internal M part, and U is 
given by: 



U (m, m'; j, e; g) = K++ - 



sin(7r(j + im')) 
sin(7r(j + zm)) 



(17) 



where m,m',j are the usual SL{2) numbers related to the energy and mo- 
menta in target space (see below), and K are defined in [30]. What is 

important here is that U has the following asymptotic form on the boundary 
of the whiskers: 



U{E±-,r,02 oo) -h iî(j; m, 

(18) 

where 

E± = -{m±m'), j = --+ips (19) 

Prom this form one can extract the reflection coefficient of each partial wave. 
Its square is given by: 



1 > I ni2 _ cosh(27rÆ+) -k cosh(27r(p, - E-)) 

- ' ' cosh(27r£;+) -I- cosh (27r(p, -|- E.)) 

The theory is unitary. This results from the fact that string theory on 
SL{2, IR) is unitary and has similar lower point functions. The interpretation 
of that part of the vertex operator as a reflection coefficient was possible af- 
ter choosing the appropriate boundary conditions. The system is defined far 
away from the singularities in a weak coupling regime as one would indeed 
prefer. 



• Boundary Conditions and Possible Crossing of the Singularities: 

Once the boundary conditions are set in a weakly coupled region, the vertex 
operator is fixed everywhere. This includes the regions following the par- 
ticular big bang and the following big crunches and big bangs. A projectile 
starting in a definite partial wave in one of the whiskers ends up also in other 
whiskers. Again, unitarity of string theory on SL{2^ IR) is responsible for the 
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unitarity when summed over all the boundaries of the whiskers. One resolu- 
tion of the problem of crossing the singularity would be that the singularity 
was there only semi-classically. Had the singularities had no algebraic origin 
that would be likely. However, one can associate some of the singularities 
which are manifest in the geometrical formulation with fixed points of the 
algebraic identifications implied in constructing the coset. Other are not and 
can be removed as will be discussed later. 

• Holographic Aspects: 

At this stage one would still require that holography be derived for each 
case and not be added to the axioms of a theory of gravity. There is thus no 
guarantee that this background has a holographic description, nevertheless 
one can observe several primodial seeds of this feature. The universe seems 
to be divided in cosmologies and whiskers. Holographic screens, if they have 
a meaning, are likely to be set at the asymptotic boundaries of the whiskers. 
“We” on the other hand are presumably situated in the cosmological parts. 
From the point of view of the bulk theory there are several types of represen- 
tations of *S'L(2, IR) which play a role in the mapping between the boundary 
and the bulk theories. The discrete representations of SL{2,JR) reflect nor- 
malizable states which live near the cosmological region, the information 
about them may be encoded in the non-normalizable operators from that se- 
ries which extend to the boundary. The principal continuous representations 
give more global information about the physics in the bulk and are related on 
the boundary to the Ô function normalizable operators. The system seems to 
have an infinite number of screens. Thus one may compose the dual theory 
out of an infinite entangled Hilbert spaces, the wave function being a product 
of an infinite number of correlated states (a general claim on this issue was 
made in [24]). Recently, in the BTZ black hole case a reduction in the num- 
ber of effective boundaries was indicated [31]. This could be however related 
to special scattering properties of the black hole. As mentioned strings prop- 
agating on AdSs times an attendant manifold do have duals [32] . One may 
expect that the same is true after gauging and appropriate compensation 
of the central charge. Such a non-perturbative definition could be of value, 
and studies of the simpler BTZ black hole case were done. To summarize 
the positive results, the study has allowed the explicit calculation of some 
interesting properties of strings in the presence of cosmology and has offered 
some surprising ways to resolve long standing problems in GR. There are 
however also many open issues. 

• Back Reaction and Other Open Issues: 

As described earlier the cosmological background is exactly of such a com- 
plicated nature that it could be destabilized by a speck of dust. The worry 
that string perturbation theory would invalidate the supposedly exact calcu- 
lation at tree level arises because the various exact solutions have CTC and 
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less generic closed null-like curves, in addition to the space-like singularities. 
We have reviewed the field theory calculation pointing at a singular energy 
density at the singularity. The hope was that a string probe smears and thus 
heals the singularity. The analysis of the lower point functions gives indeed 
finite answers. This is not the case however for the four point function in one 
of the orbifold cases; it diverges [16]. The source of that divergence is that the 
Hamiltonian of the system does not commute with the projection operator 
of the orbifold construction. As all states must be singlets of that projection, 
the allowed states can not be eigen-states of energy. There is no way to con- 
struct a solely low energy scattering probe. The state needs to contain higher 
energy components. The question becomes, what is the weight of these high 
energy components? For the original case it was found that in fact all com- 
ponents had equal weight and thus the average energy of the projectile was 
infinite, leading to a breakdown of string perturbation theory. In other cases 
a control parameter was found that allowed to obtain non-singular scatter- 
ing [7, 17, 26]. For the singularity in the coset the four point function has 
not yet been calculated exactly. On top of that, non-perturbative effects, 
such as black hole formation in a region where GR calculations are reliable 
also threaten the validity of perturbation theory [33] . Preliminary results for 
the BTZ black hole case did not detect a singularity [31]. The coset case 
has actually many similarities to the BTZ case [7]. However, these results 
are not yet conclusive and the issue remains open. The various theories can 
be studied in a world-volume supersysmmetic version. Some orbifolds can 
be arranged to be supersymmetric also in target space. As mentioned, the 
importance of this is not yet clear. Also long strings and various thermo- 
dynamical aspects have not yet been studied. There has been some initial 
work on D-brane probes. The branes probe shorter distances and may offer 
an additional view on how strings experience the singularities. An uninvited 
feature emerged both in the compact cosmological backgrounds and several 
of the orbifold constructions. The systems have regions which contain CTCs. 
The first negative associations with such curves is that they allow the viola- 
tion of causality. In the gauged models the origin of the CTCs is the compact 
SU (2) manifold. Gauge invariance does however select only vertex operators 
which are single valued. The history they represent repeating itself. In fact 
the depletion of the number of states has the same origin and is welcome. 
One may suggest that the two are correlated. The reduction of degrees of 
freedom going hand in hand with those few states for which causality, as well 
as other properties, remain intact. In GR CTCs also lead to accumulation 
of very large energies. In some cases it clearly invalidates perturbation the- 
ory taking away the value of supposedly exact solutions. One is thus led to 
analyze the singularity structure in even more detail. One considers control 
parameters to study the singularities one of them is the radius, i?, of an 
extra fifth dimensions (Such extra parameters can be obtained, for instance, 
by 0(d,d) rotations as in [7, 13, 34, 35] (for a review, see [36])). 
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5 Summary 

Most of the points deserving to be in the summary have already been mentioned 
in the first part of the lectures. The physics involved is very important and very 
challenging, the quality of the difficulties which remain to be overcome befits the 
subject. 
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Abstract. The interpretation of D-branes in terms of open strings has lead to much 
interest in boundary conditions of two-dimensional conformal field theories (CFTs). 
These studies have deepened our understanding of CFT and allowed us to develop 
new computational tools. The construction of CFT correlators based on combining 
tools from topological field theory and non-commutative algebra in tensor cate- 
gories, which we summarize in this contribution, allows e.g. to discuss, apart from 
boundary conditions, also defect lines and disorder fields. 



1 Basic CFT facts 

In the applications of two-dimensional conformal field theory - two-dimensional 
critical systems, quasi one-dimensional condensed matter physics, and string the- 
ory - boundary effects play an important role. They appear in the description of 
pointlike defects such as the Kondo effect, the computation of percolation proba- 
bilities, the construction of type I superstring theories, and the formulation of open 
string perturbation theory in the background of D-branes. The study of boundary 
conditions has very much improved our understanding of the structure of CFT; this 
is one topic we address in the present contribution. But two-dimensional conformal 
field theories possess an extremely rich structure, which is in fact not exhausted by 
bulk and boundary fields. In particular, they also involve defect lines and disorder 
fields; these objects will be discussed as well. Hitherto, they have not yet played a 
role in world sheet theories of strings. 

Let us briefly review a few pertinent aspects of CFT. In the bulk there are 
both left moving and right moving degrees of freedom, which are described by a left 
and a right moving chiral algebra, respectively. Each of them contains the Virasoro 
algebra, which realizes the conformal symmetry, given by generators {Tn | n G Z} 
with relations 



[-^n 5 TTl^ ^ ^n^ — m • 

It is natural to study the representations of the chiral algebra; they pro- 
vide the superselection sectors of the chiral CFT. Primary fields correspond to 
irreducible representations; if their number is finite, the theory is called rational. 
With the help of the representation theory one can introduce conformal blocks 
which are spaces of multivalued functions of insertion points on a complex curve. 
For WZW theories, they can be computed as (subspaces of) the spaces of so- 
lutions to Knizhnik-Zamolodchikov equations. For the full local conformal field 
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theory they play the role of pre correlators; they also form the state spaces of the 
associated three-dimensional topological field theory. The monodromy properties 
of these conformal blocks provide us with a collection of data - fusing matrices, 
braiding matrices, conformal weights, representations of mapping class groups - 
that encode the basic information about the chiral CFT; they are usually referred 
to as chiral data, or also as Moore-Seiberg data. 

A good handle on these data is an essential prerequisite for any powerful 
approach to CFT. Technically, it is afforded by the so-called modular tensor cat- 
egories. They furnish a basis-independent formalization of the chiral data, which 
are obtained from solutions to differential equations, in much the same way as 
the notion of a vector space constitutes a bcisis-independent description for the 
structure of the space of solutions to linear equations. 

The representations of a chiral algebra form a category; its objects are repre- 
sentations, its morphisms are intertwining maps between representations. Here the 
term ‘chiral algebra’ should be taken with some care - we do not necessarily start 
with a maximally extended chiral algebra, for left movers or for right movers. This 
gives us sufficient fiexibility to address symmetry breaking boundary conditions 
as well as situations in which left moving and right moving chiral algebra do not 
coincide. 

The representation category of a chiral algebra comes with a lot of structure: 



• The notion of a tensor product, or fusion, of representations leads to a tensor 
product 

0 : C xC 

for the category. For this tensor product, the vacuum sector acts as a unit. 



• Braid group statistics in low dimensions provides us with the notion of a 
braiding, which constitutes a systematic prescription to permute representa- 
tions in tensor products that is compatible with the braid group relations. 

• The concept of dual representations gives rise to a pairing of representations. 



These structures obey a number of compatibility relations, and these are summa- 
rized in the definition of a modular tensor category. The qualification ‘modular’ 
indicates the fact that from such a category one can infer a natural candidate for 
the action of the modular group, i.e. the mapping class group of the torus, on the 
characters of the CFT. Modular tensor categories also arise as (truncations of) cat- 
egories of representations of quantum groups, and as categories of representations 
of weak Hopf algebras. It is an important result [16] that to every modular tensor 
category one can associate a topological field theory in three dimensions. This re- 
alizes a “holographic” correspondence between chiral CFT in two dimensions and 
TFT in three dimensions. 
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2 Algebra and topological field theory 

The central idea of our approach to (rational) conformal field theory is to com- 
bine tools from topological field theory with non-commutative algebra in tensor 
categories. 

The central insight can be summarized as follows: 

For a given chiral algebra, a full local CFT can be constructed from the 
algebra of open string states for a single boundary condition (= D-brane). 



To understand this result, let us examine the algebra of open string states, i.e. the 
algebra of boundary operators that respect a given boundary condition A, in 
more detail: 

• The field-state correspondence allows us to work with the corresponding 
states. They form a module over the chiral algebra, i.e. a (generically re- 
ducible) object in the modular tensor category C; we use the symbol A also 
to denote this object. 

• The operator product 

C 

of boundary fields is associative. This leads to an equation of the type 

cc = Eccf. 



It is a first crucial observation that this relation gives rise to an associative 
multiplication morphism m G Hom(A(8)A, A) on the object A. The presence 
of the fusing matrix F in the relation for the structure constants C is taken 
into account by a non-trivial notion of associativity in the category C. 

• Another important aspect of the algebra object A is the fact that the non- 
degeneracy of the two-point functions of boundary fields on the disk implies 
that A can be endowed with the structure of a Frobenius algebra, i.e. an 
algebra with a non-degenerate invariant bilinear form. (For precise defini- 
tions, see [9]). Actually, A is even a so-called symmetric special Frobenius 
algebra. It should be appreciated, though, that A is not necessarily (braided- 
) commutative, as befits an algebra of boundary fields. Moreover, the bound- 
ary condition we start with need not be elementary; it may equally well be a 
superposition of several elementary boundary conditions, including possibly 
Chan-Paton multiplicities. 






Finally, for any other boundary condition M the associativity of the operator 
product 

^AA ^AA ÿ^AM 
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of boundary fields allows us to abstract a map p G M) that en- 

dows the corresponding object M of C with the structure of a (left) A-module. 
We thus learn that boundary conditions are given by A- modules. 

Let us now discuss the second input of our construction, topological field 
theory. As a direct consequence of the axioms, every modular tensor category 
allows for the construction of a three-dimensional TFT. The latter assigns to every 
closed oriented two-manifold È a finite-dimensional vector space 7Y(Ê), the space 
of conformal blocks, on which the mapping class group Map{T>) acts projectively. 
Further, to every three-manifold with boundary Ê that contains a Wilson graph, 
the TFT assigns a vector in HÇÈ). The axioms of a TFT formalize the well-known 
relation between a TFT on a three-dimensional manifold and a chiral CFT on its 
boundary, a structure that is e.g. central to the CFT description of universality 
classes of quantum Hall fiuids (for a review, see e.g. [7]). 

3 The TFT construction of CFT correlators 

Our goal is now to determine CFT correlators on a surface E that possibly has a 
boundary. To be able to apply tools from TFT, we need to work with a suitable 
two-manifold E without boundary. A natural candidate for E is the complex double 
[3] of E. It comes with an orientation reversing involution a such that E is identified 
with the quotient of E by the action of a. Indeed, it is natural [15] to view the world 
sheet E as a real scheme and E as its complexification; the orientifold map a is 
then just the action of the non-trivial element of the Galois group (7a/ (C/M) = Z 2 . 

With the help of the double cover E, we can in particular give a concise 
version of the principle of holomorphic factorization: The correlators of the CFT 
on E are specific vectors in the space 77(E) of conformal blocks on the complex 
double, which are determined by two types of constraints: 

(1) They must be invariant under the action of Map(E) = MapÇÈ)^ . 

(2) They must satisfy factorization rules. 

According to the principle of holomorphic factorization we need to select a 
specific element of 77(E) to describe a correlator on E. By the principles of TFT, 
in turn, such a vector is determined by a three-manifold Ms whose boundary is 
the double E and a ribbon graph in Ms- For Ms we take the so-called connecting 
manifold [6], defined as the quotient of the interval bundle E x [—1,1] t>y the Z 2 
that acts as cr on E and as t^ —t on the interval. The points with t — 0 provide 
a distinguished embedding of E into Ms- In fact, E is a retract of Ms or, in 
more intuitive terms, Ms is just a fattening of the world sheet E. (To give one 
example: the double of a disk is a sphere, and the orientifold map is the refiection 
at the equatorial plane; the connecting manifold is in this case a full three-ball.) 
Concerning the prescription for the ribbon graph in Ms we refer to [8] . We merely 
recall that it involves a (dual) triangulation of E with the ribbons labeled by the 
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Probenius algebra object A and with the trivalent vertices corresponding to the 
multiplication morphism m of A. 

In this framework, the consistency relations of modular invariance (1) and 
factorization (2) can be proven rigorously. We also recover the various combinato- 
rial data - such as modular invariant partition functions, NIM-reps and classifying 
algebras that arise when studying specific small subsets of these consistency con- 
ditions (see e.g. [13, 10, 2]). Moreover, Morit a equivalence, combined with orbifold 
technology, allows for an elegant proof of T-dualities for arbitrary topology of the 
world sheet. 

We have already seen that modules over A correspond to boundary condi- 
tions. In non-commutative algebra, it is natural to consider himodules as well, and 
again they turn out to possess a physical interpretation: they describe defect lines. 
More specifically, these defect lines carry neither momentum nor charges. Thus 
they are topological in nature, so that their position matters only up to homotopy. 
These defect lines can actually be generalized to interfaces between two different 
full local CFTs that are based on one and the same chiral CFT. Disorder fields 
create or change defect lines. 

The following table, associating algebraic structures to physical concepts, can 
serve as a succinct summary of our results [9] : 



Physical concept 


Algebraic structure 


Boundary condition 
Boundary field 


left A-module 
Hom^ (M(g)a, N) 


Bulk field ^ab 
Defect line 
Disorder field 

In particular, bulk fields can be regar 
which connect the trivial defect line. 


HomA,yi((A(8)a)“ (A(g)6)+) 

A-bimodule 

Hom^,yi((Bi{g)a)“, 

ded as a special type of disorder fields - those 
i.e. the one labeled by A, to the trivial defect 



line. 

4 An example: Permutation branes 

Our algebraization of physical concepts not only leads to statements that can be 
proved rigorously, but also allows us to establish powerful algorithms for doing 
computations. In particular, for constructing a full local CFT only a single non- 
linear constraint needs to be solved: the one that encodes associativity of the 
Frobenius algebra object A. We illustrate the power of our approach with the 
following example. 

Consider the tensor product of N identical conformal field theories whose 
modular tensor category is C. We are then dealing with the category which 

comes with an action of the symmetric group S^. One may wish to classify all 
those boundary conditions of the C ^ ^-theory which respect the subalgebra of the 
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chiral algebra that is left pointwise fixed under the ^A^-action. To perform this 
task we can work with the orbifold category 

Corb := C^^/Sn 

for which many aspects of the chiral data are explicitly known [4, 1]. In particular, 
the primary fields of the orbifold theory are labeled by pairs ({A},?/^), where {A} 
is an unordered AT-tuple of labels for simple objects in C and is a character of 
the double of the stabilizer subgroup 5^ of {A} in 5 tv- 

It is not difficult to find the relevant algebra object. The algebra structure is 
determined by the product in the algebra of functions on 5jv, and it is realized on 
the object 

^=0 diml^.^({0},(^,[e])). 

Here 0 is the A/"- tuple each entry of which is the vacuum sector of C. Its stabilizer 
is all of 5 ' at; the pairs (^0, [e]), consisting of a character ^1; of Sn and the conjugacy 
class of the unit element specify a subclass of irreducible representations 

of the double D(5 at). 

One can show that 4 is a (braided-) commutative symmetric special Frobe- 
nius algebra. The irreducible A-modules, i.e. the “permutation branes” in the 
terminology of [14], can be easily determined with the help of theorem 3.2 of [12]. 
They are labeled by pairs (A, g) where A is now an ordered A^-tuple of labels and 
g is an element of the stabilizer group S^. One can then apply the formalism of 
[8, 9] to compute e.g. the boundary states and annulus partition functions. The 
latter turn out to be linear combinations of twining characters with prefactors 
that are sums of products of fusion rules of the C-theory. The NIM-rep property 
of the annuli as well as the consistency of the boundary states is guaranteed by 
the general theory. 

5 Conclusions 

Besides its computational power and the possibility to obtain general proofs of 
consistency, an additional benefit of our approach is that it reduces old physical 
questions to standard problems in algebra and representation theory. Here are 
some interesting examples: 

• The classification of CFTs with given chiral data C amounts to classifying 
Morita classes of symmetric special Frobenius algebras in the category C. In 
particular, physical modular invariants of automorphism type are classified 
by the Brauer group of C. 

• The classification of boundary conditions and defect lines is reduced to the 
standard representation theoretic problem of classifying modules and bi- 
modules. As a consequence, powerful methods like induced modules and reci- 
procity theorems are at our disposal. 
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• The problem of deforming CFTs is related to the problem of deforming al- 
gebras, which is a cohomological question. For the moment, the only known 
results in this direction are rigidity theorems [5]: a rational CFT cannot be 
deformed within the class of rational CFTs. It will therefore be important to 
get a better understanding of non-rational CFT. 
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Surprises with Angular Momentum 

Paul K. Townsend 



Abstract. The physics of angular momentum in even space dimensions can be sur- 
prisingly counter-intuitive. Three such surprises, all associated with the properties 
of supersymmetric rotating objects, are examined: (i) 5D black holes, (ii) Dyonic 
instantons and (iii) Supertubes. 



1 Prelude 

In his book Surprises in Theoretical Physics^ Peierls examines various occasions 
on which his research led to a surprising conclusion [1] although, as he says, there 
would have been no surprise if one had really understood the problem from the 
start. Such surprises are significant in that they expose flaws in one’s physical 
intuition and thus serve to refine it. In the same spirit, this article recounts three 
surprising results of the author’s co- investigations into the properties of supersym- 
metric rotating objects. In each case one could imagine a similar surprise arising 
for non-supersymmetric objects, so supersymmetry just provides a convenient, and 
simplifying, context; the surprises are chiefly due to unexpected features of angular 
momentum, a fact that might itself be considered a meta-surprise given the central 
role of angular momentum in physics. However, intuition for angular momentum 
is usually acquired from a study of objects rotating in three-dimensional space, 
whereas the ceises reviewed here involve rotation in either four space dimensions 
or, in the last case to be considered, two space dimensions. Although the context 
of each of the three surprises is quite different, there are a number of points of 
contact that make a comparative review seem worthwhile. I thank the organiz- 
ers of TH-2002 for allowing me the opportunity to present such a review, and I 
congratulate the Mayor of Paris for Paris Plage. 

2 Supersymmetric Rotating Black Holes 

The first of our three surprises arose from a study of supersymmetric black hole 
solutions of 5D supergravity [2]. Supersymmetric black holes are special cases of 
stationary black holes. A stationary (aisymptotically flat) black hole spacetime ad- 
mits a Killing vector field (KVF) k that is timelike near spatial infinity, and unique 
up to normalization. However, there may be interior regions outside the horizon, 
called ‘ergoregions’, within which k is spacelike; in fact, an event horizon with 
a non-zero angular velocity necessarily lies within an ergoregion. Supersymmetric 
spacetimes cannot have ergoregions, however, because supersymmetry implies that 
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k can be expressed in terms of a Killing spinor field, and this expression allows k 
to be timelike or null but not spacelike. It follows that the event horizon of a su- 
persymmetric black hole must he non-rotating [2]. This general observation, which 
applies in any spacetime dimension, should be kept in mind in what follows. 

The angular momentum J of any D-dimensional asymptotically-flat space- 
time can be expressed as the surface integral 

where G/; is the D-dimensional Newton constant, V the standard covariant deriva- 
tive, m a suitably normalized spacelike KVF with closed orbits and dS the dual of 
the (D — 2)-surface element at spatial infinity, which can be viewed as the boundary 
at infinity of a spacelike {D — l)-surface E. In the case of a black hole spacetime 
one may choose E to intersect the event horizon on a (£> — 2)-surface H, in which 
case the expression (1) may be re-expressed in the form 

T = J// -h Je (2) 

where Jh is a surface integral over H with the same integrand as in (1) and Je is 
a ‘bulk’ integral over the region of E outside the horizon. 

For solutions of 4D Einstein-Maxwell theory, and hence of pure TV = 2 4D 
supergravity, the integral Je vanishes and so J = J//. In other words, the angular 
momentum is due to the black hole itself. This seems reasonable given that non- 
zero J implies a rotating horizon by a theorem of Wald (which states that a 
stationary black hole with a non- rotating horizon is static [3]). But a rotating 
horizon is incompatible with supersymmetry so any 4D supersymmetric black hole 
must be static. 

The situation in 5D is more subtle. Firstly, the 5D Einstein-Maxwell theory 
admits a possible ‘FFA’ Chern-Simons (CS) term, which is present in the pure 
5D supergravity theory with a particular coefficient [4]. A supersymmetric black 
hole solution of this Einstein-Maxwell-CS theory must again have a non-rotating 
horizon but this no longer implies that J = 0; there is no 5D analogue of Wald’s 
theorem because the bulk Maxwell field may now carry angular momentum. In 
fact, a stationary supersymmetric black hole solution of 5D supergravity with 
non-zero J exists. It was first found in a slightly different context by Breckenridge 
et al. [5] and is usually called the ‘BMPV’ black hole. It was later shown to be a 
1/2 supersymmetric solution of 5D matter-coupled supergravity [6, 7]. In the form 
found in [2] , as a solution of the pure minimal 5D supergravity, the metric is 

ds^ = ^ j + (l + ^) + r'^dül) (3) 

where JÜ3 is the {SU{2)l x Ff/(2)i^)-invariant metric on = SU (2) and is 
one of the three left- invariant forms on SU {2) satisfying Jcrs = <Ji A <T2 and cyclic 
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permutations. The parameters /i and j are related to the total mass M and total 
angular momentum J as follows: 

4MG5 . 2JG5 ,,, 

The singularity at r = 0 is just a coordinate singularity at a degenerate non- 
rotating event horizon provided that 

(5) 



Although the horizon has zero angular velocity, it is affected by the rotation; the 
horizon is a 3-sphere, topologically, but geometrically it is a squashed 3-sphere, 
with a squashing parameter proportional to J. As the squashed 3-sphere 

degenerates, and for p > there are closed timelike curves through every point 
[2]. For this reason we restrict j as above (and refer the interested reader to [8, 9] 
for details of the ‘over- rotating’ case). 

Because the angular velocity of the horizon vanishes, one might expect to find 
that J// = 0 and hence that Js = J. However, and this is the promised surprise, 
a calculation shows that [2] 








J> J. 



(6) 



This implies not only that Jh is non-zero but also that it is negative, as a direct 
computation confirms! What this means is that a negative fraction of the total 
angular momentum is stored in the Maxwell field behind the horizon. 

Of course, given that there can be a contribution to the total angular mo- 
mentum of a charged black hole from the Maxwell field outside the horizon there 
is no good reason to suppose that there is no similar bulk contribution from inside 
the horizon, and once this has been appreciated it is not difficult to see why the 
fraction should be negative: given a positive bulk contribution to the angular mo- 
mentum, one would expect frame dragging effects to cause the horizon to rotate 
unless these effects are counterbalanced by the frame dragging effects due to a 
negative contribution to the angular momentum in the fields behind the horizon 
[2]. Because the horizon of a supersymmetric black hole cannot rotate, Jd > J 
should be expected. So why was it a surprise? The answer is presumably that 
there is a clash with intuition derived from approaches to black hole physics such 
as the membrane paradigm [10, 11] in which physical properties of the black hole 
relevant to an exterior observer are expressed entirely in terms of the horizon and 
its exterior spacetime. Rotating 5D black holes appear to present an interesting 
challenge to this paradigm. 



3 Interlude: symmetries and angular momentum: I 

Before considering the next surprise it will be useful to consider the effects, or 
expected effects, of rotation on rotational symmetry. In four space dimensions the 
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angular momentum 2-form L has two skew eigenvalues J ± J', where J and J' 
are the quantum numbers associated to the rotation group 5pin(4) = SU{2)r x 
SU (2)l. One expects spm(4) to be broken to U {1)r xU{1)l for generic ( J, J'), but 
to U (1) rX SU (2) L when J' = 0, in which case L is self-dual, and to SU (2) rxU (1) l 
when J — 0, m which case L is anti- self-dual. The supersymmetric rotating 5D 
black hole has J' = 0, which is why there is only a single rotation parameter J; 
the U{1)r X SU{2)l symmetry is evident from the metric (3), and this is also the 
isometry group of the squashed 3-sphere. 

The supersymmetry generators of minimal 5D supersymmetry transform as 
the ( 2 , 1 ) 0 ( 1 , 2 ) representation of SU{2)r x SU{2)l. Half the generators, call 
them Ql, are singlets of SU(2)r and the other half, Qr, are singlets of SU{2)r. A 
half-supersymmetric configuration on which Ql acts trivially will preserve SU (2) l 
in which case the rotational symmetry group must be unbroken or broken to 
U{1)r X SU{2)r. In the case of the 5D black hole the former possibility applies 
when J = 0 and the latter when J ^ 0. More generally, we deduce that preservation 
of 1/2 supersymmetry implies a self-dual or anti-self-dual angular momentum 2- 
form. 

4 Dyonic Instantons 

The instanton solution of Euclidean 4D Yang-Mills theory has an alternative in- 
terpretation as a static soliton of the 5D Yang-Mills theory. As a solution of 5D 
Super-Yang-Mills (SYM) theory these ‘instanton-solitons’ preserve 1/2 of the su- 
persymmetry of the gauge theory vacuum. Let us concentrate on the minimal 5D 
SYM theory with gauge group SU (2)^ for which the bosonic field content con- 
sists of the SU (2) triplet of gauge potential 1-forms {a = 1,2,3) and a single 
scalar (Higgs) triplet (j )^ . Supersymmetry does not permit a potential for the Higgs 
field so its expectation value is arbitrary. The vacua are thus parametrized by the 
constant 3- vector 

m = ( 7 ) 

For vanishing Higgs field, and hence u = 0, a Yang-Mills instanton is a (marginally) 
stable supersymmetric soliton solution of the 5D SYM theory with unbroken 
SU (2). A class of multi-soliton solutions, with arbitrary instanton number /, is 
given by the ’t Hooft ansatz 

Af = %djlogH (z = l,2,3,4) (8) 

where fj^ is the triplet of anti-self-dual complex structures on and if is a 
harmonic function on with point singularities such that if ^ 1 ais r ^ oo, 
where r is the radial distance from the origin of E^ . The simplest possibility, 



yields the one-instanton (1=1) solution of ‘size’ p. 



( 9 ) 
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When V ^ 0 then SU{2) is spontaneously broken to U{1) and the instanton- 
soliton is destabilized; a simple scaling argument shows that the energy is reduced 
if p is reduced, so the soliton will implode to a singular instanton configuration with 
p = 0. However, the addition of an electric U{1) charge can stabilize the soliton 
at some equilibrium radius, at which supersymmetry is again partially preserved 
[12]. Specifically, if we set 

A^ = 4)^ = v^H~^ (10) 

then one again has a supersymmetric configuration. It is 1/2 supersymmetric as a 
solution of the minimal SYM theory discussed here but 1/4 supersymmetric as a 
solution of the maximally-supersymmetric 5D SYM theory, as is suggested by the 
formula 

M = 47t^|/| + \vq\ (11) 

for the mass of a dyonic instanton. 

The special case of 

H = l+-^ (12) 

yields the one dyonic instanton (/ = 1) of size p. A computation of the U{1) 
electric charge q of this solution shows that q^ vj equivalently [12] 



p ~ 


(13) 


The energy density takes the form 




8 = v^f{vr) 


(14) 


for some function /. As long as 




vq < IGtt^ 


(15) 



the energy density is peaked about the origin in a region with diameter of order 
p. However, if 

vq > 167t^ (16) 

then the energy density takes its maximum on some 3-sphere centred on the origin 
[13]; in the limit of large vq the radius of this 3-sphere is of order p. 

Note that the energy density of this I = I solution is hyper-spherically- 
symmetric, and the 3-sphere around which the energy density is distributed at large 
vq is a round 3-sphere, not a squashed one. The rotational Spin(4) = SU (2) l x 
SU {2)ji symmetry is broken, to either SU (2) or SU (2)/^, by the Yang-Mills-Higgs 
field configuration itself. However, these fields are also acted on by an ‘isospin’ 
group SU{2)[, and the diagonal subgroup SU{2)d of either SU{2)r x SU{2)i or 
SU (2) L X SU (2) / survives. Thus the Yang-Mills-Higgs fields of the dyonic instanton 
preserve either SU{2)l x SU{2)o or SU{2)o x SU{2)ji. This is interpreted as the 
rotational invariance group of gauge-invariant quantities such as the energy density. 
Compare this state of affairs to that of the BPS monopole of 4D N=2 SYM theory. 
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In that case the rotation group is SU (2) and the group SU (2) x SU (2)/ is broken to 
the diagonal SU (2 ) d by the one-monopole solution, which is therefore spherically 
symmetric. The one dyonic instanton solution is hyper-spherically symmetric for 
essentially the same reason. 

The spherical symmetry of the BPS monopole indicates that it carries no an- 
gular momentum, because angular momentum would break the spherical symme- 
try. One might similarly expect the hyper-spherically symmetric dyonic instanton 
to carry no angular momentum but, and this is our second surprise, a computation 
yields [13] 

Lii = -?v • fjij (17) 

where v is the unit 3-vector with components n“/n. Thus, a self-dual dyonic in- 
Stanton has an anti-self-dual angular momentum 2-form proportional to q, and 
this is true even for the hyper-spherically symmetric dyonic instanton solution of 
[ 12 ], 

The realization that the one dyonic instanton must carry angular momentum, 
despite its hyper-spherical symmetry emerged from a computation of its gravita- 
tional field [13]. Surprisingly, this turned out to be stationary rather than static, 
and a subsequent calculation of the angular momentum of the flat space dyonic 
instanton yielded the above formula. Should this result not have been anticipated 
from the start? Essentially, the reason that the BPS monopole has no angular 
momentum is that there is nothing in the monopole ansatz that could produce a 
non-zero result; the angular momentum is zero because there is nothing else that 
it could be. Applied to the dyonic instanton ansatz, the same argument shows only 
that L cx: V • rÿ, so the real question is why one should expect the constant of pro- 
portionality to vanish. It vanishes when g = 0 because the instanton is genuinely 
static, but when q ^ 0 we have electric fields and a configuration with electric 
fields is not ^genuinely’ static (for a reason to be explained below) so the formula 
(17) should not really have been a surprise. 

Nevertheless, I still find that the most common reaction to the statement 
spherical symmetry in four space dimensions does not imply vanishing angular 
momentum is surprise. To mitigate the surprise I usually point out that circular 
symmetry in two space dimensions is obviously compatible with non-zero angular 
momentum. Pursuing this point will lead to our third surprise. 

5 Interlude: symmetries and angular momentum: II 

A supersymmetric field configuration of a supersymmetric field theory is one that is 
unchanged by the action of some linear combination Q of supersymmetry charges. 
This means that acts trivially too, but the action of on any gauge-invariant 
field is equivalent to the action of //, which generates time translations. It fol- 
lows that a field configuration can be supersymmetric only if all gauge-invariant 
quantities are time-independent] in particular, this means that the energy density 
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must be t ime- independent ^ It is important to appreciate that it is possible for 
some gauge-dependent field of a supersymmetric field configuration to be time- 
dependent, and this is why there can exist supersymmetric field configurations 
with non-zero angular momentum. For example, a non-zero electric field E can 
have this effect because E = A in an = 0 gauge, and in this gauge A is 
time-dependent if E is non-zero. 

Although the energy density of a supersymmetric field configuration must be 
time-independent, the possibility of a non-zero angular momentum indicates that 
there is motion nevertheless. Consider a circular planar loop of elastic string; this 
may rotate about the axis of the plane, and the associated angular momentum 
will (if it has a sufficiently large magnitude) support the string, against the force 
exerted by its tension, at some equilibrium radius. A rotating configuration of 
this kind would not be incompatible with supersymmetry because the circular 
symmetry ensures that the energy density of the string loop is time- independent. 
This illustrates the point that angular momentum is obviously compatible with 
circular symmetry in two space dimensions. 

It is even more obvious that circular asymmetry is compatible with non-zero 
angular momentum but in this case it might seem unlikely that the energy density 
could be time- independent (as would be required by supersymmetry); any rotating 
Tump’ on the string would clearly imply a time-dependent energy density. 

6 Supertubes 

The supertube, as originally considered [14], is a kind of string theory realization of 
the spinning string loop supported by angular momentum. I say ‘kind of’ because 
a relativistic string that is described by a Nambu-Goto action cannot support 
momentum along the string and hence cannot rotate if it is circularly symmetric. 
However, IIA superstring theory has membrane solitons that appear as D2-branes, 
and a cylindrical D2-brane can be supported against collapse by angular momen- 
tum in a plane orthogonal to the axis of the cylinder. This is possible because the 
D2-brane action is not of Nambu-Goto type but rather of Dirac-Born-Infeld type 
and the angular momentum can be generated by the Born-Infeld (BI) electric and 
magnetic fields. Specifically, the Lagrangian density is 

C= - ^/-det(g + F) (18) 

where g is the induced metric on the 3D worldvolume and F is the worldvolume BI 
field strength 2-form. In principle we have a membrane in (since the spacetime 
is 10-dimensional) but we may choose to consider a membrane in C E®. A 
membrane of cylindrical topology can be parameterized by worldspace coordinates 
(z,a) G R X In a physical gauge adapted to this topology, the geometry of a 

^In the case of gravitational theories this argument needs modification because Q is defined 
only as an integral at infinity, but the end conclusion is similar: a spacetime can be supersymmetric 
only if it is stationary. 
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static membrane is determined by a single function R{z,a) which gives the radial 
position in the plane orthogonal to the axis of the cylinder as a function of position 
on the brane. The BI magnetic field 5 is a worldspace scalar. The BI electric field 
is a worldspace 2- vector but, as we wish to generate an angular momentum in the 
plane orthogonal to the axis of the cylinder, we will choose this 2-vector to be 
parallel to the axis of the cylinder. Thus, the BI 2-form is 

F = E{z,a) dt A dz B{z,a) dz A da . (19) 

The Lagrangian density now reduces to 

c = + Rl) (1 - E ^) + £2 + R 2 J 12 (20) 

where Ra = da R and Rz = dzR- 

We have assumed that the D2-brane has cylindrical topology. If we further 
assume that it has cylindrical geometry then we must set -Rcr = 0 and Rz = 0] the 
radial function R thus becomes a real variable. The BI fields E and B similarly 
reduce to real variables if we assume cylindrical symmetry, and the Lagrangian 
density becomes a function of three variables 

C{E, B,R) = - - £2) + S2 , (21) 



Introducing the electric displacement 



V = 



dE ’ 



the Hamiltonian density 7i = VE — C is 



H{D,B,R) = R-^y^{D^ + R^){B^ + R^). 



This is equivalent to 



( 22 ) 



(23) 



n^ = {D±Bf+(^^Tl^ . (24) 

From this formula we see that the energy is minimized for given B and D when 

R = \BDl (25) 

This is therefore the equilibrium value of the cylinder radius. The equilibrium 
energy is 

Rmin = \D\ + 1^1 . (26) 

This energy formula is typical of 1/4 supersymmetric configurations, and a cal- 
culation confirms that the D2-brane configuration just describes preserves 1/4 
supersymmetry, hence the name ‘supertube’. 




Vol. 4, 2003 Surprises with Angular Momentum 



S191 



As we go round the circle parametrized by cr, the tangent planes to the tube 
at a point with coordinates {z^cr) are rotated by an angle in the plane orthogonal 
to the axis of the cylinder. Under normal circumstances a configuration of this type 
would not preserve supersymmetry because the D2 constraint on the supersymme- 
try parameter associated with one tangent plane would be incompatible with the 
constraint associated with any of the other tangent planes. As an extreme example 
consider two tangent planes at diametrically opposite points on the circle; if we 
declare the constraint associated with one to be the D2-constraint then the con- 
straint associated with the other is the anti-D2-constraint. Thus, the supertube 
is effectively a supersymmetric configuration that includes both D2-branes and 
anti-D2-branes! There are various related ways to understand how this is possible. 
Note that the relation between E and D is 



D [WVb? 
R\ 



(27) 



and that this yields E = \ when R = \BD\. An electric field has the effect of 
reducing the D2-brane tension, and increasing E to its ‘critical’ value E = \ 
would reduce the tension to zero if the magnetic field were zero; this can be seen 
from the fact that C — —\B\ ior E = 1. This explains why the supertube hats 
no energy associated to the D2-brane tension; its energy comes entirely from the 
electric and magnetic fields, which can be interpreted as ‘dissolved’ strings and 
DO-branes, respectively. The energy from the D2-brane tension has been cancelled 
by the binding energy released as the strings and DO-branes are dissolved by the 
D2-brane. Given that the D2-brane energy has been cancelled, it is perhaps not so 
surprising to discover that the D2-brane constraint is also absent, and hence that 
D2-branes can co-exist with anti-D2-branes without breaking supersymmetry. In 
any case, this is what happens and I refer to [14, 15] for a much more complete 
discussion of this point. 

While the supersymmetry of the supertube might appear surprising, this fea- 
ture was not discovered accidentally and, in any case, is not a surprise specifically 
related to angular momentum. Following the initial supertube paper [14], a matrix 
model version of it was introduced by Bak and Lee [16]. A subsequent paper by 
Bak and Karch [17] found a more general solution of the matrix model describing 
an elliptical supertube, which included a plane parallel D2/anti-D2 pair as a limit- 
ing case. The fact that a circular cross-section could be deformed to an ellipse was 
certainly a surprise to me because it was hard to understand how any shape other 
than a circle could be consistent with both rotation and the time-independent 
energy profile required by supersymmetry (the limiting parallel brane/anti-brane 
case, in which angular momentum is replaced by linear momentum, seemed much 
less problematical). And what was so special about an ellipse? Was this some arte- 
fact of the matrix model approach? The possibility of a non-circular cross-section 
had been considered, and rejected, in [14], but what was actually shown there 
is that the cross-section must be circular if / 0, whereas the supertube has 
Rz — 0. Let us return to (20) and set = 0 but keep the variables {E^ B, R) as 
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functions of a. The same steps as before now lead to 

As D and B are now functions of a we should minimize H for fixed average electric 
displacement D and average magnetic field 5, these quantities being proportional 
to the IIA string charge and DO-brane charge per unit length, respectively. The 
result of this minimization procedure is that suggested by the above formula; the 
energy is minimized when 

+ Rl = \BD \ , (29) 

with no restriction on R(cr), and the energy at this minimum is |5| + \B\. This 
implies preservation of 1/4 supersymmetry, as a direct computation confirms [15]. 
Note that the cross-sectional curve described by R{cr) need not even be closed; 
the net D2-brane charge is non-zero for an open curve, which means that the 
supersymmetry algebra will include a D2-brane term in addition to the IIA string 
and DO terms that are present for a closed curve. While this may be another 
surprise it is not one related to angular momentum, so I refer to [15] for details of 
its resolution. 

One way to understand how an arbitrary tubular cross-section can be com- 
patible with supersymmetry is to note that the supertube is TST-dual to a wave 
of arbitrary profile on a IIA string [18] because it has been appreciated for a 
long time that any uni-directional wave on a string preserves supersymmetry [19]. 
However, this explanation provides little in the way of intuition that might help 
us understand the original problem: how can a rotating tubular D2-brane with 
a non-circular cross section have a time-independent profile? There is a simple 
explanation, which can be found in [15]. I shall explain it here in terms of a 
closely related undergraduate mechanics problem that I heard about from Bran- 
don Carter, whose earlier paper with Martin [20] describes a similar phenomenon 
in the context of superconducting cosmic strings. 

Consider an elastic hosepipe through which superfluid flows at variable ve- 
locity u(s), where s parametrizes the curve C described by the hosepipe. If C is not 
straight then the motion of the fluid will produce a centrifugal force proportional 
to f^(s)/r(s) where r{s) is the radius of curvature of C at position s. If we are 
free to choose the function u(s) then we may choose it such that the centrifugal 
force produced by the fluid motion exactly balances, for all s, the centripetal force 
due to the hosepipe tension. Given C, and a parametrization of it, the function 
v(s) will be determined by this local force balance condition. We may choose C 
to be closed, in which case the hosepipe forms a closed loop of arbitrary shape 
that is prevented from collapse by the angular momentum generated by the cir- 
culating superfluid. A crucial feature of this hosepipe loop is that its shape does 
not rotate, so the energy density profile is time- independent, despite the non-zero 
angular momentum. This shows that time-independence does not imply circular 
symmetry. 
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7 Epilogue 

I have discussed three surprises involving angular momentum in even space di- 
mension arising from various research projects on rotating supersymmetric objects 
undertaken over the past few years with Eduardo Eyrais, Jerome Gauntlett, David 
Mateos, Robert Myers, Selena Ng and Marija Zamaklar. Although each surprise 
arose in its own distinct context, this article was motivated by the idea that there 
is something to be learnt by considering them together. Some of the connections 
between the three surprises have been discussed above, but there are others. For 
example, the supertube of circular cross-section is special in that it maximizes the 
angular momentum for given energy. This angular momentum upper bound arises 
in the context of the supergravity solution sourced by a supertube as a condition 
for the absence of global causality violations due to closed time-like curves [21], 
which is precisely the origin of the upper bound on the angular momentum of a su- 
persymmetric rotating 5D black hole. Also, the fact that the shape of a supertube 
does not rotate despite the non-zero angular momentum is reminiscent of the fact 
that the horizon of the ‘rotating’ 5D black hole does not rotate despite its non-zero 
angular momentum; of course, the underlying reason here is supersymmetry, but 
one wonders whether the mechanisms might not also be related. 

There are also several further connections between dyonic instantons and su- 
pertubes. The dyonic instanton of 5D gauge theory has a 3D sigma- model analogue 
known as a Q-lump [22]; this is a charged sigma- model lump that expands to a loop 
for large charge in the same way that the dyonic instanton expands to a 3-sphere 
[23]. In fact, it was this analogy that led to the realization that there should exist a 
tubular supersymmetric D2-brane supported by angular momentum; as explained 
in [14], the supertube can be viewed as an effective worldvolume description of 
the sigma-model Q-lump. Given this, it is natural to wonder whether the dyonic 
instanton has a similar effective realization as a tubular D(3-l-p)-brane with a 3- 
sphere cross section (the 5D SYM/Higgs theory has a D4-brane realization, and 
the dyonic instanton then acquires a string-theory interpretation [24] but this is 
quite different from the effective brane description being suggested here). 

Finally, there is the question, which I leave unanswered, whether the special 
features of angular momentum in two and four space dimensions extend to higher 
even dimensions. For example, is non-zero angular momentum compatible with 
SO{2n) symmetry of the energy density in 2n space dimensions for n > 27 



References 

[1] R. Peierls, Surprises in Theoretical Physics, Princeton University Press 
(1979), Princeton University Press (1991). 

[2] J.P. Gauntlett, R. Myers and P.K. Townsend, Black holes of D=5 supergrav- 
ity, Class. Quant. Grav. 16 , 1-21, (1999). 




S194 



Paul K. Townsend Ann. Henri Poincaré 



[3] R. Wald, The first law of black hole mechanics: vol 1, in Directions in gen- 
eral relativity, eds. B.L. Hu, M.P. Ryan, Jr., C.V. Vishveshwara and T.A. 
Jacobson, pp 358-366, Cambridge University Press (1993); gr-qc/9305022. 

[4] E. Cremmer, Supergravities in five dimensions, in Superspace and supergrav- 
ity, eds. S.W. Hawking and M. Rocek, pp 267-282, Cambridge University 
Press (1981) 

[5] J.C. Breckenridge, R.C. Myers, A.W. Peet and C. Vafa, D-branes and spinning 
black holes, Phys. Lett.B 391 , 93-98, (1997). 

[6] R. Kallosh, A. Rajaraman and W.K. Wong, Supersymmetric rotating black 
holes and attractors, Phys. Rev.D 55, 3246-3249, (1997). 

[7] A.H. Chamseddine and W.A. Sabra, Metrics admitting Killing spinors in five 
dimensions, Phys. Lett.B 426, 36 -42, (1998). 

[8] G.W. Gibbons and C.A.R. Herdeiro, Supersymmetric rotating black holes and 
causality violation, Class. Quant. Grav. 16 , 3619-3652, (1999). 

[9] C.A.R. Herdeiro, Special properties of five-dimensional BPS rotating black 
holes, Nucl. Phys.B 582 , 363-392, (2000). 

[10] T. Damour, Black hole eddy currents, Phys. Rev.D 18, 3598-3604, (1978). 

[11] Black Holes: the membrane paradigm, eds. K.S. Thorne, R.H. Price and D.A. 
Macdonald, Yale University Press (1986). 

[12] N.D. Lambert and D. Tong, Dyonic instantons in five-dimensional gauge the- 
ories, Phys. Lett.B 462, 89-94, (1999). 

[13] E. Eyras, P.K. Townsend and M. Zamaklar, The heterotic dyonic instanton, 
JHEP 0105, 046 (2001). 

[14] D. Mateos and P.K. Townsend, Supertubes, Phys. Rev. Lett. 87 , 011602 

( 2001 ). 

[15] D. Mateos, S.K.L. Ng and P.K. Townsend, Tachyons, supertubes and 
brane/anti-brane systems, JHEP 0203, 016 (2002). 

[16] D. Bak and K. Lee, Noncommutative supersymmetric tubes, Phys. Lett.B 
509 , 168-174 (2001). 

[17] D. Bak and A. Karch, Supersymmetric brane anti-brane configurations, Nucl. 
Phys.B 626, 165-182 (2002). 

[18] D. Mateos, S.K.L. Ng and P.K. Townsend, Supercurves, Phys. Lett.B 538 , 
366-374 (2002). 




Vol. 4, 2003 Surprises with Angular Momentum 



S195 



[19] A. Dabholkar, J.P. Gauntlett, J.A. Harvey and D. Waldram, Strings ais soli- 
tons and black holes as strings, Nucl. Phys.B 474, 85-121 (1996). 

[20] B. Carter and X. Martin, Dynamic instability criterion for circular string 
loops. Annals Phys. 227 , 151-171 (1993). 

[21] R. Emparan, D. Mateos and P.K. Townsend, Supergravity supertubes, JHEP 
0107 , on (2001). 

[22] R. Leese, Q-lumps and their interactions, Nucl Phys.B 366 , 283-314 (1991); 
E.R.C. Abraham, nonlinear sigma models and their Q-lump solutions, Phys. 
Lett.B 278 , 291-296 (1992). 

[23] E.R.C. Abraham and P.K. Towwnsend, More on Q-kinks: a (l+l)-dimensional 
analogue of dyons, Phys. Lett.B 29 , 225-232 (1992). 

[24] M. Zamaklar, Geometry of the non-abelian DBI dyonic instanton, Phys. 
Lett.B 493, 411-420 (2000). 



Paul K. Townsend 

DAMTP, University of Cambridge 

Centre for Mathematical Sciences 

Wilberforce Road 

Cambridge CB3 OWA 

UK 




I Annales Henri Poincaré 



Ann. Henri Poincaré 4, Suppl. 1 (2003) S197 - S210 
© Birkhâuser Verlag, Basel, 2003 
1424-0637/03/01S197-14 
DOI 10.1007/s00023-003-0916-z 



Lattice QCD - from Quark Confinement 
to Asymptotic Freedom 

Martin Lüscher 



Abstract. According to the present understanding, the observed diversity of the 
strong interaction phenomena is described by Quantum Chromodynamics, a gauge 
field theory with only very few parameters. One of the fundamental questions in this 
context is how precisely the world of mesons and nucleons is related to the properties 
of the theory at high energies, where quarks and gluons are the important degrees 
of freedom. The lattice formulation of QCD combined with numerical simulations 
and standard perturbation theory are the tools that allow one to address this issue 
at a quantitative level. 



1 Introduction 

Quantum chromodynamics (QCD) is a gauge field theory that looks rather similar 
to quantum electrodynamics. Apart from the fact that the gauge group is SU(3) 
instead of U(l), the Lagrange density 

^QCD = ^ (1.1) 

q=u,d^s,... 

has the same general form, the first term being the square of the gauge field tensor 
and the second a sum over the contributions of the up, down, strange and the 
heavy quarks. No attempt will here be made to explain the detailed structure of 
eq. (1.1), but an important point to note is that there are no free parameters other 
than the gauge coupling g and the masses rriu, rrid , ... of the quarks. 

QCD is thus an extremely predictive theory. It is also difficult to work out 
so that only too often ad hoc approximations need to be made before theory 
and experiment can be compared. Precision tests of QCD are therefore still rare, 
and complex strong interaction phenomena (such as those observed in heavy ion 
collisions) will probably never be explained from first principles. 

1.1 Quark confinement fc asymptotic fi:eedom 

The Feynman rules derived from the QCD lagrangian suggest that the quarks 
are weakly interacting with each other by exchanging massless vector bosons (see 
fig. 1). Similar to an exchange of photons, the forces that result from this type 
interaction fall off like 1/r^ at large distances r, and the energy required to break 
up a quark- antiquark bound state is hence finite. Quarks have, however, never been 
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Figure 1: In QCD the quarks interact by exchanging massless vector bosons (wiggly line) that 
are referred to as gluons. They carry SU(3) quantum numbers and couple to all flavours of quarks 
with equal strength proportional to the gauge coupling g. 



observed in isolation nor is there any experimental evidence for massless vector 
bosons other than the photon. 

An important hint to the solution of the puzzle comes from perturbation the- 
ory itself. Once higher-order corrections are included, it turns out that the strength 
of the interactions mediated by the vector bosons depends on the magnitude q of 
the energy- momentum that is transferred between the quarks. Explicitly, if we 
introduce an effective coupling 




where the shaded circle stands for the sum of all vertex diagrams with all possible 
exchanges of virtual quarks and vector bosons, it can be shown that [1],[2] 



^siq) 



gjq? 

47T 



c 

ln(g/A) '*'■■■ 



(1.3) 



for large momenta q and some calculable constant c. The logarithmic decay of 
the coupling (which is referred to as asymptotic freedom) is actually observed in 
high-energy scattering experiments (see fig. 2), and from such measurements the 
value of the mass scale A in eq. (1.3) was determined to be 213^3^ MeV [4] *. 

The fact that the gauge coupling is weak at high energies implies that pertur- 
bation theory will, in general, be reliable in this regime. On the other hand, as we 
move towards the other end of the energy scale, the expansion breaks down at some 
point and the physical picture associated with the Feynman diagrams consequently 
becomes invalid. An immediate conflict between theory and the non-observation 
of isolated quarks is thus avoided. 



*The quoted value of A refers to a particular definition of the effective coupling that is known 
as the “MS scheme of dimensional regularization”. 
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Figure 2: The experimentally measured values of the effective gauge coupling as{q) confirm the 
theoretically expected behaviour [eq. (1.3)] at high energies (compilation of the Particle Data 
Group [3]). 



1.2 Lattice formulation 

It remains to be shown, however, that quark confinement is indeed a property of 
QCD. Moreover, once this is achieved, computational tools need to be developed 
to determine the basic properties of the quark bound states such as the pions, the 
kaons and the nucleons. 

The lattice formulation of QCD was introduced many years ago as a frame- 
work in which these issues can be addressed [5]. Introductory texts on the subject 
are refs. [6]-[9], for example, and to find out about the most recent developments 
in the field, the proceedings of the yearly lattice conferences usually provide a good 
starting point [10]. 

Very briefly lattice QCD is obtained by replacing the four-dimensional space- 
time continuum through a hypercubic lattice and by restricting the quark and the 
gauge fields to the lattice points. The expression for the Lagrange density (1.1) 
then needs to be discretized in a sensible way, and the precise relation between 
the correlation functions calculated on the lattice and the physical quantities of 
interest must be understood. A few key elements of this construction are 

- The gauge symmetry can be fully preserved, and no additional unphysical 
degrees of freedom are thus introduced. 

- In general the details of the discretization become irrelevant in the continuum 
limit, i.e. any reasonable lattice formulation will give the same continuum 
theory up to finite renormalizations of the gauge coupling and the quark 
masses. 

- Lattice QCD admits an expansion in Feynman diagrams that coincides with 
the usual expansion up to terms proportional to the lattice spacing. The 
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Figure 3: Recent calculations of the hadron masses (data points) in lattice QCD agree quite 
well with the observed spectrum (horizontal lines) [11]. 



consistency of the lattice theory with the standard perturbative approach to 
QCD is thus guaranteed. 

A last point to be mentioned here is that the lattice provides a regularization of 
the ultra-violet divergences that are usually encountered in quantum field theory. 
The lattice theory is, therefore, mathematically well defined from the beginning. 



1.3 Numerical simulations 

The application of numerical simulation methods to solve the theory has been an 
interesting perspective since the early days of lattice QCD. Today quantitative 
results are practically all based on such numerical studies. 

In the course of these calculations the fields have to be stored in the memory 
of the computer, and only lattices of limited size can thus be simulated. The 
progress in computer technology allows the lattice extents to be doubled in all 
directions roughly every 8 years. At present lattices aa large as 128 x 64^ can be 
accommodated on (say) a commodity PC cluster with 128 nodes. The simulation 
then proceeds by generating a representative ensemble of fields through a stochastic 
process. Eventually the physical quantities are extracted from ensemble averages 
of products of gauge-invariant local fields. 

In general numerical simulations have the reputation of being an approximate 
method that mainly serves to obtain qualitative information on the behaviour of 
complex systems. This is, however, not so in lattice QCD, where the simulations 
produce results that are exact (on the given lattice) up to statistical errors. The 
systematic uncertainties related to the non-zero lattice spacing and the finite lattice 
volume then still need to be investigated, but these effects are theoretically well 
understood and can usually be brought under control. 
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Figure 4: Polyakov loops are Wilson lines that wind around the space-time manifold in the time 
direction. In eq. (2.1) we consider the correlation function of two Polyakov loops with opposite 
orientation that are separated by a distance r in space. 



Lattice QCD is being used to compute a wide range of physical quantities, in- 
cluding the hadron mass spectrum (fig. 3), decay constants, form factors and weak 
transition matrix elements. The technique is not universally applicable, however, 
and quantities like the inelastic proton-proton scattering cross-section or the nu- 
cleon structure functions at small angles remain inaccessible. 

2 Quark confinement 

As can be seen from fig. 3, there is currently no perfect match between experi- 
ment and the lattice calculations. The experts are not alarmed by this, since the 
computations are still incomplete in certain respects. However, rather than going 
into any details here, we now turn to the more fundamental question of quark 
confinement. 

2.1 Static quark potential 

Quarks carry SU(3) quantum numbers and are thus sources of the gauge field. As 
a consequence the latter exerts a force on the quarks, since any change in their 
positions implies a change in field energy. According to the current understanding, 
the confinement of quarks is associated with an unbounded increase of the field 
energy at large quark separations. 

The ground state energy of the gauge field hamiltonian in the presence of a 
static quark-antiquark pair separated by a distance r is referred to as the static 
quark potential V{r). There are various ways to compute V{r) on the lattice. 
Perhaps the most elegant approach is to consider Wilson lines that wind around 
a cylindrical space-time manifold (see fig. 4). The correlation function of any two 
such loops can be shown to satisfy 

(P(r)P(O)*) ,= e-'^'"(''){l+0(e-^')}, € > 0, (2.1) 

T^oo 

where T denotes the circumference of the cylinder, and the potential can thus be 
determined by calculating the correlation function. 
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Figure 5: Simulation results for the static potential V(r) [12] and the force V'{r) [13]. The 
statistical and systematic errors in these calculations are smaller than or at most equal to the 
size of the data symbols. 



Some recent results for the potential V{r) and the force V'{r) are plotted in 
fig. 5. They show that V{r) is monotonically rising and eventually grows linearly 
with a small correction proportional to 1/r. The force is fairly strong, at least 
1 GeV/fm in the whole range of distances, and if this continues to be so at larger 
values of r it will evidently not be possible to separate the quark- antiquark pair. 
At short distances, on the other hand, the data points rapidly approach the curves 
that are obtained in perturbation theory (as it has to be since the effective gauge 
coupling is small in this regime). 

2.2 String model 

The data shown in the second plot in fig. 5 suggest that the force V'{r) approaches 
a constant cr ~ 1.06 GeV/fm at large quark-antiquark separations, a is referred to 
as the string tension, because a constant force is what would be obtained if the 
quarks were held together by an elastic string. 

The idea that quark confinement is linked to the formation of string-like flux 
tubes has in fact been around for very many years. We should then not only see 
the linear rise in the static potential but also a characteristic 1/r correction that 
derives from the zero-point energy of the transversal string vibrations. Explicitly 
the prediction is that [14], [15] 

V{r) = ar + n~~+0{r~^), (2.2) 

r — >oo IZr 

where (from the string theory point of view) a and the mass Jjl are free parameters. 

The string picture may appear to be somewhat naive, but the simulation 
data for the second derivative of the static potential shown in fig. 6 agree very well 
with eq. (2.2). What is plotted there is the dimensionless combination — |r^y"(r), 
which, according to string theory, should converge to ^ at large r. If we allow for a 
small higher-order correction proportional to 1/r, the data are perfectly compatible 
with this. 
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Figure 6: Simulation results for the second derivative of the static potential in d = 4 space-time 
dimensions (upper curves and data points) and in three dimensions [13]. 



An even better matching between string theory and the gauge theory is ob- 
served in three space-time dimensions. In this case a string with fixed ends can 
only vibrate in one transversal direction and the associated zero-point energy is 
consequently reduced by a factor 2. This is exactly what one finds on the lattice 
when the three-dimensional gauge theory is simulated (points and curves in the 
lower half of fig. 6). 

2.3 Summary 

The lattice studies conducted so far leave little doubt that the quarks are confined 
because the field energy of the surrounding SU(3) gauge field rises linearly at 
large distances. As far as the static potential is concerned, we have also seen that 
the behaviour of the gauge theory in this regime is accurately described by an 
effective string model. This correspondence is, incidentally, expected to extend to 
the confinement phase of other gauge field theories [16]-[18] and to a range of 
other observables such as the spectrum of excited states in the presence of static 
quarks [19], [20]. 

3 QCD from low to high energies 

As explained above, lattice QCD allows us to study the phenomenon of quark 
confinement and to compute the basic properties of the light mesons and of the 
nucleons. Among the most obvious quantities to consider in this low-energy regime 
are the mass tUtt of the charged pions and the pion decay constant /^r. Experimen- 
tally the latter is determined by measuring the decay rate while in 

QCD it can be expressed as a matrix element 

(0|^^|7T+p) = (3-1) 

of the appropriate axial quark current between the vacuum state and the one- 
pion state with four-momentum p. 
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The experiments at the big particle colliders, on the other hand, probe the 
interactions of the quarks and gluons at high energies (from 10 to 100 and more 
GeV), where asymptotic freedom has set in and perturbation theory may be ap- 
plied to calculate the reaction rates. The hadronic decay width of the Z-boson, 
for example, is given by 

r z-^qq = constant x |l + — + 0(ag)| , (3.2) 

where Og denotes the gauge coupling (1.3) at momentum transfer q equal to the 
mass of the Z-boson and the constant includes the (calculable) contributions of 
the weak and electromagnetic interactions. 

3.1 Connecting diflferent energy regimes 

Prom a purely phenomenological point of view, it seems unlikely that there is any 
connection between (say) the pion mass and the Z-boson decay rate. However, 
since all strong interaction physics is described by the same underlying field theory, 
there have to be at least some such relations. 

To make this a bit more explicit, first note that in QCD any physical quantity 
is a function of the parameters that appear in the lagrangian (1.1). The gauge 
coupling g is one of them, but for the following discussion it is actually more 
natural to consider the scale A in eq. (1.3) to be a basic parameter of the theory. 
We then infer that there are functions G and F such that 

= G(m„/A,md/A, ...), (3.3) 

fj,/A = F{mu/A,md/A,. . (3.4) 
and for the masses of the kaons and the heavier pseudo-scalar mesons similar 
equations can be written down. 

Now if we take the experimental values of the meson masses in units of the 
pion decay constant as input, the equations for the meson masses can be solved 
for the quark masses (in units of A) and the ratio /^r /A then becomes a calculable 
quantity. Note that this ratio links the physical properties of the pions to the A 
parameter, which is a characteristic scale in the high-energy regime of QCD. Since 
/tt/A is also known experimentally, an interesting test is thus obtained that will 
only be passed (barring miracles) if QCD is the correct theory at all energies. 

3.2 The scale problem 

In lattice QCD the calculation of /tt/A appears to be a difficult task, because 
/tt and A are physical quantities that refer to the properties of the theory at 
energies orders of magnitude apart. A straightforward approach then requires the 
simulation of lattices with a very large number of points. 

It is not difficult to obtain an estimate of what would be needed. For the 
computation of and the meson masses, the spatial lattice size L should be at 
least 2fm as otherwise there will be sizeable finite- volume effects. The effective 
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L — Lq, a — Oq 




Figure 7: Finite-size scaling amounts to the construction of a sequence of matching lattices 
with lattice spacings a — 2~^aQ and decreasing physical sizes L. 



gauge coupling 0!s(ç), on the other hand, can only be reliably determined at mo- 
menta q up to 1/a or so (where a denotes the lattice spacing). Moreover, to able 
to extract A from the asymptotic behaviour (1.3) of the coupling, q must be taken 
to values deep in the high-energy regime of QCD. The combination of all these 
requirements then implies that the number L/a of lattice sites in each direction 
has to be on the order of 100 or maybe even larger than this. 



3.3 Finite-size scaling 

Such lattices may become accessible at some point, but it is much more efficient to 
adopt a recursive scheme, where the large scale difference is bridged by a sequence 
of matching lattices [21] (see ref. [22] for an introduction to the subject). The key 
idea is to introduce an effective gauge coupling a(q) that measures the interaction 
strength at a momentum q proportional to 1/L. In this way the finite lattice size 
becomes a device to probe the interactions rather than being a source of systematic 
errors. 

There are many ways to define an effective coupling of this kind. We may 
choose some particular boundary conditions, for example, and take the response 
of the system to a change in the boundary values of the fields as a measure for 
the interaction strength [23]. The important point to note is that the final results 
(such as /tt/A) do not depend on any of these details. 

The next step is to construct a sequence of lattices with lattice spacings a 
and sizes L that decrease by factors of 2 as shown in fig. 7. Starting with some 
physically large lattice, it is possible to scale the lattices to very small sizes in 
this way without running into technical difficulties. The coupling a{q) (the one 
discussed above that is defined at some q proportional to 1/L) can be calculated 
on each of these lattices and its evolution can thus be followed over a wide range 
of momenta. 

3.4 Simulation results 

So far such computations have been performed in the pure gauge theory and in 
QCD with two flavours of quarks. As can be seen from fig. 8, the momentum 
dependence of the coupling a{q) that has been studied is accurately matched by 
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Figure 8: Momentum-dependence of the effective gauge coupling (in a particular finite- volume 
scheme) in the pure SU(3) gauge theory [23]-[25] and in QCD with two flavours of massless 
quarks [26]. 



perturbation theory already at fairly low momenta. Contact with the asymptotic 
behaviour (1.3) of the coupling can thus easily be made, and since also the low- 
energy regime is safely reached, these calculations provide the desired link between 
that regime and the A parameter. 

For the full theory, with all flavours of quarks properly included, a similar 
study has not yet been made. The reason partly is that these calculations are 
exceedingly expensive in terms of computer time, because the available simulation 
techniques become very inefficient once the quark polarization effects are taken 
into account (fig. 9). We shall return to this issue in a moment and only note at 
this point that practically all applications of lattice QCD meet the same difficulty. 

Since they are often not very large, it is, however, a sensible and still widely 
used approximation to neglect the quark polarization effects. The calculations of 
the pion decay constant and of the pseudo-scalar meson masses that are required 
to obtain /tt/A can then be carried out with the presently available computer 
resources (see refs. [11], [27] for example). As already mentioned, the lattices should 
be at least 2 fm wide in these computations, but this is not a problem here because 
the lattice spacing does not need to be extremely small at the same time. Values 
from 0.05 to 0.1 fm have actually been found to be adequate in this context. 

Once all this is done, and a careful analysis of the systematic errors is made, 
the combination of the results yields the value [24], [27] 

/^/A = 0.56 ±0.05. (3.5) 

This figure agrees with the experimental number /tt/A = 0.62 ± 0.10 within the 
quoted errors, which is somewhat unexpected since the quark polarization effects 
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Figure 9: In the language of Feynman diagrams, quark polarization effects are represented by 
subdiagrams with closed quark lines. For technical reasons, many results in lattice QCD still do 
not include these contributions. 



have been neglected. Maybe the ratio is not strongly affected by them, but there 
is currently no very good theoretical argument for this and the coincidence can, 
therefore, not be taken as a solid confirmation of QCD at this point. The compu- 
tation nevertheless provides important insights into how precisely the low- and the 
high-energy regimes of the theory are connected to each other, and it also shows 
the potential of the lattice approach to lead, in due time, to some very stringent 
tests of QCD. 



3.5 Real-world lattice QCD simulations 

At present the great challenge in lattice QCD is to devise efficient simulation meth- 
ods for the full theory (including quark polarization effects) that will work well 
on large lattices and at small quark masses. The fact that the currently available 
techniques will not lead very far became particularly clear at the lattice confer- 
ence last year in Berlin [10], where an effort was made to assess the cost of such 
calculations. 

The figure that is usually quoted in this context is the number of arithmetic 
operations that are required to generate the next statistically independent field 
configuration. An approximate and mostly empirical formula that was presented 
at the conference is [28] 



^ operations 
field configuration 



'140 MeV 


6 


L 


5 


'0.1 fm' 






3fm 




a 



Tflops year. 



(3.6) 



which shows the dependence on the lattice parameters and on the calculated value 
m^r of the pion mass (which depends on the specified values of the quark masses) . 
The result is given in Tflops years, the number of operations that a computer with 
a sustained speed of 1 Tflops (10^^ floating-point operations per second) performs 
in 1 year of running time. 

While such machines will become available to the lattice community in the 
near future, the poor scaling behaviour of the algorithms (the high powers in the 
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formula) tells us that it will not be possible to vary the lattice parameters over 
a wide range. If the lattice spacing is decreased by a factor of 2, for example, 
the simulation time goes up by a factor of 128 or so. Evidently this unfavourable 
situation calls for new algorithmic ideas, and a significant investment in purely 
technical R&D work will be required in the coming years to solve the problem. 

4 Conclusions 

Lattice QCD was introduced nearly 30 years ago and it has since then turned into a 
powerful quantitative approach to the physics of the strongly interacting particles. 
Only a few topics have been touched in this talk, but this should not hide the fact 
that the technique is being used to calculate many quantities of phenomenological 
interest (hadron masses, decay constants, transition matrix elements, and so on) 
and also to study some of the more fundamental issues such as the breaking of 
chiral symmetry and the phase diagram at non-zero baryon density. 

All this work has had its share in making QCD the unique candidate of 
the theory of the strong interactions. In particular, little doubt is left that the 
non-linearities of the dynamics of the SU(3) gauge field are responsible for the 
confinement of the quarks. Moreover we have seen that quark confinement and 
asymptotic freedom are just two complementary aspects of the theory. The fact 
that the relation between the parameters in the lagrangian and the basic properties 
of the mesons and nucleons can be worked out on the lattice is the key to showing 
this, along with the ability to compute the (non-perturbative) evolution of the 
effective gauge coupling from very low to high energies. 

Currently the quark polarization effects are often neglected in the numerical 
simulations because their inclusion slows down the computations by a large factor. 
It is mainly for this reason that comparisons of the lattice results with experimental 
numbers cannot at present be regarded as hard tests of QCD. Once this technical 
limitation is overcome, it is clear, however, that precision tests will become a 
reality, and the hope of the experts is that this will happen before the next 30 
years have elapsed! 

I am indebted to Rainer Sommer for sending a set of data tables and to 
Peter Weisz and Hartmut Wittig for helpful discussions and correspondence. Many 
thanks also go to Daniel lagolnitzer and Jean Zinn- Justin for having organized this 
unique conference. 
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Abstract. QCD sheds considerable light on several of the most basic features of 
the natural world including the origin of mass, the feebleness of gravity, the extent 
to which the properties of matter can be determined conceptually, the possible 
utility of the anthropic principle, and the meta-theoretic notions of effectiveness 
and computability. I discuss these applications here. 



Contemporary academic or lirmatural philosophy is characterized by extreme 
insistence on purity of abstract thought. A joke will illustrate the point. 

A man walks into a bar, takes a seat on the next-to-last stool, and spends the 
evening chatting up the empty stool next to him, being charming and flirtatious, 
as if there were a beautiful woman in that empty seat. The next night, same story. 
And the next night, same story again. Finally the bartender can’t take it any 
more. She asks, ‘Why do you keep talking to that empty stool as if there were a 
beautiful woman in it?” 

The man answers, “I’m a philosopher. Hume taught us that it’s logically 
possible that a beautiful woman will suddenly materialize on that stool, and no 
one has ever refuted him. If one does appear, then obviously I’ll seem very clever 
indeed, and I’ll have the inside track with her.” 

“That’s ridiculous,” says the bartender, who happens to be a physicist. 
“Plenty of very attractive women come to this bar all the time. You’re reason- 
ably presentable, and extremely articulate; if you applied your charm on one of 
them, you might succeed.” 

“I thought about trying that,” he replies, “but I couldn’t prove it would 
work.” 

In this vein, there are a lot of very attractive questions that working physi- 
cists normally abandon to the philosophers, or to quasi-scientists who speculate in 
reckless disregard of the facts, just because the questions are broad and qualitative. 
We shouldn’t. I’d like in this talk to make a stab at a few such questions. These 
are questions for which a relatively recently firmly established physical theory, 
quantum chromodynamics or QCD, affords us considerable new insight. I will not 
waste words introducing the technical content of QCD, nor defending its validity, 
as this background information is readily available in many places [1]. 
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1 What is the Origin of Mass? 

1.1 Framing the Question 

That a question makes grammatical sense does not guarantee that it is answerable, 
or even coherent. In that spirit, let us begin with a critical examination of the 
question posed in this Section: What is the origin of mass? 

In classical mechanics mass appears as a primary concept. It was a very great 
step for the founders of classical mechanics to isolate the scientific concept of mass. 
In Newton’s laws of motion, mass appears as an irreducible, intrinsic property of 
matter, which relates its manifest response (acceleration) to an abstract cause 
(force). An object without mass would not know how to move. It would not know, 
from one moment to the next, where in space it was supposed to be. It would be, 
in a very strong sense, unphysical. Also in Newton’s law of gravity, the mass of an 
object governs the strength of the force it exerts. One cannot build up an object 
that gravitates out of material that does not. Thus it is difficult to imagine, in the 
Newtonian framework, what could possibly constitute an “origin of mass” . In that 
framework, mass just is what it is. 

Later developments in physics make the concept of mass seem less irreducible. 
The undermining process started in earnest with the theories of relativity. The 
famous equation E = mc^ of special relativity theory, written that way, betrays 
the prejudice that we should express energy in terms of mass. But we can also 
read it as m = Ej(? ^ which suggests the possibility of explaining mass in terms 
of energy. In general relativity the response of matter to gravity is independent of 
mass (equivalence principle), while space-time curvature is generated directly by 
energy- momentum, according to R^jy — ^g^uR = with k ~ SttGn/c^’ Mass 

appears as a contributing factor to energy- momentum, but it has no uniquely 
privileged status. 

At an abstract level, ma.ss appears as a label for irreducible representations of 
the Poincare group. Since representations with m ^ 0 appear in tensor products 
of m = 0 representations it is possible, at least kinematically, to build massive 
particles as composites of massless particles, or of massless particles and fields. 

1.1.1 Lorentz’s Dre 2 im 

At a much more concrete level, the question of the origin of mass virtually forced 
itself upon physicists’ attention in connection with the development of electron 
theory. Electrons generate electromagnetic fields; these fields have energy and 
therefore inertia. Indeed, a classical point electron is surrounded by an electric 
field varying as e/r^. The energy in this field is infinite, due to a divergent con- 
tribution around r ^ 0. It was a dream of Lorentz (pursued in evolved forms by 
many others including Poincare, Dirac, Wheeler, and Feynman), to account for 
the electron’s mass entirely in terms of its electromagnetic fields, by using a more 
refined picture of electrons. Lorentz hoped that in a correct model of electrons 
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they would emerge as extended objects, and that the energy in the Coulomb field 
would come out finite, and account for all (or most) of the inertia of electrons. 

Later progress in the quantum theory of electrons rendered this program moot 
by showing that the charge of an electron, and therefore of course its associated 
electric field, is intrinsically smeared out by quantum fluctuations in its position. 
Indeed, due to the uncertainty principle the picture of electrons as ideal point 
particles certainly breaks down for distances r < h/mc, the Compton radius. At 
momenta p > h/r, the velocity p/m formally becomes of order c, and one cannot 
regard the electron as a static point source. If we cut off the simple electrostatic 
calculation at the Compton radius, we find an electromagnetic contribution to the 
electron mass of order Sm ~ am, where a == e^/Airhc ^ 1/137 is the fine structure 
constant. In this sense the easily identifiable and intuitive electromagnetic contri- 
bution to the mass, which Lorentz hoped to build upon, is small. To go further, we 
cannot avoid considering relativity and quantum mechanics together. That means 
quantum field theory. 

1.1.2 Its Debacle 

In quantum electrodynamics itself, the whole issue of the electromagnetic contri- 
bution to the electron mass becomes quite dodgy, due to renormalization. 

Quantum electrodynamics does not exist nonperturbatively. One can regu- 
late and renormalize order-by-order in perturbation theory, but there are strong 
arguments that the series does not converge, or even represent the asymptotic ex- 
pansion of a satisfactory theory. In a renormalization group analysis, this is because 
the effective coupling blows up logarithmically at short distances, and one cannot 
remove the cutoff. In a lattice regularization, one could not achieve a Lorentz- 
invariant limit. ^ So one cannot strictly separate the issue of electromagnetic mass 
from the unknown physics which ultimately regularizes the short- distance singu- 
larities of QED. 

If we regard QED as an effective theory, essentially by leaving in an energy 
cutoff A, corresponding to a distance cutoff he / we get a contribution to the mass 
at short distances going as Sm oc amlog(A/7n). Quantum mechanics has changed 
the power-law divergence into a logarithm. As a result, Sm is a fractionally small 
contribution to the total mass, at least for sub-Planckian A (i.e., A < 10^^ GeV). 
We know that QED ceases to be a complete description of physics, or even a 
well-isolated theory, far below such energies. 

In any case, since the mass renormalization is multiplicative, an electron field 
whose bare quanta have bare mass zero produces physics quanta with physical 

^Actually this blow-up, the famous Landau pole, arises from extrapolating the perturbative 
result beyond its range of validity. What one can deduce simply and rigorously is that the effective 
coupling does not become small at short distances; QED is not asymptotically free. If there is a 
fixed point at finite coupling, it may be possible to construct a relativistically invariant limiting 
theory. But even if such a theory were to exist, its physical relevance would be quite dubious, 
since we know that there’s much more to physics than electrodynamics at distances so short that 
the logarithms matter. 
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mass zero. Indeed, the massless version of QED has enhanced symmetry - chiral 
symmetry - which is not destroyed by (perturbative) renormalization. 

In short, very little seems to survive from Lorentz’s original dream. I’ve de- 
scribed this fiasco in some detail, since it provides an instructive background to 
contrast with our upcoming considerations. 



1.1.3 Upping the Ante 

Quantum field theory changes how we view the question of the origin of mass 
dramatically. As we have seen, it quashes hope for a simple classical mechanistic 
explanation. 

At a more profound level, however, quantum field theory makes the question 
of the origin of mass seem both better posed and more crucial. Very few such 
theories are known to exist, non-perturbatively, in four space-time dimensions. 
Even if we relax our standards to include renormalizable quantum field theories, i.e. 
those which are reasonably self-contained as effective field theories, we find a very 
restricted set. Such theories, unless they contain many fields and few symmetries, 
allow few continuous parameters. Among those few, parameters specifying bare 
masses of the quanta feature prominently. So massless theories are even more 
tightly constrained, and bring us closer to the ideal of a parameter- free, purely 
conceptual description of Nature. 

Moreover, theories often acquire enhanced symmetry when mass parameters 
vanish. In such cases masslessness is “natural” , in the sense that it can be stated 
as an abstract structural property of the theory, to wit its enhanced symmetry. To 
the extent we feel a more symmetric theory is more beautiful, the massless theories 
are singled out as more beautiful. These features of naturalness and beauty, which 
motivate massless theories, survive even if one is not committed to the technical 
requirement of renormalizability. 

1.2 Most of the Answer: QCD Lite 

Enough of generalities! I want now to describe some very specific and I think quite 
beautiful insights into the origin of mass in the real world. We will construct - 
following Nature - mass without mass, using only c and h. 



1.2.1 Introducing QCD Lite 

My central points are most easily made with reference to a slight idealization of 
QCD which I call, for reasons that will be obvious, QCD Lite. It is a nonabelian 
gauge theory based on the gauge group 5U(3) coupled to two triplets and two 
anti-triplets of left-handed fermions, all with zero mass. Of course I have in mind 
that the gauge group represents color, and that one set of triplet and antitriplet 
will be identified with the quark fields ul,ur and the other with di^dR. 
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Upon demanding renormalizability,^ this theory appears to contain precisely 
one parameter, the coupling g. It is, in units with h = c = 1 , a pure number. 
I’m ignoring the 0 parameter, which has no physical content here, since it can 
be absorbed into the definition of the quark fields. Mass terms for the gluons are 
forbidden by gauge invariance. Mass terms for the quarks are forbidden by chiral 
SU{2)l X SU{2)r flavor symmetry. 

1.2.2 Running Coupling; Dimensional Transmutation 

The coupling constant g that appears in the Lagrangian of QCD Lite, like the cor- 
responding constant e in QED, is a dimensionless number (in units with h — c— 1). 
Likewise for the fine-structure constant ^5 = g'^fAn. But the real situation, when 
we take into account the effect of quantum mechanics, is quite different. Empty 
space is a medium, full of virtual particles, and responds dynamically to charges 
placed within it. It can be polarized, and the polarization clouds surrounding test 
charges can shield (screen) or enhance (antiscreen) their strength. In other words, 
quantum- mechanically the measured strength of the coupling depends on the dis- 
tance scale, or equivalently the (inverse) energy, scale at which it is measured: 
as as{Q)- This is a central feature of QCD Lite, and of course of QCD it- 
self. These theories predict that the effective coupling gets small at large Q, or 
equivalently at short distance. 

This behavior displays itself in a remarkably direct and tangible form in the 
final states of electron-positron annihilation. Hadrons emerging from high-energy 
electron-positron annihilation organize themselves into collimated jets. Usually 
there are two jets, but occasionally three. The theoretical interpretation is profound 
but, given asymptotic freedom, straightforward. The primary products emerging 
from the annihilation are a quark and an antiquark. They emit soft - that is, low 
energy- momentum - radiation copiously, but only rarely hard radiation. That’s 
a restatement, in momentum space, of asymptotic freedom. The soft radiation 
materializes as many particles, but these particles inherit their direction of flow 
from the quark or antiquark, and therefore constitute a jet. In the relatively rare 
case that there is hard radiation, that is to say emission of an energetic gluon, the 
gluon induces its own independent jet. All this can be made completely quanti- 
tative. There are precise predictions for the ratio of three- to two-jet events, the 
rare occurrence of four- or more jet events, how these ratios change with energy, 
angular dependence, and so forth. The observations agree with these predictions. 
Thus they provide overwhelming, direct evidence for the most basic elements of 
the theory, that is the quark- gluon and gluon-gluon couplings. 

Because the coupling runs we can, within any given version of QCD Lite, 
measure any given numerical value a = as(Q), simply by choosing an appropriate 
Q. It appeared, classically, that we had an infinite number of different versions 
of QCD Lite, with different values of the coupling parameter. In reality the only 

^Or, in physical terms, the existence of a relativistically invariant limiting theory. Or alterna- 
tively, radical decoupling from an unspecified, gratuitous high-energy cutoff. 
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difference among all these theories, after they are quantized, is the unit they choose 
for measuring mass. All dimensionless physical parameters, and in particular all 
mass ratios, are uniquely determined. We can trade the dimensionless parameter 
g for the unit of mass. This is the phenomenon of dimensional transmutation. 

Of course the value of the overall energy scale makes a big difference when 
we come to couple QCD Lite, or of course QCD, to the rest of physics. Gravity, 
for example, cares very much about the absolute value of masses. But within QCD 
Lite itself, if we compute any dimensionless quantity whatsoever, we will obtain a 
unique answer, independent of any choice of coupling parameter. Thus, properly 
understood, the value of the QCD coupling constant does not so much govern 
QCD itself - within its own domain, QCD is essentially unique - but rather how 
QCD fits in with the rest of physics. 

1.2.3 Physical Mass Spectrum - QCD Lite and Reality 

Now let us consider more concretely how these dynamical phenomena lead us to a 
non-trivial hadron spectrum. Looking at the classical equations of QCD, one would 
expect an attractive force between quarks that varies with the distance as ^r^/dTrr^, 
where g is the coupling constant. This result is modified, however, by the effects 
of quantum fluctuations. As we have just discussed, the omnipresent evanescence 
of virtual particles renders empty space into a dynamical medium, whose response 
alters the force law. The antiscreening effect of virtual color gluons (asymptotic 
freedom), enhances the strength of the attraction, by a factor which grows with 
the distance. This effect can be captured by defining an effective coupling, p(r), 
that grows with distance. 

The attractive interaction among quarks wants to bind them together; but 
the potential energy to be gained by bringing quarks together must be weighed 
against its cost in kinetic energy. In a more familiar application, just this sort 
of competition between Coulomb attraction and localization energy is responsible 
for the stability and finite size of atoms. Here, quantum-mechanical uncertainty 
implies that quark wave- functions localized in space must contain a substantial 
admixture of high momentum. For a relativistic particle, this translates directly 
into energy. If the attraction followed Coulomb’s law, with a small coupling, the 
energetic price for staying localized would always outweigh the profit from attrac- 
tion, and the quarks would not form a bound state. Indeed, the kinetic energy hc/r 
beats the potential energy g‘^ jAirr. But the running coupling of QCD grows with 
distance, and that tips the balance. The quarks finally get reined in, at distances 
where Oùs{r) becomes large. 

We need not rely on heuristic pictures, or wishful thinking, to speculate about 
the mass spectrum of QCD Lite. It has been calculated by direct numerical integra- 
tion of the fundamental equations, using the techniques of lattice gauge theory^. 

^ There are significant technical issues around realizing chiral symmetry in numerical work 
involving discretization on a lattice. Recent theoretical work appears to have resolved the con- 
ceptual issues, but the numerical work does not yet fully reflect this progress. To avoid a fussy 
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The results bear a remarkable qualitative and semi-quantitative resemblance to the 
observed spectrum of non-strange hadrons, generally at the 10% level, comfortably 
within known sources of error due to finite size, statistics, etc. - and (anticipating) 
small quark masses. Of course, in line with our preceding discussion, the over- 
all scale of hadron masses is not determined by the theory. But all mass ratios 
are predicted, with no free parameters, as of course are the resonance quantum 
numbers. 

QCD Lite is not the real world, of course. So although in QCD Lite we get 
mass without mass in the strict sense, to assess how much real-world mass arises 
this way, we need to assess how good an approximation QCD Lite is to reality, 
quantitatively. We can do this by adjusting the non-zero values of rriu and md to 
make the spectrum best fit reality, and then seeing how much they contributed to 
the fit.'^ Unlike charges in QED, masses in QCD are soft perturbations, and we can 
calculate a meaningful finite difference between the spectra of these two theories. 
There is also a well- developed alternative approach to estimating the contribution 
of quark masses, by exploiting the phenomenology of chiral symmetry breaking. 
Either way, one finds that the quark masses contribute at most a few per cent to 
the masses of protons and neutrons. 

Protons and neutrons, in turn, contribute more than 99% of the mass of 
ordinary matter. So QCD Lite provides, for our purpose, an excellent description 
of reality. The origin of the bulk of the mass of ordinary matter is well accounted 
for, in a theory based on pure concepts and using no mass parameters - indeed, 
no mass unit - at all! 



1.2.4 Comparing With the Old Dream 

While our final result fulfills the fantasy of Lorentz’s dream, the details and the 
mechanism whereby it is achieved are quite different from what he imagined. 

Obviously we are speaking of hadrons, not electrons, and of QCD, not clas- 
sical electrodynamics. The deepest difference, however, concerns the source and 
location of the energy whereby m = Efc? is realized. In Lorentz’s dream, the 
energy was self-energy, close to the location of the point particle. In QCD Lite 
the self-mass vanishes. Paradoxically, there is a sense in which the self-energy of 
a quark^ is infinite (confinement), but this is due to the spatial extent of its color 
field, which has a tail extending to infinity, not to any short-distance singularity. 
To make physical hadrons, quarks and gluons must be brought together, in such 
a way that the total color vanishes. Then there is no infinite tail of color fiux. The 
different tails of the individual quarks and gluons have annulled one another. But 

presentation I’ve oversimplified by passing over these issues, which do not affect my main point. 

Again, there are significant technical issues here, especially regarding the role of the strange 
quark. Fortunately, the uncertainties are numerically small. 

^Infinite self-energy does not conflict with zero mass. E = mc^ describes the energy of a 
particle of mass m when it is at rest; but of course, as we know from photons, there can also be 
energy in massless particles, which cannot be brought to rest. 
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at finite distances the cancellation is incomplete, because Heisenberg’s uncertainty 
principle imposes an energetic cost for keeping color charges precisely localized 
together. The bulk of the mass of the hadrons comes from the residues of these 
long tails, not from singularities near point-like color charges. 

1.2.5 Comparing With the Old Debacle 

It is instructive to review, point by point, how the obstructions that quantum 
field theory appeared to pose for the “Origin of Mass” or “Mass without Mass” 
program are resolved in QCD. 

First of all QCD does exist even nonperturbatively, so the issues connected 
with renormalization can be analyzed consistently down to rock bottom. 

Second, the massless theory is “natural” in the technical sense defined above. 
It displays enhanced symmetry. The gluons have no mass parameter because there 
is no such parameter consistent with local gauge symmetry. For quarks, there is 
an additional chiral SU{2)l x SU{2)r symmetry, expanded from the vectorial 
SU{2)l+r, when they massless. 

Third, these symmetries, which guarantee zero bare mass for the bare quanta, 
do not forbid non-zero mass for physical quanta. The gauge symmetry acts triv- 
ially on the physical quanta (confinement). The chiral symmetry is spontaneously 
broken: it is a property of the underlying equations but not of their stable solution 
(i.e. of their physical realization). Spontaneously broken chiral symmetry predicts 
the existence of massless collective modes, the Nambu- Golds tone bosons - pions 
- along with many of their properties; but it does not forbid mass for the other 
hadrons - protons, neutrons, etc. 

Fourth, the realistic theory, with small non-zero quark masses, is a soft per- 
turbation of the massless theory. It does not require that we make any alteration 
in the renormalization procedure. Thus it is commensurable with the massless 
theory. We can compare them unambiguously and quantitatively. 

Fifth and finally, this quantitative comparison shows that the non-zero quark 
masses in Nature contribute only a small fraction to the masses of protons and 
neutrons. The chiral symmetry of the massless theory is approximately valid in 
reality, and is the basis of a very useful phenomenology. The mass ratios of hadrons 
computed in the idealized massless theory are very nearly the same as in reality 
(apart, of course, from the pions). 

1.3 (Many) Remaining Issues 

While the dynamical energy of massless QCD accounts for the bulk of mass in 
ordinary matter, it is far from providing the only source of mass in Nature. 

Mass terms for quarks and charged leptons appear to violate the electroweak 
gauge symmetry SU{2) x U{1). But gauge symmetry cannot be violated in the 
fundamental equations - that would lead to ghosts and/or non-unitarity, and pre- 
vent construction of a sensible quantum theory. So these masses must, in a sense. 
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have their “origin” in spontaneous symmetry breaking. That is accomplished, in 
the Standard Model, by having a non-singlet Higgs field acquire a vacuum ex- 
pectation value. Why this value is so small, compared to the Planck scale, is one 
aspect of what is usually called the hierarchy problem. Why the couplings of this 
field are so disparate - especially, what is particularly crucial to the structure of 
physical reality, why its dimensionless couplings to e, u, d are so tiny (in the range 
10“^ — 10“^) - is an aspect of what is usually called the fiavor problem. 

Then there are separate problems for generating masses of supersymmetric 
particles (soft breaking parameters, fi term), for generating the mass of cosmolog- 
ical ‘dark matter’ (this might be included in the previous item!), for generating 
neutrino masses, and now apparently for generating the mass density of empty 
space (cosmological term). 

Obviously, many big questions about the origin of mass remain. But I think 
we’ve answered a major one, the one that is most important quantitatively for 
ordinary matter, beautifully and convincingly. 

2 Why is Gravity Feeble? 

Gravity dominates the large-scale structure of the Universe, but only so to speak 
by default. Matter arranges itself to cancel electromagnetism, and the strong and 
weak forces are intrinsically short-ranged. At a more fundamental level, gravity 
is extravagantly feeble. Acting between protons, gravitational attraction is about 
10“^® times weaker than electrical repulsion. Where does this outlandish disparity 
comes from? What does it mean? 

Feynman wrote 

There’s a certain irrationality to any work on [quantum] gravitation, 
so it’s hard to explain why you do any of it . . . It is therefore clear 
that the problem we working on is not the correct problem; the correct 
problem is What determines the size of gravitation? 

I want to argue that in modern physics it is natural to view the problem of 
why gravity is extravagantly feeble in a new way - upside-down and through a 
distorting lens compared to its superficial appearance. When viewed this way, it 
comes to seem much less enigmatic. 

First let me quantify the problem. The mass of ordinary matter is dominated 
by protons (and neutrons), and the force of gravity is proportional to (mass)^. 
From Newton’s constant, the proton mass, and fundamental constants we can 
form the pure dimensionless number 

X = GNTTlplhc , 

where Gat is Newton’s constant, rup is the proton mass, h is Planck’s constant, 
and c is the speed of light. Substituting the measured values, we obtain 

X ^ Qx 10"^^ . 
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This is what we mean, quantitatively, when we say that gravity is extravagantly 
feeble. 

We can interpret X directly in physical terms, too. Since the proton’s geo- 
metrical size R is roughly the same as its Compton radius h/ rripC^ the gravitational 
binding energy of a proton is roughly G^'n^p/R ~ XrripC^ . So X is the fractional 
contribution of gravitational binding energy to the proton’s rest mass! 

2.0. 1 Planck’s Hypothesis 

An ultimate goal of physical theory is to explain the world purely conceptually, 
with no parameters at all. Superficially, this idea seems to run afoul of dimensional 
analysis - concepts don’t have units, but physical quantities do! 

There is a sensible version of this goal, however, that is rapidly becoming 
conventional wisdom, despite falling well short of scientific knowledge. Soon after 
he introduced his constant /i, in the course of a phenomenological fit to the black- 
body radiation spectrum, Planck pointed out the possibility of building a system of 
units based on the three fundamental constants /i, c, Gtv- Indeed, from these three 
we can construct a unit of mass {hc/GjsiY^’^ ^ a unit of of length and 

a unit of time {hGjsi / ~ what we now call the Planck mass, length, and time 
respectively. 

Planck’s proposal for a system of units based on fundamental physical con- 
stants was, when it was made, rather thinly rooted in physics. But by now there 
are profound reasons to regard c, h and G as conversion factors rather than numer- 
ical parameters. In the special theory of relativity, there are symmetries relating 
space and time - and c serves as a conversion factor between the units in which 
space- intervals and time- intervals are measured. In quantum theory, the energy of 
a state is proportional to the frequency of its oscillations and h is the conversion 
factor. Thus c and h appear directly as measures in the basic laws of these two 
great theories. Finally, in general relativity theory, space-time curvature is pro- 
portional to the density of energy - and G^ (actually G^/c^) is the conversion 
factor. 

If we want to adopt Planck’s astonishing hypothesis that we must build up 
physics solely from these three conversion factors, then the enigma of A’s smallness 
looks quite different. We see that the question it poses is not “Why is gravity so 
feeble?” but rather “Why is the proton’s mass so small?” . For according to Planck’s 
hypothesis, in natural (Planck) units the strength of gravity simply is what it is, 
a primary quantity. So it can only be the proton’s mass which provides the tiny 
number VX. 

2.0. 2 Running in Place 

That’s a provocative and fruitful way to invert the question, because we now have 
a quite deep understanding of the origin of the proton’s mass, as I’ve reviewed 
above. 
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The proton mass is determined, according to the dynamics I’ve described, by 
the distance at which the running QCD coupling becomes strong. Let’s call this 
the QCD-distance. Our question, “Why is the proton mass so small?” has been 
transformed into the question, “Why is the QCD-distance is much larger than the 
Planck length?” To close our circle of ideas we need to explain, if only the Planck 
length is truly fundamental, how it is that such a vastly different length arises 
naturally. 

This last elucidation, profound and beautiful, is worthy of the problem. It 
has to do with how the coupling runs, in detail. When the QCD coupling is weak, 
‘running’ is actually a bit of a misnomer. Rather, the coupling creeps along like a 
wounded snail. We can in fact calculate the behavior precisely, following the rules 
of quantum field theory, and even test it experimentally, as I mentioned before. 
The inverse coupling varies logarithmically with distance. So if we want to evolve 
an even moderately small coupling into a coupling of order unity, we must let it run 
between length-scales whose ratio is exponentially large. So if the QCD coupling 
is even moderately small at the Planck length it will reach unity only at a much 
larger distance. 

Numerically, what we predict is that as{lp\.) at the Planck length is roughly a 
third to a fourth of what it is observed to be at 10“^^ cm; that is, a 5 (/pi.) ^ 1/30. 
We cannot measure as{lp\.) directly, of course, but there are good independent 
reasons, having to do with the unification of couplings, to believe that this value 
holds in reality. It is amusing to note that in terms of the coupling itself, what 
we require is gs{lpi.) ~ 1/2! From this modest and seemingly innocuous numerical 
hypothesis, which involves neither really big numbers nor speculative dynamics 
beyond what is supported by hard experimental evidence, we have produced a 
logical explanation of the tiny value of X. 

3 The Reduction of Matter 

The reductionist program, roughly speaking, is to build up the description of 
Nature from a few laws that govern the behavior of elementary entities, and that 
can’t be derived from anything simpler. This definition is loose at both ends. 

On the output side, because in the course of our investigations we find that 
there are important facts about Nature that we have to give up on predicting. 
These are what we - after the fact! - come to call contingencies. Three histori- 
cally important examples of different sorts are the number of planets in the Solar 
System, the precise moment that a radioactive nucleus will decay, or what the 
weather will be in Boston a year from today. In each of these cases, the scientific 
community at first believed that prediction would be possible. And in each case, 
it was a major advance to realize that there are good fundamental reasons why it 
is not possible. 

On the input side, because it is difficult - perhaps impossible - ever to prove 
the non-existence of simpler principles. I’ll revisit this aispect at the end of the 
lecture. 
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Nevertheless, and despite its horrible name, the reductionist program has 
been and continues to be both inspiring and astoundingly successful. Instead of 
trying to refine an arbitrary a priori definition of this program, more edifying is 
to discuss its best fruits. For beyond question they do succeed to an astonishing 
extent in “reducing” matter to a few powerful abstract principles and a small 
number of parameters. 

It is most instructive to build up to our most complete reduction of matter 
from some intermediate models that are extremely important and useful in their 
own right, 

3.1 Structural Chemistry 

The first such model is obtained by regarding both nuclear masses and the speed 
of light as effectively infinite. We then have the theory of nonrelativistic electrons 
in the presence of highly localized, static sources of positive charge, in multiples 
of |e|, where e is the electron charge. We compute the energy as a function of the 
positions of the source charges, using the Schrodinger equation. Local minima de- 
fine molecules, and we can compute their wavefunctions. This model gives a good 
approximation for the vast subject of structural chemistry®. Its only continuous 
parameters are h, e and the electron mass rUg. Since h can be considered as a con- 
version factor, and the Bohr radius f? jm^ just sets the overall length scale, this 
is essentially a one-parameter theory. Indeed, since one cannot manufacture a di- 
mensionless quantity out of /i, e, rrig, the remaining parameter is really a conversion 
factor too. Thus we have a zero-parameter theory of structural chemistry! 

A more accurate version - QED with sources - recognizes oo and brings 
in real and virtual photons. Now there is a pure number in the game, namely of 
course the fine structure constant jAnhc. Regarding h and c as conversion factors, 
and taking into account that hlrrieC can be used to define the overall length scale, 
we arrive at a very accurate one-parameter theory of structural chemistry and 
photochemistry. 

3.2 Matter 

The shortcoming of this theory, of course, is that the matter doesn’t move (except 
by radiative transitions). The theory is missing vibrational and rotational levels, 
not to mention reactions, diffusion, etc. 

To allow motion, we must put in finite values for the nuclear masses. In 
doing this, we open ourselves up to many many more input parameters. At a 
minimum we need the masses of the different nuclei and isotopes, and for accuracy 
in details we need their spins and magnetic moments, etc. At this level, all of 
them must be taken from experiment. There are literally hundreds of parameters. 
Nevertheless, because they are highly overconstrained by hundreds of thousands 

® There is an additional fine point, that we must specify some symmetry rules to take account 
of quantum statistics for identical nuclei. 
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of measurements, the resulting theory is extremely useful. It is the foundation of 
practical quantum chemistry. 

The next step in reduction is the one supplied by QCD. QCD assures us that 
all the nuclear parameters can be calculated crudely - perhaps at the 10% level - 
using QCD Lite. To do this, we need only one parameter beyond h and c, as Tve 
already discussed. That parameter can be taken as the coupling normalized 
at some appropriate energy scale (say lO^meC^), or equivalently the energy scale 
AqcDj which is roughly speaking the scale at which the coupling becomes large. To 
do a more accurate job, we also need to introduce the light quark masses rud^ 

Our faith in QCD is not based primarily on its practical ability to reproduce 
nuclear parameters. The best quantitative tests of the theory come from an entirely 
different domain of behavior, scattering at ultra- high energy, where the quark and 
gluon degrees of freedom and their couplings are clearly revealed. We rely on these 
successful outcome of these tests, and (so far) rather crude results from massive 
numerical calculations of proton structure and the hadron spectrum for empirical 
support. This support is highly leveraged by the fact that the theory is rigidly 
principled. It embodies the basic principles of relativity, quantum mechanics, and 
gauge symmetry. Only a few parameters appear in the most general consistent 
embodiment of these principles. They are just the ones we use!^ 

So in practice the “reduction” we offer the chemists, or for that matter the 
nuclear physicists, is not of much use to them. But it is there, as a matter of 
principle. 

At this point our best available reduction of ordinary matter is in place. A 
fundamental theory that we believe offers an extremely complete and accurate set 
of equations the structure and behavior of matter in ordinary circumstances, with 
a very liberal definition of “ordinary”, requires the parameters /i, c, e, me, Aqcd, 
mu^nfid- Among these h and c can be regarded as conversion factors, tRu and 
rud are relatively minor players, and one combination just sets an overall length 
scale. Thus we have a rough reduction of matter involving two parameters, and an 
accurate one involving four. Not bad! 

3.3 Astrophysics 

In astrophysics, as opposed to condensed matter physics, chemistry, and biology, 
gravity and the weak interactions play important roles. Their intrinsic weakness 
is compensated by the availability of vast amounts of matter and vast amounts 
of time, respectively. To treat these additional interactions we must introduce 
two additional parameters, the Newton constant Gn and the Fermi constant G'i?. 
Altogether, then, we now have h^c^e^me^h.qcD^'^ui'^d^GNiGF- Referring back 
to our discussion of Planck’s hypothesis, we are inclined to regard /i, c and Gjs 
as conversion factors. Also, we derive both e and Aqcd from the value of the 
unified coupling at the Planck scale. So we are left with this “reasonable” number 



^There is also the notorious 6 parameter, a can of worms I don’t want to dive into just now. 
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(s^unified ~ 1/2) and four “unrecisonable” numbers: me,rriu,md, and expressed 
in Planck units. 

The absurd smallness of these numbers, and of the dimensionless ratio^ 
me{GF/àc)~ 2 ^ define important unmet challenges for theoretical physics. They 
are salient aspects of what are called the hierarchy and flavor problems, respec- 
tively. 

Though wonderful challenges remain, the success of the reductionist pro- 
gram is already most impressive. An accurate description of matter, including 
astrophysics, is built up from a few abstract principles and five parameters. 

3.4 Cosmology and Quantitative Anthropics 

Having come this far it would be inhuman not to take the discussion to its logical 
climax in cosmology. That brings in many unsettled questions that go far beyond 
the scope of this lecture. I’ll restrict myself to a very brief discussion of a few 
issues closely tied to the foregoing considerations. I’ll assume as given the emerging 
“standard model of cosmology” , based on small perturbations around a spatially 
flat Friedmann-Robertson- Walker model. 

To specify the standard model of cosmology quantitatively we must add a few 
more parameters to our world-model. These include at least the baryon number 
density pb, the dark matter density pd, the overall amplitude A of a scale- invariant 
spectrum of primordial fluctuations, assumed adiabatic and Gaussian, and the cos- 
mological term A. Further work may uncover the need for additional parameters, 
for example if the Universe is not quite flat, or if the spectrum of perturbations is 
not quite scale-invariant, but for now the four I mentioned appear to form an ade- 
quate, minimal set. If we accept that cosmology is “reduced” to general principles 
and four continuous parameters, beyond the ones we use to describe matter and 
astrophysics, the question arises how far this achievement fulfills the reductionist 
program. I think that most physicists do not feel entirely satisfied with it. That 
is because the parameters in cosmology, given in this way, do not appear directly 
in the description of the behavior of simple fundamental entities. Rather, they ap- 
pear as the description of average properties of macroscopic (very macroscopic!) 
agglomerations. We would like to calculate these parameters in terms of different 
ones more related to the fundamental laws of physics of the early Universe. There 
are rough ideas about how we might do this in the cases of pb (B and CP violation) 
and pd (axion and/or lightest supersymmetric particle production), even rougher 
ideas in the case of A (inflationary potentials), and various tenuous speculations 
regarding A. There is a a very good chance, I think, that the matter of the dark 
matter will detected experimentally in the near future. Then we’ll have a much 
better chance to understand at least pd properly! 

Taken this far the status of the reductionist program in cosmology, though 
much less fully developed, is not very different in character from the preceding 

is important to note that the dimensionless form of this ratio does not involve Gn- Its 
absurd smallness is not an artifax:t of Planck units. 
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discussions of matter and astrophysics. But there are specifically cosmological 
issues, too. 

Although it is usually passed over in silence I think it is very important philo- 
sophically, and deserves to be emphasized, that our standard cosmology is radically 
modest in its predictive ambitions. It consigns almost everything about the world 
as we find it to contingency. That includes not only the aforementioned question 
of the number of planets in the Solar System, but more generally every specific 
fact about every specific object or group of objects in the Universe, apart from a 
few large-scale statistical regularities. Indeed, specific structures are supposed to 
evolve from the primordial perturbations, and these are only characterized statis- 
tically. In infiationary models these perturbations arise as quantum fluctuations, 
and their essentially statistical character is a consequence of the laws of quantum 
mechanics. 

This unavoidably suggests the question whether we might find ourselves 
forced to become even more radically modest®. Let us suppose for the sake of 
argument the best possible case, that we had in hand the fundamental equations 
of physics. Some of my colleagues think they do, or soon will. Even then we have 
to face the question of what principle determines which solution of these equations 
describes the observed Universe. Let me again suppose for the sake of argument 
the best possible case, that there is some principle that singles out a unique ac- 
ceptable solution. Even then there is a question we have to face: If the solution is 
inhomogeneous, what determines our location within it? 

Essentially inhomogeneous solutions of fundamental equations are by no 
means a contrived or remote possibility. They are the heart of all the brane-world 
constructions that have received much attention recently; they are characteristic 
of eternal inflation models; they arise very naturally in axion physics and its gen- 
eralizations [1]. On a less cosmic scale, closer to home, they are what we see all 
around us - and of course, each of us is a highly structured inhomogeneity. 

The standard cosmological model is yet another case, and supplies an espe- 
cially clear parable. As we have just discussed, the laws of reductionist physics 
do not suffice to tell us about the specific properties of the Sun, or of Earth. In- 
deed, there are many roughly similar but significantly different stars and planets 
elsewhere in the same Universe. On the other hand we can aspire to a rational, 
and even to some extent quantitative, “derivation” of the parameters of the Sun 
and Earth based on fundamental laws, if we define them not by pointing to them 
as specific objects (ostensively) - that obviates any derivation - but rather by 
characterizing broad aspects of their behavior. In principle any behavior will do, 
but possibly the most important and certainly the most discussed is their role in 
supporting the existence of intelligent observers, the so-called anthropic principle. 
There are many peculiarities of the Sun and Earth that can be explained this way. 
A crude example is that the mciss of the Sun could not be much bigger or much 



few outliers hope to move back the other way. One can only wish them luck. 
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smaller than it actually is, respectively because it would burn out too fast or not 
radiate sufficient energy. 

Now if the Universe as we now know it constitutes, like the Solar System, 
an inhomogeneity within some larger structure, what might be a sign of it^^. If 
the parameters of fundamental physics crucial to life - just the ones we’ve been 
discussing! - vary from place to place, and most places are uninhabitable, there 
would be a signature to expect. We should expect to find that some of these 
parameters appear very peculiar - highly non-generic - from the point of view 
of fundamental theory, and that relatively small changes in their values would 
preclude the existence of intelligent observers. Weinberg has made a case that 
the value of the cosmological term A fits this description; and I’m inclined to 
think that {rriu — 'n^d)/^QCD and several other combinations of the small number 
of ingredients in our reduced description of matter and astrophysics do too. A 
fascinating set of questions is suggested here, that deserves careful attention [2]. 



4 Metatheory 

QCD is unique among physical theories in its combination of logical closure and 
empirical success. So it is appropriate to consider it from the outside, as an object 
of metatheory. 



4.1 The Unreasonable Effectiveness of QCD 

There is a sense in which QCD is better than it has to be. It is not a complete theory 
of the world. Even within the standard model quarks have additional interactions, 
beyond QCD, that become significant - indeed, problematic - at short distances. 
So there is no logical requirement that QCD, extrapolated on its own down to 
infinitely short distances, should exist as a fully consistent relativistic quantum 
field theory. But it does. Is this just a happy coincidence, or does it point to 
something deeper? (Candidate idea: there must be fully consistent quantum field 
theories, because such theories are the ultimate description of Nature. A few years 
ago this might have appeared as string-blasphemy, now it might be acceptable as 
duality.) 

Although it’s somewhat off the point. I’ll take the opportunity to mention 
that this over-effectiveness reminds me of another apparently quite unrelated case, 
the cosmic censorship hypothesis in general relativity. We know that classical gen- 
eral relativity is not a complete theory, and there’s no logical reason why it should 
hide its defects from view. Yet in many cases, and perhaps generically, it does seem 
that singularities get hidden behind horizons. 



course, to make it interesting I’m assuming that we can’t actually look beyond, due to 
the veistness of the homogeneous region we’re in, or the limitations of our probes. 
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4.2 The Unreasonable Ineffectiveness of QCD 

There are some aspects of QCD I find deeply troubling - though Fm not sure if I 
should! 

I find it disturbing that it takes vast computer resources, and careful limiting 
procedures, to simulate the mass and properties of a proton with decent accuracy. 
And for real-time dynamics, like scattering, the situation appears pretty hopeless. 
Nature, of course, gets such results fast and effortlessly. But how, if not through 
some kind of computation, or a process we can mimic by computation? 

Does this suggest that there are much more powerful forms of computa- 
tion that we might aspire to tap into? Does it connect to the emerging theory of 
quantum computers? These musings suggest some concrete challenges: Could a 
quantum computer calculate QCD processes efficiently? Could it defeat the sign 
problem, that plagues all existing algorithms with dynamical fermions? Could it 
do real-time dynamics, which is beyond the reach of existing, essentially Euclidean, 
methods? 

Or, failing all that, does it suggest some limitation to the universality of 
computation? 

Deeply related to this is another thing I find disturbing. If you go to a serious 
mathematics book and study the rigorous construction of the real number system, 
you will find it is quite hard work and cumbersome. QCD, and for that matter the 
great bulk of physics starting with classical Newtonian mechanics, has been built 
on this foundation. In practice, it functions quite smoothly. It would be satisfying, 
though, to have a “more reduced” description, based on more primitive, essentially 
discrete structures. Fredkin and recently Wolfram have speculated at length along 
these lines. I don’t think they’ve got very far, and the difficulties facing such a 
program are immense. But it’s an interesting issue. 
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RHIC and High Energy Density Matter 



Larry McLerran 



Abstract. The central goal of the RHIC experimental program at Brookhaven Na- 
tional Laboratory is to make and study the Quark Gluon Plasma. Another new 
form of matter, the Color Glass Condensate may be formed in these collisions. The 
recent results from RHIC are discussed in this context. 



1 High Density Matter and High Energy Nuclear Physics 
1.1 The Goals of RHIC 

The goal of nuclear physics has traditionally been to study matter at the highest 
possible densities. This has been done in the past by studying the atomic nucleus, 
where energy densities become of the order of 

e - .15 GeVlFm^ (1) 

High energy nuclear physics has extended this study to energy densities several 
orders of magnitude higher. This extension includes the study of matter inside 
ordinary strongly interacting particles, such as the proton and the neutron, and 
producing new forms of matter at much higher energy densities in high energy 
collisions of nuclei with nuclei, and various other probes. 

RHIC is a multi-purpose machine which can address at least three central 
issues of high energy nuclear physics. These are: 

• The production of matter at energy densities one to two orders of magni- 
tude higher than that of nuclear matter and the study of its properties. 

This matter is at such high densities that it is only simply described in 
terms of quarks and gluons and is generically referred to as the Quark Gluon 
Plasma. The study of this matter may allow us to better understand the 
origin of the masses of ordinary particles such as nucleons, and of the con- 
finement of quarks and gluons into hadrons. The Quark Gluon Plasma will 
be described below. 

• The study of the matter which controls high energy strong interactions. 

This matter is believed to be universal (independent of the hadron), and 
exists over sizes enormous compared to the typical microphysics size scales 
important for high energy strong interactions. (The microphysics time scale 
here is roughly the size scale here is about 1 Fm and the microphysics time 
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scale is the time it takes light to fly 1 Fm, t 10“^^ sec.) It is called a Color 
Glass Condensate because it is composed of colored particles, gluons, evolves 
on time scales long compared to microphysics time scales and therefore has 
properties similar to glasses, and a condensate since the phase space density 
of gluons is very high. The study of this matter may allow us to better under- 
stand the typical features of strong interactions when they are truly strong, 
a problem which has eluded a basic understanding since strong interactions 
were first discovered. The Color Glass Condensate will be described below. 

• The study of the structure of the proton, most notably spin. 

Because of the subject of this meeting, I shall discuss only the first two issues. 

1.2 The Quark Gluon Plasma 

This section describes what is the Quark Gluon Plasma, why it is important for 
astrophysics and cosmology, and why it provides a laboratory in which one can 
study the origin of mass and of confinement. [1] 

1.2.1 What is the Quark Gluon Plasma? 

Matter at low energy densities is composed of electrons, protons and neutrons. If 
we heat the system, we might produce thermal excitations which include light mass 
strongly interacting particles such as the pion. Inside the protons, neutrons and 
other strongly interacting particles are quarks and gluons. If we make the matter 
have high enough energy density, the protons, nucleons and other particles overlap 
and get squeezed so tightly that their constituents are free to roam the system 
without being confined inside hadrons. At this density, there is deconfinement and 
the system is called a Quark Gluon Plasma. This is shown in Fig. 1 




Quark-liluon Plasma ► Hadron Gas 

Figure 1: As the energy density is decreased, the Quark Gluon Plasma condenses 
into a low density gas of hadrons. Quarks are red, green or blue and gluons are 
yellow. 



As the energy density gets to be very large, the interactions between the 
quarks and gluons become weak. This is a consequence of the asymptotic freedom 
of strong interactions: At short distances the strong interactions become weak. 
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1.2.2 The Quark Gluon Plasma and Fundamented Physics Issues 

The nature of matter at high densities is an issue of fundamental interest. Particle 
physics defines a problem of fundamental interest as exclusively the determination 
of the properties of the first principles Hamiltonian which describes all of particle 
interactions. For nuclear physics, this issue is different since this field is interested 
in the properties of matter as it might appear in nature. Fundamental here means 
that one understands the basic behaviour of this matter and the diversity of forms 
in which it can appear For example: the structure of a proton or a nucleus. In the 
case of the quark gluon plasma: the properties of matter in bulk. Of course matter 
in bulk can have a much richer structure than is easily guessed from the funda- 
mental Hamiltonian. Maxwell’s equations together with the Schrodinger equation 
are indeed very simple but in fact predict the wide and complex structure of all 
forms of ordinary matter. 

A hypothetical phase diagram for QCD is shown in Fig. 2. The vertical axis 
is temperature, and the horizontal is a measure of the matter or baryon number 
density, the baryon number chemical potential. The solid lines indicate a first order 
phase transition, and the dashed line a rapid cross over. It is not known whether 
or not the region marked cross over is or is not a true first order phase transition. 
There are analytic arguments for the properties of matter at high density, but 
numerical computation are of insufficient resolution. At high temperature and 
fixed baryon number density, there are both analytic arguments and numerical 
computations of good quality. At high density and fixed temperature, one goes 
into a superconducting phase, perhaps multiple phases of superconducting quark 
matter. At high temperature and fixed baryon number density, the degrees of 
freedom are those of a Quark Gluon Plaisma. 



The Evolving QCD Phase Transition 



t ~ 1980 

Quark Gluon 
Plasma 




Critical Temperature 150 - 200 MeV ( M b ~ ^ ) 
Critical Density 1/2-2 Baryons/Fm-^ ( T = 0 ) 



t ~ 1990 

... ^ Quark Gluon 
X Plasma 



Hadron Gas 






t ~ 2000 

Quark Gluon 
Plasma 



Hadron Gas 



Color 

^upe rconductor 



Figure 2: A phase diagram for QCD collisions. 
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Lattice gauge theory numerical studies, and analytic studies have taught us 
much about the properties of these various phases of matter. [2] At high energy 
densities, the linear confining force disappears, and chiral symmetry is restored, 
which means the masses for quarks which are generated at low energy density 
become small. The study of these various phases is the subject of much of this 
meeting. 



1.3 The Color Glass Condensate 

This section describes what is the Color Glass Condensate, and why it is impor- 
tant for our understanding of basic properties of strong interactions. [3] I argue that 
the Color Glass Condensate is a universal form of matter which controls the high 
energy limit of all strong interaction processes and is the part of the hadron wave- 
function important at such energies. Since the Color Glass Condensate is universal 
and controls the high energy limit of all strong interactions, it is of fundamental 
importance. 



1.3.1 What is the Color Glciss Condensate? 

A very high energy hadron has contributions to its wavefunction from gluons, 
quarks and anti-quarks with energies up to that of the hadron and all the way down 
to energies of the order of the scale of light mass hadron masses, E 200 MeV . 
A convenient variable in which to think about these quark degrees of freedom is 
the typical energy of a constituent scaled by that of the hadron. 



X 



— ^constituent / ^hadron 



( 2 ) 



Clearly the higher the energy of the hadron we consider, the lower is the minimum x 
of a constituent. Sometimes it is also useful to consider the rapidity of a constituent 
which is y ~ ln{l/x) 

The density of small x partons is 

~=xG{x,Q'^) (3) 

ay 

The scale appears because the number of constituents one measures depends 
(weakly) upon the resolution scale of the probe with which one measures. (Reso- 
lution scales are measured in units of the inverse momentum of the probe, which 
is usually taken to be a virtual photon.) A plot of xG(x, Q'^) for gluons at various 
X and measured at the HERA accelerator in protons is shown in Fig. 3. 

Note that the gluon density rises rapidly at small x in Fig. 3. This is the so 
called small x problem. It means that if we view the proton head on at increasing 
energies, the low momentum gluon density grows. This is shown in Fig. 4. 

As the density of gluons per unit area, per unit rapidity increases, the typical 
transverse separation of the gluons decreases. This means that the matter which 
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Figure 3: The number of gluons in a proton per unit rapidity at various rapidities 
and resolutions. 




controls high energy strong interactions is very dense, and it means that the QCD 
interaction strength, which is usually parameterized by the dimensionless scale 
as becomes small. The phase space density of these gluons, p ~ l/nR^ dNtd?pT 
can become at most 1/a^^ since once this density is reached gluon interactions 
are important. This is characteristic of Bose condensation phenomena which are 
generated by an instability proportional to the density p and is compensated by 
interactions proportional to asp^^ which become of the same order of magnitude 
when p ^ 1 /as Thus the matter is a Color Condensate. 

The glassy nature of the condensate arise because the fields associated with 
the condensate are generated by constituents of the proton at higher momentum. 
These higher momentum constituents have their times scales Lorentz time dilated 
relative to those which would be measured in their rest frame. Therefore the fields 
associated with the low momentum constituents also evolve on this long time scale. 
The low momentum constituents are therefore glassy: their time evolution scale 
is unnaturally long compared to their natural time scale. Hence the name Color 
Glass Condensate. 

There is also a typical scale associated with the Color Glass Condensate: the 
saturation momentum. This is the typical momentum scale where the phase space 
density of gluons becomes p « l/a^. 

At very high momentum, the fields associated with the Color Glass Conden- 
sate can be treated as classical fields, like the fields of electricity and magnetism. 
Since they arise from fast moving partons, they are plane polarized, with mutu- 
ally orthogonal color electric and magnetic fields perpendicular to the direction of 
motion of the hadron. They are also random in two dimensions. This is shown in 
Fig. 5. 




S234 



L. McLerran Ann. Henri Poincaré 




Figure 4: The increasing density of wee partons as the energy increases. 



1.3.2 Why is the Color Glass Condensate Important? 

The Color Glass Condensate is the universal form of matter from which all high 
energy hadrons are made. Like nuclei and electrons compose atoms, and nucleons 
and protons compose nuclear matter, the Color Glass Condensate is the funda- 
mental matter of which high energy hadrons are composed. 

The Color Glass Condensate has the potential to allow for a first principles 
description of the gross or typical properties of matter at high energies. For exam- 
ple, the total cross section at high energies for proton-proton scattering, as shown 
in Fig. 6 has a simple form but for over 40 years has resisted simple explanation. 
(It has perhaps been recently understood in terms of the Color Glass Condensate 
or Saturation ideas.) 

The Color Glass Condensate form the matter in the quantum mechanical 
state which describes a nucleus. In the earliest stages of a nucleus- nucleus col- 
lisions, the matter must not be changed much from these quantum mechanical 
states. The Color Glass Condensate therefore provides the initial conditions for 
the Quark Gluon plasma to form in these collisions. A space-time picture of nu- 
cleus nucleus collisions is shown in Fig. 7. At very early times, the Color Glass 
Condensate evolves into a distribution of gluons. Later these gluons thermalize 
and may eventually form a Quark Gluon Plasma. At even later times, a mixed 
phase of plasma and hadronic gas may form. 
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Figure 5: The Color Glass Condensate as a high density of random gluon fields on 
a two dimensional sheet traveling near the speed of light. 



The total hadronic cross section: 




Figure 6: The total cross section for high energy proton-proton interactions. 



2 How Do Heavy Ion Collisions at RHIC 
Address the Scientific Issues? 

Heavy ion collisions at ultrarelativistic energies are visualized in Fig. 8 as the 
collision of two sheets of colored glass. 

At ultrarelativistic energies, these sheets pass through one another. In their 
wake is left melting colored glass, which eventually materializes as quarks and 
gluons. These quarks and gluons would naturally form in their restframe on some 
natural microphysics time scale. For the saturated color glass, this time scale is of 
order the inverse saturation momentum (again, we convert momentum into time 
by appropriate uses of Planck’s constant and the speed of light), in the rest frame 
of the produced particle. When a particle has a large momentum along the beam 
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Figure 7: A space-time diagram for the evolution of matter produced in heavy ion 
collisions. 



axis, this time scale is Lorentz dilated. This means that the slow particles are 
produced first towards the center of the collision regions and the fast particles are 
produced later further away from the collision region. This correlation between 




Figure 8: The collision of two sheets of colored glass. 



space and momentum is similar to what happens to matter in Hubble expansion in 
cosmology. The stars which are further away have larger outward velocities. This 
means that this system, like the universe in cosmology is born expanding. 

As this system expands, it cools. Presumably at some time the produced 
quarks and gluons thermalize. They then expand as a quark gluon plasma and 
eventually as some mixture of hadrons and quarks and gluons. Eventually, they 
may become a gas of only hadrons before they stop interacting and fly off to 
detectors. 

One can estimate the reach of RHIC compared to previous machines by the 
Fig. 9. On this Figure, we attempt to estimate the typical energy densities and 
baryon number densities by an effective temperature and effective baryon number 
chemical potential (a measure of the baryon number density). (In the next section. 
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Figure 9: The reach of RHIC. 



we discuss how the temperature and baryon chemical potential are determined 
by the RHIC experiments.) For non- thermal systems, the effective temperature 
would be proportional to the typical energy per particle in the rest frame of the 
moving system, and is defined so that it reduces to the temperature for a thermal 
system. The baryon chemical potential is defined so that the ratio of effective 
baryon chemical potential to effective temperature determines the baryon number 
to entropy ratio in the same way it would for thermal systems. 

As can be seen from the figure, the effective temperature scales one can 
achieve at RHIC are enormous compared to those appropriate for the transition 
from hadronic degrees of freedom to those of quarks and gluons. 

We make this case clear in the next section: RHIC makes Quark-Gluon Mat- 
ter. About that there is now little doubt. The questions which must be answered 
are about the nature of this matter. What are its properties? Is it thermalized? 
Does it remember its origin as a Color Glass Condensate? Can one use these colli- 
sions as a laboratory to study the generations of mass or the confinement of quarks 
and gluons? 

3 What We Have Learned from RHIC 

In this section, we discuss the results from RHIC and what this implies about the 
search for new forms of matter. We will concentrate on issues related to the Quark 
Gluon Plasma and to the Color Glass Condensate. 

We shall also describe in what directions the recent results from RHIC are 
driving future directions. It may seem a little confusing the different directions 
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which experimentalists are pursuing. It must be understood that science after 
all progresses not as a linear process or entirely logical process. There is an old 
joke which illustrates this and which has it origin in an old story by the famous 
Russian theorist Gribov. A man is lost in the forest and his home is some distance 
away. This is shown in Fig. 10. Being lost in the forest is analogous to making a 




Figure 10: Gribov’s walk home. 



random walk in two dimensions: You go a little ways, change directions go further, 
change directions again, etc. etc. Now in two space dimensions there is a theorem 
that a random walk guarantees that the person will always eventually find his 
way home, but it may take a long time. An experimentalist on the other hand 
knows roughly where his home is located, and makes random walks in the general 
direction of home, and will arrive there in a much quicker time. How can one 
fail to get home? If the person is a theorist, they might think they know exactly 
the direction to home! They begin walking in that direction, miss the house but 
continue on because they are certain of their direction. The experimentalist has 
the added advantage relative to the theorist that in his directed random walk, he 
may discover something unexpected in his excursion from the shortest path. 

The situation at RHIC is somewhat like this. We are exploring new territory, 
and hopefully in addition to understanding properties of the Quark Gluon Plasma. 
We know roughly how to proceed, but since this is a new frontier, there are no 
road maps. It already seems we have stumbled on to something unexpected in 
the RHIC experiments. There is increasing evidence that one is finding effects 
associated with parton saturation and the Color Glass Condensate. 

3.1 The Energy Density is Big 

The particle multiplicity as a function of energy has been measured at RHIC. 
Combining the multiplicity data together with the measurements of transverse 
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Figure 11: Bounds on the energy density as a function of time in heavy ion colli- 
sions. 



energy or of typical particle transverse momenta, one can determine the energy 
density of the matter when it decouples. One can then extrapolate backwards 
in time. We extrapolate backwards using 1 dimensional expansion, since decou- 
pling occurs when the matter first begins to expands three dimensionally. We can 
extrapolate backwards until the matter has melted from a Color Glass. 

To do this extrapolation we use that the density of particles falls as N/V ~ 
1/t. If the particles expand without interaction, then the energy per particle is 
constant. If the particles thermalize, then E/N ~ T, and since N/V ~ for 
a massless gas, the temperature falls as T For a gas which is not quite 

massless, the temperature falls somewhere in the range To > T > 
that is the temperature is bracketed by the value corresponding to no interaction 
and to that of a massless relativistic gas. This 1 dimensional expansion continues 
until the system begins to feel the effects of finite size in the transverse direction, 
and then rapidly cools through three dimensional expansion. Very close to when 
three dimensional expansion begins, the system decouples and particles free stream 
without further interaction to detectors. 

We shall take a conservative overestimate of this time to be of order tmeit ~ 
.3 Fm/c The extrapolation backwards is bounded by < e{t) < 

The lower bound is that assuming that the particles do not thermalize and their 
typical energy is frozen. The upper bound assumes that the system thermalizes 
as an ideal massless gas. We argued above that the true result is somewhere in 
between. When the time is as small as the melting time, then the energy density 
begins to decrease as time is further decreased. 
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This bound on the energy density is shown in Fig. 11. On the left axis is the 
energy density and on the bottom axis is time. The system begins as a Color Glass 
Condensate, then melts to Quark Gluon Matter which eventually thermalizes to 
a Quark Gluon Plasma. At a time of a few Fm/c, the plasma becomes a mixture 
of quarks, gluons and hadrons which expand together. 

At a time of about 10 Fm/c, the system falls apart and decouples. At a time 
of ^ ~ 1 Fm/c, the estimate we make is identical to the Bjorken energy density 
estimate, and this provides a lower bound on the energy density achieved in the 
collision. (All estimates agree that by a time of order 1 Fm/c, matter has been 
formed.) The upper bound corresponds to assuming that the system expands as a 
massless thermal gas from a melting time of .3 Fm/c. (If the time was reduced, the 
upper bound would be increased yet further.) The bounds on the energy density 
are therefore 



2-3 GeVIFrri^ < e < 20 - 100 GeVjFrT? (4) 

where we included a greater range of uncertainty in the upper limit because of 
the uncertainty associated with the formation time. The energy density of nuclear 
matter is about 0.15 GeP/Fm^, and even the lowest energy densities in these 
collisions is in excess of this. At late times, the energy density is about that of the 
cores of neutron stars, e ~ 1 GeVjFw? . 

At such extremely high energy densities, it is silly to try to describe the 
matter in terms of anything but its quark and gluon degrees of freedom. 

In the write up for this talk, there is not space to go through the detailed 
arguments for the specific things which have been shown at RHIC. In the following 
enumeration, I will list my opinions about what has been shown, and the interested 
reader is invited to look at the discussion in for example my talk in Jaipur [4]. 

• The Gross Features of Multiplicity Distributions Are Consistent with Col- 
ored Glass Comparison with predictions of the Color Glass Condensate pre- 
dict the correct dependence of the multiplicity of produced particles on the 
centrality of the collision. One also correctly predicts the rapidity depen- 
dence. 

• Matter Has Been Produced which Interacts Strongly with Itself Measure- 
ments of the momentum space anisotropies of produced particles as a func- 
tion of impact parameter can only be explained by having strong, early, and 
frequent interactions of the quarks and gluons produced in the collision. 

• The Single Particle Distributions at Low pr are Consistent with a Thermal- 
ized Quark Gluon Plasma. Hydrodynamic computations provide detailed 
predictions for the spectrum of produced particles. At low pr < 2 GeV, 
these predictions are in quantitative accord with the data. 

• The High pr Particle Distributions Hint at New Physics. At high pr > 
2 GeV, the spectrum of produced particles is depleted relative to what is 
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expected from hard collisions of the constituents of the protons and neutrons 
in the nuclei. This may be due to rescattering in an exceptionally opaque 
Quark Gluon Plasma, or it may be due to the effects of the Color Glass 
Condensate on the initial state nuclear wavefunction. This will be resolved 
in the dA run at RHIC, since in this environment one can isolate the effects 
of the initial nuclear wavefunction 

• Abundcinces of Produced Particles are Consistent with that Expected from 
a Quark Gluon Plasma Thermal models with fixed chemical potential and 
temperature describe well the abundances of produced species of particles. 
With a more compelling argument for thermalization, this result may provide 
a direct measurement of the flavor composition of the Quark Gluon Plasma. 

• The Lifetime and Size of the Matter Produced at RHIC Has Been Measured 

Using like particle correlations, the lifetime and spatial extent at decoupling 
of the matter produced in heavy ion collisions can be measured. At present, 
the measurements disagree with theoretical computation. The origin of this 
discrepancy is not understood, and is the subject of much theoretical work. 

In addition to the things we have learned from RHIC, there are a set of issues 
about which we expect to understand much more in the not too distant future. 
These are: 

• Confinement and Chiral Symmetry Restoration Low mass dilepton mea- 
surements can determine whether some low mass hadrons have dissociated, 
due to deconfinement, or have had their mass shifted due to chiral symmetry 
restoration effects. Low mass dilepton pair measurements are challenging, re- 
quiring both good resolution ring imaging Cherenkov detection, and a good 
direct measurement of charm in order to subtract backgrounds. 

• Confinement and J/'L Suppression 

The measurement of the J/^ cross section can teach us about the properties 
of matter produced in RHIC if one knows how it is produced in pp and 
pA collisions, and if one has a measure of the direct charm production. If 
the Quark Gluon Plasma deconfines quarks, there should be no in media 
production of the J/^ 
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Generalized Parton Distributions 

and Generalized Distribution Amplitudes: 

New Tools for Hadronic Physics 



Bernard Pire 



Abstract. The generalized parton distributions and the generalized distribution am- 
plitudes give access to a deeper understanding of the quark and gluon content of 
hadrons. In this short review, we select some new developments of their interest- 
ing connections with the physics information that one can extract from exclusive 
reactions at medium and high energies. 



1 Generalized Parton Distributions 

A considerable amount of theoretical and experimental work is currently being 
devoted to the study of generalized parton distributions, which are defined as 
Fourier transforms of matrix elements between different hadron states, such as: 

J d\e^^^^P+p">’^{N’{p',s')\Î!(-Xn)^.mP{Xn)\Nip,s)) 

Their measurements are expected to yield important contributions to our 
understanding of how quarks and gluons assemble themselves into hadrons [1]. 
The simplest and cleanest exclusive processes where these distributions occur are 
deeply virtual Compton scattering (DVCS), i.e., 7 A"' and meson elec- 

troproduction i.e., 7*A — > MN' in kinematics where the 7* has large spacelike 
virtuality while the invariant momentum transfer t to the proton is small (for 
recent reviews, see [2] ). Their “inverse” processes, 7A ^ 7* A' and ttA — 7* A' at 
small t and large timelike virtuality of the final state photon [3] are quite similar. 

DVCS combines features of the inelastic processes with those of an elastic 
process. A relativistic charged lepton is scattered from a target nucleon or nu- 
cleus. A real photon of 4-momentum is also observed in the final state. With 
e(/c), e'(/c') denoting the initial and final electrons of momenta /c, k' respectively, 
and p, p' denoting the momentum of the target, the process is 

e{k) + A(p) ^ e\k’) + A'(p') -h 

The net momentum transfer A to the target is obtained by momentum conserva- 
tion, = k^ — k ^ — q The real photon may be emitted by the lepton beam, in 
which case a virtual photon of momentum — A^ strikes the target. This is the 
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Bethe-Heitler amplitude which is perfectly under control since the nucleon form 
factors at small t are known. A second possibility is that the target absorbs a vir- 
tual photon of momentum Qycs ~ p'^—p^ + Q ^ and emits the real photon. This is 
the genuine DVCS amplitude. It is straightforward to select events where all com- 
ponents of are small compared to a/Q^, with = —QvcsQvcs,^ > GeV^. 
These conditions have recently been realized in experiments [4] at HERMES at 
DESY and CEBAF at JLab. 

The physical interpretation is that the target is resolved by the virtual photon 
on a spatial scale small compared to the target size. A photon of high virtuality 
selects a short-distance region of the target: the spatial resolution is of order 
A6 t ~ h/ Perturbative QCD (pQCD) can be applied to DVCS at large , 
exploiting the short-distance resolution of the virtual photon, despite the presence 
of a real photon in the reaction[l]. 

Extracting the GPD’s from the scattering amplitudes require to study a num- 
ber of observables in different reactions[2]; electroproduction of mesons and pho- 
tons is the main source of information, but for the chiral-odd GPD’s which need 
a quite different process[5]. GPD’s may also be studied for the deuteron[6] and 
other nuclei. 

2 Femtophotography 

Complementarily to ordinary parton distributions which measure the probability 
that a quark or gluon carry a fraction x of the hadron momentum, GPD’s represent 
the interference of different wave functions, one where a parton carries momentum 
fraction x + ^ and one where this fraction is x — ^ is called the skewedness 

and is fixed in a DVCS experiment by external momenta. On the contrary, x is 
an integration variable varying from —1 to -hi. When x < (^, the GPD’s should 
be interpreted as the interference of the hadron wave function with the wave 
function of the hadron accompanied by a qq pair. It is thus very reminiscent of the 
probability amplitude to extract a meson from a hadron. 

Apart from longitudinal momentum fraction variables, GPD’s also depend on 
the momentum transfer t between the initial and final hadrons. Fourier transform- 
ing this transverse momentum information leads to information on the transverse 
location of quarks and gluons in the hadron [7]. Real-space images of the target 
can thus be obtained, which is completely new. Spatial resolution is determined 
by the virtuality of the incoming photon . Quantum photographs of the proton, 
nuclei, and other elementary particles with resolution on the scale of a fraction of 
a femtometer are thus feasible. 

3 Generalized distribution amplitudes 

The crossed version of GPD’s describe the exclusive hadronization of a qq or gg pair 
in a pair of hadrons, a pair of tt mesons for instance. These generalized distribution 
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amplitudes (GDA) [8], defined in the quark- antiquark case, as 

where s is the squared energy of the tttt system, are the non perturbative part of 
the light cone dominated process [9] 



7*7 — > TTTT 

which may be measured in electron positron colliders of high luminosity. This new 
QCD object allows to treat in a consistent way the final state interactions of the 
meson pair. Its phase is related to the the phase of tttt scattering amplitude, and 
thus contains information on the resonances which may decay in this channel. 
Results on the related reaction 7*7 ^ pp may be expected from LEP 2. 

4 Hunting for the Odderon 

A nice application of the GDA’s concerns the search for the Odderon[10]. Pomeron 
and Odderon exchanges are the theoretically dominant contributions to hadronic 
cross sections at high energy. They appear on an equal footing in the QCD descrip- 
tion of hadronic reactions, and in the lowest order approximation they correspond 
to colour singlet exchanges in the t-channel with two and three gluons, respec- 
tively. The Odderon remains a mystery from an experimental point of view. On 
the one hand, recent studies of the elastic pp scattering show that one needs the 
Odderon contribution to understand the data in the dip region [11]. On the other 
hand, the studies of meson production processes which should select the odderon 
exchange didn’t show any clear signal of its importance [12, 13]. In these cases, 
the scattering amplitude describing Odderon exchange enters quadratically in the 
cross section. 

A number of interesting features of the two pion diffractive electroproduc- 
tion process allows lo search for the QCD-Odderon at the amplitude level. Since 
the two pion state described by the GDA’s doesn’t have any definite charge par- 
ity, both Pomeron and Odderon exchanges contribute. The charge asymmetry 
is ideally suited to select the interference of the two amplitudes. As in open 
charm production [14], the Odderon amplitude enters linearly in the asymmetries 
and therefore one can hope that Odderon effects can show up more easily. More- 
over factorization properties allow to perturbatively calculate the short-distance 
part of the scattering amplitude. The long distance part contains the product of 
the C-even and C-odd GDA’s, and shows a dramatic dependence in the two pion 
invariant mass. 

In conclusion, let me acknowledge that I would not have been able to give 
this mini review without all the fruitful discussions I had during the last 6 years 
with all my collaborators and particularly Markus Diehl, Thierry Gousset, John 
Ralston, Lech Szymanowski and Oleg Teryaev. 
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Progress in Study of Renormalization of Theories 
with Nontrivial Internal Symmetry 

A. A. Slavnov 



Abstract. Problems arising in renormalization of theories with nontrivial internal 
symmetry are reviewed. Different approaches to renormalization are discussed. A 
new method which provides automatically gauge invariance of renormalized theory 
independently of regularization used is proposed. 



1 Introduction 

Renormalization is an essential part of quantum field theory (QFT). Historically 
renormalization procedure for QFT was invented to deal with ultraviolet infinities 
which appear in calculations of radiative corrections in quantum electrodynamics 
(QED). The early development of renormalization theory is related to the works 
of V. Weisskopf and H.Bethe who pointed out that at least some of these divergen- 
cies may be reabsorbed in unobservable ’’bare” parameters (mass, charge) which 
characterize fictitious noninteracting particles. This idea was developed into con- 
sistent theory by J. Schwinger, LTomonaga, R.Feynman and F. Dyson. Finally the 
complete and mathematically rigorous renormalization procedure was constructed 
by N.N.Bogoliubov and O.S.Parasiuk. They showed that renormalization is a way 
to define a product of distributions describing the particle propagation and con- 
structed the recurrent subtraction procedure (R-operation) , which allows to write 
for an arbitrary Feynman diagram a well defined convergent expression by sub- 
tracting the first terms of Taylor series in external momenta for the corresponding 
integrals. 

In more intuitive terms renormalization may be described as follows. One 
firstly introduces some intermediate regularization (ultraviolet cut-off A) to make 
all the integrals convergent. At the same time one changes the parameters of ’’bare” 
particles by introducing A-dependent counterterms into Lagrangian 

L = - ieA^)i> + Z^{m + . ( 1 ) 

Renormalization theory states that for a special choice of dependence of ^2,^3, 5 m 
on A the limit A ^ 00 exists for all integrals relevant for calculations of observables. 
This limit is defined up to certain finite terms, which are fixed by assigning to 
particles physical masses, charges and normalizing properly the wave functions. 

Having an analogous procedure for the Standard Model describing weak, 
electromagnetic and strong interactions, one can calculate unambiguously all ra- 
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diative corrections using as the input the renormalized Lagrangian with physical 
masses and charges taken from experiment (for the Standard Model 18 parameters 
are needed). 

One may ask, what is the physical reason for renormalization in QFT? It is 
related to impossibility to decouple naively the regions of low and high energies. 
Indeed, if the theory were free of ultraviolet divergencies, the influence of very 
heavy states on low energy processes would be practically absent. The contribution 
of such states to arbitrary Feynman diagram would be proportional to 

j f(ki . . . /c„)(fc2 - A^)-^dh ■■■dK (2) 

and if the integral converges, for A big enough it is negligible. Although decoupling 
of different energy regions is not uncommon in physics it does not work in QFT 
as in this case the physical ground state is a complicated superposition of virtual 
particle states, including heavy excitations. Renormalization theory allows to de- 
couple the low and high energy regions at the expence of redefining the ’’bare” 
parameters and taking physical values of masses and charges from experiment. 

One could think that renormalization is a specific feature of perturbation 
theory and may be avoided in nonperturbative calculations. However this hope is 
false. Nonperturbative calculations in lattice field theories also require renormal- 
ization and tuning the parameters of the Lagrangian. 

Renormalization is intrinsically connected with our unability to describe in a 
proper way the region of very high energies. Hence a renormalized local quantum 
field theory is an effective theory, which is designed to describe physics below 
certain ultraviolet scale A. The influence of the energies higher than A is taken 
into account by means of renormalization of bare parameters. 

The only hope to avoid ultraviolet infinities and renormalization is to con- 
struct a theory which is valid for all energies. Such a theory must include some di- 
mensional parameters and naturally cannot be strictly local. All previous attempts 
to construct a consistent nonlocal theory failed leading to violation of basic phys- 
ical principles like causality or unitarity. At present great expectations are related 
to string models and their generalizations leading to hypothetical M-theory, which 
is supposed to be the theory of everything. In string and brain models nonlocality 
is build in in a natural way and on the other hand the low energy limit of these 
models reproduces correctly the relevant field theory models with reasonable par- 
ticle spectrum. It is possible that if a consistent M-theory is developed it would 
allow to avoid renormalization and calculate unambiguously all physical quanti- 
ties. However at present these hopes are far from being well founded and there 
are much more questions than answers in this approach. In particular in my opin- 
ion nobody has studied seriously the problem of causality in string models. Most 
probably local quantum field theory and its necessary ingredient, renormalization 
procedure, in a foreseen future will remain the main tool of studying elementary 
particle physics. 




Vol. 4, 2003 



Renormalization of Theories with Nontrivial Internal Symmetry 



S249 



2 Symmetry and renormalization 

One of the main problems in renormalization theory is a compatibility of renorma- 
lization procedure with the symmetry of the theory. At present the main guiding 
principle for the choice of a particular model of interactions is a symmetry, es- 
pecially gauge invariance. Following the classical example of electrodynamics and 
pioneering works of Yang and Mills, practically all modern models are chosen 
on the basis of requirement of gauge invariance. Requirement of invariance with 
respect to local phase transformations of matter fields 

^(x*) ^ l](x)'0(x) = exp{iT^a^{x)}'ip{x) (3) 

leads to existence of the vector field B^{x) — T^B^{x), where are generators 
of the corresponding Lie algebra, with the transformation law 

B^{x) Q{x)Bi_,{x)Cl~^ {x) +5^fîf2“'(x) . (4) 

The interaction with the matter field is introduced via replacement of derivatives 
in the free action by covariant derivatives 

■ (5) 

The gauge invariant Yang-Mills Lagrangian is constructed in terms of the curvature 
tensor 



LYM = -lTrF^,F^, = -^T:T{d^A,-d,A^, + g[A^,A,]Ÿ . ( 6 ) 

The construction presented above may be described by the ’’equivalence principle” 
similar to the equivalence principle in general relativity: A local change of the basis 
in the internal charge space, described by the eq.(3), is equivalent to appearance 
of additional gauge field If one accepts this principle as a guide for model 
building, it fixes practically uniquely the interaction Lagrangian. 

The physical meaning of gauge invariance is simple: some components of vec- 
tor fields which mediate the interactions are unphysical and may be chosen at will 
in accordance with gauge invariance. In particular, using the gauge freedom one 
may eliminate unphysical components completely. Such ’’physical” gauges have 
a transparent physical interpretation, however they are usually inconvenient for 
practical calculations due to lack of manifest Lorentz invariance. In QED observ- 
able electromagnetic field is described by the stress tensor 

Fio = = W (7) 

which depends only on transversal components of vector potential. The longi- 
tudinal components diAi are unphysical. The physical gauge in this case is the 
Coulomb gauge d{Ai =0, eliminating unphysical longitudinal components. 
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Gauge invariance guarantees that observables, i.e. scattering matrix elements 
do not depend on a gauge. Instead of the Coulomb gauge one can choose some 
Lorentz invariant gauge condition like = 0, or modify the classical action as 
follows 

La = - ieA^)ip . ( 8 ) 

Due to gauge invariance 

. (9) 

The r.h.s. of this equation is manifestly Lorentz invariant, whereas the l.h.s. de- 
pends only on physical variables and is manifestly unitary. Hence we conclude that 
observables, like S'- matrix share both these properties. Gauge invariance is essen- 
tial for consistency of QED and non-Abelian gauge models. Its breaking would 
lead to violation of unitarity, Lorentz invariance or some other important physical 
properties. 

The question arises: does the renormalization procedure respect gauge invari- 
ance? As was discussed above, the renormalization is equivalent to some modifi- 
cation of a classical Lagrangian. So it is not obvious that a renormalized theory 
possesses necessary symmetry. Moreover it was found that not all symmetries of 
a classical theory survive renormalization. For some models it is impossible to 
carry out renormalization preserving gauge invariance and these models are phys- 
ically inconsistent. So the question of a gauge invariant renormalization is of great 
importance. 

Renormalization procedure cannot be formulated only in terms of observ- 
ables. It necessary includes correlation functions 

< T'ip{x)'ip(y)A^{z) . . . >. These functions are not gauge invariant. However gauge 
invariance imposes the conditions on these functions which are necessary and suf- 
ficient for invariance of observables. The physical meaning of these conditions is 
rather transparent: the correlators including unphysical degrees of freedom, like 
d,j,A^ may be expressed in terms of correlators depending only on physical com- 
ponents. In QED such relations were obtained long ago by J.Ward [1], and in a 
more general form by E.S.Pradkin [2] and Y.Takahashi [3]. 

The simplest identities are: 

d;<TA^{x)A,{y)> = 0 

< T A^{x)ï){y)'il;{z) > = e < Txjj{x)ilj{z) > 6{x - y) 

— e < T^{y)'ij){x) > 6{x — z) . (10) 

One sees that the correlators including unphysical longitudinal photons are ex- 
pressed in terms of physical correlators. 

The eqs (10) in QED are a consequence of the classical conservation law for 
the electric current 



= 0 • 



( 11 ) 




Vol. 4, 2003 Renormalization of Theories with Nontrivial Internal Symmetry 



S251 



In non- Abelian gauge theories this simple interpretation does not work. The reason 
is that the gauge fixing in Yang-Mills theory breaks the conservation of the classical 
current. Moreover the consistent quantization of Yang-Mills theory requires further 
modification of the classical action: new unphysical fields, Faddeev- Popov ghosts, 
appear. So the complete quantized action looks as follows 

L = - - id,rcD^c (12) 

Here c, c are anticommuting scalar fields. Nevertheless the relations which express 
gauge invariance in terms of correlators may be written in this case as well [4], 
[5]. These relations have the same physical meaning, expressing the correlators 
including unphysical components in terms of physical ones. However in this case 
unphysical components include not only longitudinal polarizations of vector fields 
but also Faddeev-Popov ghosts. In particular the identity for the three point func- 
tion including the longitudinal component of the Yang-Mills field expresses it in 
terms of the two-point Yang-Mills correlator, two-point correlator of ghost fields 
and ghost- Yang-Mills field vertex: 

ia-i < TBl{x)Bl{y)d,Bl{z) > = < Tdl^{x)é{z)Bl{y) > 

+ < TBl{x)ïf{xy{z)Bl{y) > + ...(13) 

where . . . denotes analogous terms with x y.fi ^ v.a ^ b. This identity 
together with the corresponding identities for two and four point correlators guar- 
antees gauge independence of renormalized 5-matrix. 

Contrary to the QED case the identity (13) does not correspond to the conser- 
vation of the classical current. Nevertheless it is related to some global symmetry 
of the effective action, including a gauge fixing term and ghost fields. For the 
Yang-Mills theory the symmetry transformations look as follows [6], [7]: 





= 


5c^ 




6c^ 





Here e is a constant with odd Grassmanian parity, anticommuting with the ghost 
fields. The transformation (14) is therefore a supersymmetry transformation. 

The identities (13) may be interpreted as a quantum analogue of the conser- 
vation law corresponding to the BRST invariance of the effective action. 

The identities (13) or, alternatively, BRST- invariance are the necessary and 
sufficient conditions of gauge invariant renormalizability and therefore must be 
respected by a renormalization procedure for the gauge theory to be consistent. 
This raises some problems, both technical and conceptual. 

A renormalization procedure usually includes two elements. One firstly in- 
troduces an intermediate regularization which makes all the integrals describing 
radiative corrections finite. Then the renormalized correlators are constructed by 
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subtracting according to i?-operation the first terms of Taylor expansion of corre- 
sponding integrals, or, equivalently, by introducing suitable A-dependent countert- 
erms Zi{K). The finite renormalized correlators obtained in this way must satisfy 
the Generalized Ward Identities (GWI) (13). 

Obviously these identities can be violated by an ultraviolet regularization. If 
these violations may be compensated by a suitable choice of local counterterms 
it does not lead to inconsistency. However in some models it is impossible. For 
example in ” chiral QED” , described by the Lagrangian 

~ ~^f fju^ f fj,u T (1^) 

the three point correlation function must satisfy the identity 

< TA^(x)Ay{y)Ap{z) >= 0 . (16) 

Explicit calculation gives for the Fourier transform of this correlator a finite 
nonzero result 

(p + k) = ■ (17) 

This is a simplest example of a quantum anomaly. Classical chiral symmetry is 
destroyed by quantum corrections. Similar anomalies are present in conformally 
invariant theories and some other models whose symmetry is related to the absence 
of dimensional parameters. Renormalization introduces a dimensional parameter 
which causes the violation of the classical symmetry. 

Breaking of the gauge invariance leads to inconsistency of the theory. Unphys- 
ical degrees of freedom do not decouple from physical ones, resulting in violation 
of unitarity. 

There exists a possibility that new degrees of freedom which appear in anoma- 
lous models may have a physical meaning and the quantum theory is consistent, 
but has more degrees of freedom than the classical one. It was demonstrated ex- 
plicitly in some simple model. However no nontrivial four-dimensional models of 
this type are known. At present a physical meaning of quantum anomalies is not 
completely understood and their absence is considered as a criterion for choosing 
self consistent models. 

3 Different renormalization schemes. 

Even for anomaly free models gauge invariant renormalization is not a trivial prob- 
lem and its efficient solution is important in particular for practical calculations 
in the framework of the Standard model and for the complete analysis of different 
supersymmetric models. 

There are two main approaches to renormalization of theories with nontrivial 
internal symmetry. 
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The first approach involves a gauge invariant intermediate regularization, 
so that regularized correlators satisfy GWI. In this case one can prove that the 
counterterms needed to eliminate ultraviolet divergencies also preserve the struc- 
ture of the effective action and the gauge invariance is manifest at all stages of 
calculations. 

The most natural and conceptually simple method of regularization is pro- 
vided by a lattice regularization. Continuum space-time is replaced by a discrete 
lattice with the lattice spacing a. The inverse of a, A = a~^ plays a role of ul- 
traviolet cut-off. Wilson [8] showed that discretization can be introduced in a 
way compatible with gauge invariance. This approach is successfully used for non- 
perturbative calculations in the Standard Model with the help of Monte-Carlo 
method. However for perturbative calculations it is not convenient due to a com- 
plicated structure of the regularized action and breaking of Lorentz invariance. 
Additional complications arise in the theories with chiral fermions, in particular 
in Salam- Weinberg model. 

The second gauge invariant method is the regularization by means of higher 
covariant derivatives [9], [10]. The classical action is modified by introducing the 
terms with higher covariant derivatives 

. (18) 

Introduction of derivatives into kinetic part suppresses the ultraviolet asymptotics 

of propagators 

1 ^ I (19) 

improving the convergence of Feynman integrals. However requirement of gauge 
invariance forces us to introduce covariant derivatives resulting in appearance of 
new vertices. The analysis shows that this regularization is not complete. It makes 
convergent all the diagrams except for a finite number of one loop diagrams, thus 
reducing the problem to analysis of a finite number of relatively simple integrals. 
This regularization, although incomplete, has an advantage of being applicable to 
any gauge invariant theory. Its existence proves in a trivial way the absence of 
anomalies in multiloop diagrams: if the anomalies are absent in one loop diagrams 
they cannot appear in multiloop diagrams which allow universal gauge invariant 
regularization. 

The one loop diagrams require a special treatment. For the Yang-Mills theory 
a complete manifestly gauge invariant regularization was constructed by using an 
additional Pauli- Villars type regularization for one loop diagrams [11], [12]. 

This method may be extended to regularize in a gauge invariant way anomaly 
free models with chiral fermions, in particular Weinberg- Salam model. Pauli- 
Villars type regularization cannot be applied directly to such models as the fermion 
mass term breaks chiral invariance. However compensation of anomalies in lepton 
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and quark sectors leads to the absence of 75 matrix in all divergent one-loop di- 
agrams, which coincide up to the factor 1/2 with the corresponding diagrams in 
a vectorial theory [13]. It allows to use for their regularization the Pauli-Villars 
procedure. This procedure may be also implemented as a modification of the La- 
grangian by introducing an infinite series of auxiliary fields [13]. This approach is 
closely related to treating chiral fermions with the help of auxiliary fifth dimension 
[14]. It got an important development in lattice gauge theories, resulting in the 
construction of the overlap method [15] used for nonperturbative calculations on 
a lattice. 

The method of higher covariant derivatives is a powerful tool for a general 
analysis of renormalized gauge theories, but calculations in perturbation theory 
are rather complicated due to the presence of additional interaction vertices. 

The most economical method of gauge invariant regularization in the frame- 
work of perturbation theory is the dimensional regularization [16]. This method is 
based on the observation that for some dimensions of the space-time the Feynman 
integrals are convergent and the Lorentz invariant amplitudes may be considered 
as meromorphic functions of space- time dimension d having poles at d = 4. For 
the values of d where all integrals are convergent the GWI are fulfilled and the 
poles at d = 4 form a gauge invariant structure. To renormalize the theory in a 
gauge invariant way it is sufficient to drop the pole terms when taking the limit 
d — 00. The dimensional regularization is equivalent to the formal change of 
J d'^ki . . . d"^kif{ki ... A:/) by f d^k\ . . . d'^kif{ki . . . A:/), and it preserves the struc- 
ture of Feynman integrals which makes it convenient for practical calculations. 
It requires however a proper definition of tensor objects like Dirac 7-matrices. 
There is no consistent definition of 75 matrix in arbitrary dimension, compatible 
with chiral invariance. So the dimensional regularization breaks gauge invariance 
in models with chiral fermions, which include such important theories as Salam- 
Weinberg model and supersymmetric theories. 

The second approach to renormalization of theories with nontrivial internal 
symmetry is the algebraic renormalization [6]. It is a two step procedure. One 
firstly defines finite correlation functions by using some, not necessary symmetry 
preserving, subtraction scheme. At the second step the freedom in the choice of 
finite count erter ms is used to satisfy GWI or similar symmetry relations. The 
big advantage of this method is its universality. It may be applied to arbitrary 
quantum field theory model and used with arbitrary regularization procedure. 
In particular some supersymmetric models for which the problem of invariant 
regularization is not solved were considered in this framework [17], [18]. It also was 
applied to the analysis of the Standard Model [19], [20]. However, from the point 
of view of practical calculations this method is rather complicated. It requires 
introduction of most general noninvariant counterterms which in some models 
(e.g. supersymmetric) have a complicated structure. Some method which shares 
universality of algebraic renormalization and manifest symmetry of gauge invariant 
regularization schemes would be welcome. 
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4 Universal inv£iriant renormalization 

In this section I describe a special subtraction procedure which may be used with 
arbitrary regularization scheme and incorporates automatically Generalized Ward 
Identities, hence guaranteeing the symmetry of renormalized theory [21], [22]. The 
main idea can be explained using as an example QED. 

The correlation functions in QED must satisfy the Ward Identities 



Pp^''pÀP) =0 > 


( 20 ) 




( 21 ) 


=0 . 


( 22 ) 



Here H denote purely photonic amplitudes, and P is the electron- photon vertex. 
Our goal is to write down explicit expressions for renormalized correlators which 
satisfy WI for arbitrary intermediate regularization. 

For the vertex function such expression looks as follows 

r;(p, q) = k) - r^ip, o) + . (23) 

Here is the electron self energy renormalized at will. The WI is satisfied 

trivially 

^yr 

r>,0) = e— . (24) 

As P^(p, k) in the limit A 00 diverges logarithmically, the difference P^(p, k) — 
Pj^(p, 0) is finite. Finally, in spite of apparent nonlocality of subtracted terms the 
subtraction is in fact local. Indeed, the subtracted terms may be writ en as follows 

lim [F^ip, 0) - = lim [r;)(p, 0) + (Zi (A) - 1 ) 7 ^ - e— ] 

A^oo ^ dpn A^oo ^ apn 

= lim (Zi (A) - 1 ) 7 ^ (25) 

A^oo 

where ^i(A) is a constant, logarithmically divergent in the limit A ^ oc. 

In a similar way the local subtraction providing automatically WI for four 
photon vertex is 

^%pa{p^(l^k) = . (26) 

Obviously Ward Identities are not sensitive to subtraction of local gauge 
invariant counterterms. In QED such gauge invariant structure is the transversal 
vacuum polarization, which may be renormalized at will, by subtracting 

(Z3 - l){k^gp,^ - kpK) . 



(27) 
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The procedure described above reduces the problem of renormalization in 
arbitrary regularization to the renormalization in invariant regularization scheme, 
where only gauge invariant counter ter ms are needed. Noninvariant counter ter ms 
do not appear at all. 

In arbitrary gauge theory the corresponding scheme looks as follows. Gener- 
alized Ward Identities may be written in the form 

= • (28) 

Here the functions at the r.h.s have one external gauge field line less than the 
function at the l.h.s. The corresponding differential identities are 

r;:....(o,fc...) = ^^^^ip=o (29) 



gr;:,...(p,fc...) 

dpa 



y=0 + 



'9r;^...(p,fe ...), 

dpu 



lp=0 — 



dpudpa 



p=o . (30) 



For arbitrary regularization and subtraction scheme in anomaly free theory these 
identities may be violated by local polynomials. So the renormalized correlators 
satisfying GWI are given by the equation 

r^....(p,fc...)= iim[r;)..,(p,...)- 

A^oo 

r^....(0,fc...) + ^^^%^^^ ]+ 9 -i.c.. (31) 

where g.i.c. denotes gauge invariant local terms which obviously have no influence 
on GWI and may be fixed at will. 

The eq.(31) is written for the case of logarithmic divergency. For linearly di- 
vergent integrals the next differential identity (30) should be subtracted. Quadratic 
divergencies are treated in a similar way. If necessary one can subtract in addi- 
tion a gauge invariant counterterm (like photon wave function renormalization in 
QED). Such terms are not fixed by GWI and as discussed above, are present also 
in gauge invariant regularization schemes. 

Therefore we have the renormalization procedure, which is universal, may be 
used with arbitrary intermediate regularization and requires only gauge invariant 
counterterms. In particular it may be applied to the Standard Model in combi- 
nation with dimensional regularization, providing a simple computational scheme 
preserving chiral gauge invariance. 

Another important application of this method is renormalization of super- 
symmetric gauge theories for which no general invariant regularization is known. 
The universal renormalization of supersymmetric theories was considered in a re- 
cent paper [23]. 
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Approaching the Final State in Perturbative QCD 

George Sterman 



Abstract. Infrared safe differential cross sections, such as event shape distributions, 
can be measured over wide kinematic ranges, from regions where fixed order cal- 
culations are adequate to regions where nonperturbative dynamics dominate. Such 
observables provide an ideal laboratory for the study of the transition between weak 
and strong coupling in quantum field theory. This talk focuses on how resummed 
perturbation theory has been used to deduce the structure of nonperturbative cor- 
rections, and to provide a framework with which to address the transition from 
short- to long-distance dynamics in QCD. 



1 Introduction 

Quantum chromodynamics (QCD) serves both as tool and background for new 
physics searches [1]. Control over QCD corrections is crucial for the interpretation 
of any experiment in which hadronic matter appears in the initial or final states, 
or even (as in the measurement of the muon’s magnetic moment) only in virtual 
states. In addition, QCD is fascinating in its own right as a quantum field theory. It 
combines strong and weak coupling dynamics in any high energy experiment that 
is sensitive to momentum transfers much beyond the scale of the strong coupling, 
Aqcdj and every experiment that probes higher energies encodes the transition 
between partonic degrees of freedom at short distances to hadronic degrees of 
freedom at long. 

This talk approaches these issues from the partonic, short distance side, 
through the study of perturbation theory. I will argue that perturbative QCD 
has much to say about the structure of the larger theory that contains it. The dis- 
cussion begins with a review of some of the basic concepts of perturbative QCD. 

1 will then sketch how resummed perturbation theory implies nonperturbative in- 
formation, with the operator product expansion as the classic example. Analogous 
reasoning is extended to jet- related cross sections, and examples are given of the 
interesting results that emerge from such an analysis. 

2 Some Concepts in Perturbative QCD 
2.1 Asymptotic freedom and infrared safety 

The characteristic feature of QCD is its asymptotic freedom, in which the coupling 
becomes weaker when it is measured at shorter distances. This remains a purely 
abstract attribute, however, until we identify observable quantities that can be 
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expanded in the coupling with finite coefficients. Such quantities are said to be 
infrared safe [2]. A few cross sections, such as the total and suitably-defined jet 
cross sections in e^e“ annihilation, are infrared safe, and may be expanded as 

Q^cr^hysiQ) = ^Cn{QVfi^)a”{n) + 0 

n 

= Çc„(l)<(Q) + o(^), (1) 

where p is some integer and ^ is a renormalization scale. In the second expression 
we have used the renormalization group invariance of physical cross sections. 




2.2 Factorization 

Cross sections for processes with hadrons in the initial state are not by themselves 
infrared safe. Nevertheless, if they involve heavy particle production, or more gen- 
erally involve a large momentum transfer, Q, then short- and long-distance com- 
ponents of the cross section can be factorized [3] , usually into a convolution form. 



<3^o-phys(Q, m) =u)sD{Q/t^,asilJ.)) ® /LD(/i,m) + 0 




(2) 



In this expression, <S> represents a convolution, most familiarly in terms of partonic 
momentum fractions, and /x is a factorization scale, separating the short-distance 
dynamics of the hard subprocess, in o;sd: from the long-distance dynamics in /ld- 
/ld depends on various infrared, often nonperturbative scales, denoted m. For 
inclusive lepton-hadron deep- inelastic scattering (DIS), in which Q is the momen- 
tum transfer, and for various other inclusive and semi-inclusive hadronic hard- 
scattering cross sections, /ld represents products of parton distribution functions. 
For single-particle inclusive cross sections, it includes fragmentation functions. 
Generally speaking, ‘new physics’ is part of the short distance functions a;sD, 
while the long-distance /ld’s are universal among processes, making predictions 
possible on the basis of hypothetical couplings to new particles or rare processes 
at short distances. 

Eq. (2) applies not only to factorized cross sections with hadrons in the initial 
state, but also to many infrared-safe jet cross sections, for which the scale m is 
large enough to be perturbative, but is still much smaller than the highest scale 
Q: Q ^ m ^ Aqcd- An example is the limit of ‘narrow jets’ in e+e“ annihilation, 
where m may be taken as the mass of the jet [4] . 



2.3 Factorization proofs and resummation 

The proofs of relations like Eq. (2) are all based on the observation that short- 
distance processes, organized within the function cjsd? are quantum-mechanically 
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incoherent with dynamics at much longer length scales. This is the same observa- 
tion that makes possible the operator product expansion. Extensions of Eq. (2) to 
jet cross sections and various decay and elastic amplitudes depend on the further 
observation that there is a mutual incoherence in the dynamics of particles reced- 
ing from each other at the speed of light. Often, these factorization relations hold 
to all orders in perturbation theory and to all powers of ln(/u/Q) [5]. Interest in 
these issues has reemerged in analyses of high energy processes in the language of 
effective theories [6]. 

Whenever there is factorization, there is some form of evolution and resum- 
mation. To see why, we observe that a physical cross section cannot depend on the 
factorization scale. 



0 == fi—lnaphys{Q,m) 



( 3 ) 



From this it follows that the variations of long-distance and short-distance func- 
tions with respect to fj, must compensate each other, through a function of the 
variables that they hold in common: 



ln/LD(M.'f«) = --P(ûs(m)) = InüJsD ■ (4) 

In deep-inelastic scattering, and other inclusive hard-scattering cross sections, 
where cjsd and /ld are connected by a convolution in momentum fractions, the 
functions P{z,as) are the DGLAP splitting functions [7]. 

By the same token, wherever there is evolution, there is a resummation of 
perturbation theory, generated by solving the evolution equation. Again thinking 
of DIS, we may fix /i = Q in Eq. (2), compute cjsd as a power series in as(Q)^ 
and derive the Q-dependence of the cross section by solving (4) for the functions 
/ld(Q,^): 



In fTphys(Q,rn) 



<^sd(1,û^s(Q)) 



exp 



/ i /LD(9,m). 



( 5 ) 



The most impressive successes of this approach are in observables with a single 
hard scale. Inclusive DIS is the exemplary case, as illustrated in Fig. 1 [8] for the 
structure function F 2 {x^Q)^ which is proportional to the cross section to produce 
an hadronic final state of total invariant mass squared equal to Q^(l — x)/x, at 
momentum transfer Q. 

As mentioned above, only a limited number of observables depend on a single 
hard scale. A more typical example is the distribution of p == (mj/Q)^, from e“^e“ 
annihilation dijet events, with mj the mass of the heavier jet in the center-of- 
mass (c.m.). The experimental distribution in p at the Z pole is illustrated in Fig. 
2, taken from [9]. In this case, the two scales are the c.m. energy, Q — y/s and 
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Figure 1: Data for F 2 {x,Q) with QCD fit based on evolution, from Ref. [8]. 



the mass of the ‘heavy’ jet. The data is shown as a function oî p = and 

even resummed perturbation theory (next-to- leading logarithm in mjfQ) is not 
adequate to follow it. It is necessary to include power corrections in the lighter 
scale: 1/mj (as organized in a shape function, see below), which become more and 
more important in the ‘exclusive limit’ of light jet masses. Notice that most events 
are in this range; going one step beyond the fully inclusive cross section, we are 
required to find a description with nonperturbative as well as perturbative input, 
even though the observable in question is infrared safe. As we decrease mj, non- 
perturbative effects vary continuously from a few to tens of percent. Thus, within 
this single set of data, at a single energy, we can select events whose formation 
is sensitive to an adjustable mix of perturbative and nonperturbative dynamics. 
This is an ideal testing-ground for quantum field theory. 

I shall discuss below recent attempts to organize nonperturbative corrections 
to infrared safe cross sections starting with perturbation theory. To do so with 
confidence, we should attempt to find relevant structures in perturbation theory 
to all orders in the coupling, the level at which we understand Eq. (2). An example 
is threshold resummation. 
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Figure 2: Heavy jet distribution at the Z pole, from Ref. [9]. The dashed line is 
resummed perturbation theory, and the solid line a fit with a shape function. 



2.4 Threshold resummation 



Threshold resummation addresses a specific set of corrections at all orders in per- 
turbation theory for factorized cross sections. Consider a generic factorized cross 
section for the inclusive production of an observable final-state object F of invari- 
ant mass Q, written in terms of partonic fractions as 



a AB-^ f{Q) = fa/Ai^a) ^ fb/si^b) ^ (^ab^F-\-X 




( 6 ) 



with fc/H parton distribution function for parton c in hadron H. The hard- 
scattering cross section depends on variable z, which is the ratio of to the 
squared invariant mass of the pair of partons that initiate the process aè — > F+X, 
where X denotes additional radiation in the inclusive final state. Of special interest 
is the limit z ^ 1, which is called partonic threshold. At partonic threshold, the 
partons a and b have only enough energy to produce F{Q), with nothing left over 
for radiation. This is a special limit in field theory, because a hard scattering favors 
processes with copious radiation. The consequence of this mismatch is that z = 1 
is associated with singular but integrable distributions, such as 



^ab^F 



(z) 



(-X- 

V 7T / r\ 



In' 



2r-l 



( 1 -.) 



( 7 ) 
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where a plus distribution [g{x)]^ with g{x) singular at x = 1 is defined (like a 
delta function) by its integrals with smooth functions: 



[ dx[g{x)]+f(x) = [ dxg{x)[f{x) - f{l)]~ f{l) ( dxg{x). 
J V J V Jo 



( 8 ) 



In Eq. (6), the singular distributions are in cjab^F-^x which has the information 
on perturbative radiation, and the smooth functions are the parton distributions. 
While all of the plus distributions are integrable, they become more and more 
singular at higher orders, and it is natural to try to estimate their overall effect 
beyond the lowest orders. 

At partonic threshold, we may implement the sequence of reasoning that we 
described above. Quanta are part of well-collimated jets, moving relatively at the 
speed of light, or are very soft. We may (re)factorize the partonic cross section in 
this limit, which leads to new evolution equations, and hence to resummation. For 
the hard scattering of Eq. (6), we find expressions whose expansion generates the 
terms identified in Eq. (7) to all orders in perturbation theory [10]. The resumma- 
tion is most conveniently exhibited in the space of Mellin moments of Uab^F-\-x, 
where they exponentiate as 



/■ 



dzz^ ^Lüab-^F+X{z) 



' exp 



' exp 



1 ^N- 

dz- 



XJ 



-1 - 1 /■«' dm2 



1 



/ 

J{1 



{1-zy 



Ai{as{m)) 



^ dm? ( w ^ 

YAi{as(m)) In I 1 



V Q ) 



(9) 



in terms of anomalous dimensions Ai{as) = CiioLs/ir) + . . ., with Ci — Cp for 
i = q,q and Ca for gluons. Eq. (9) generates the leading logarithms at each order; 
the complete expression involves process-dependent nonleading logs. At each order 
in the exponent, one of the logarithms can be evaluated explicitly, as shown, but 
in the remaining integral the strong coupling runs down to arbitrarily low scales. 
We will explore this intriguing feature of the resummed cross section below. 



3 Power Corrections and Event Shape Functions 

Examining Fig. 2 we have seen that for differential distributions, power corrections 
are necessary to bring even resummed cross sections into agreement with data in 
the exclusive limit. In the range where the kinematics are far from elastic, fixed- 
order perturbation theory is enough, but as we climb the curve of the figure toward 
the exclusive limit, the cross section rises and we move into a region where first 
resummation, and then nonperturbative corrections dominate. The theory itself 
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knows no strict boundary between perturbative and nonperturbative regimes. It is 
therefore natural to analyze perturbation theory to see if it gives signals of its own 
incompleteness, and to seek hints on how to supplement perturbative predictions 
with nonperturbative information. The ultimate goal is to construct a single theory 
that bridges the gap between short- and long-distance dynamics. 



3.1 The operator product expansion 

The approach that I will describe below begins with the simplest one-scale prob- 
lem, the total cross section in e^e“ annihilation [11]. The total cross section is 
related by the optical theorem to the imaginary part of the vacuum polarization 
for elect roweak currents. 



I , ( 10 ) 

whose behavior at large is determined by an operator product expansion: 

(0|i^(0)j^(Æ)|0) = ^Co{x^n^,as{iJ.)) 

+~ Cp. (a:V^ {0\F^.F'^‘'i0)\0) + .... (11) 

Here, I have shown only the contributions of the leading (identity) operator and of 
the gluon condensate F^. Neglecting masses, perturbation theory alone would give 
only the leading term, even though the full theory demands both (and more). The 
question we ask is whether we could have discovered the need for the nonleading 
operators from perturbative QCD alone. 

The answer is yes, and the reasoning is illustrated in Fig. 3, which shows 
graphical configurations where loop momenta k are vanishingly small for the 
vacuum polarization and for the gluon condensate treated as a local operator. 
Diagram- by-diagram, each such configuration is a tiny ‘corner’ of loop momentum 
space, where one or more loops has low momentum, giving a finite contribution to 
the infrared safe integrals. For example, if the loop momentum is required to have 
k‘^ < for some cutoff k,, we find momentum-space integrals of the general form 



= ^H{Q) f\k^k^as{e) 
V Jo 

= ±H{Q)£ -2-2 



dk^k 



(^)/?oln(A:7g2)’ 



( 12 ) 



with H{Q) an infrared safe function free of all contributions where a single loop 
momentum vanishes. H{Q) represents the lower, contracted part of the diagram 
on the left of Fig. 3. Up to this factor, the expression for the right-hand figure, 
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corrections to the condensate, is identical. A generalization of Eq. (12) is valid to all 
orders in perturbation theory, and controls all logarithms in the loop momentum k, 
up to the next power, Q~^ [11]. We will call an expression at this level of accuracy, 
all logs at a given nonleading power, an ‘internal resummation’ at that power. 





Figure 3: Sources of infrared sensitivity in perturbation theory for Cq (left) and 
(F2) (right). 

The foregoing observations may be summarized as an ‘axiom of substitution’, 
in which the infrared-sensitive, nonconvergent portion of an inter nally-resummed 
perturbative quantity is replaced by a matrix element with the same perturbative 
infrared expansion: 

CpT Cp^[Q'^/iJ?,KlQ,as{ij)) 

+ a,(0|F2(0)|0)(«). (13) 

Here, is perturbation theory subtracted in its infrared-sensitive limit. As an- 
ticipated, the nonconvergence of perturbation theory implies the need for a new, 
infrared regularization. For the vacuum polarization the cost is a new nonper- 
turbative parameter ((F^)), necessary to define the theory at the level of l/Q^ 
relative to the leading behavior. But, because it is implicit in perturbation theory, 
this new parameter is at the same time a reward of the analysis. 

3.2 Power corrections in semi-inclusive cross sections 

The viewpoint described above for the operator product expansion has been im- 
plemented, in various ways, to a variety of infrared safe cross sections and hard- 
scattering functions [12]. Of particular interest are resummed hard scattering func- 
tions like Eq. (9), which exhibit the same integral over the scale of the coupling 
as for the vacuum polarization (12). The application of these methods to event 
shape cross sections [13] sheds light on the transition from short- to long-distance 
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dynamics in the formation of final states in QCD. This section will sketch the ideas 
behind these applications, and also touch on internal resummation in this context. 
Such questions are most easily addressed for event shapes of two-jet cross sections 
in e~^e~ annihilation, although their application is much wider [14]. 

We consider inclusive cross sections defined by event shapes, S{{pi}), 

S = ^ E / \Mn{{Pi})\^S {S{{pi}) - S) , (14) 

do 2Q^ ^ Jps{n) 

at c.m. energy Q, computed from QCD matrix elements Mn{{pi]) integrated over 
n-particle phase space, PS{n). This event shape cross section is infrared safe if 
5({pi}) is a smooth function of final state momenta, that is unchanged by soft 
parton emission and collinear rearrangements, as illustrated in Fig. 4: 

S{...pi...pj-i,api,pj+i ...) = S{. . .(1 +a)pi ...pj_i,pj+i ...). (15) 

This condition makes it possible to compute the cross section pert ur bat ively, be- 
cause it deemphasizes the long-time evolution of the system, over which the nearly 
degenerate states of Fig. 4 mix. 

The study of event shapes in QCD, and more generally energy fiow [15], 
continues a long tradition in field theory. We may draw a not too far-fetched 
analogy between how, late in the nineteenth century Poynting discovered the rules 
for energy and momentum flow in classical electrodynamics by considering the 
slow discharge of a condenser [16], and how, late in the twentieth century, the 
short-distance structure of QCD was revealed in the rapid neutralization of a 
quark- antiquark color dipole produced in e+e" annihilation [17]. 

P : (1-a)p 

ap 



Figure 4: Soft emission and collinear rearrangements. 



We restrict ourselves here to a specific set of weights, appropriate to dijet 
events in annihilation cross sections [18], 



>5a(u) = ^ E 



kiT e‘ 






(16) 



all ien 



where the transverse momentum kip and pseudorapidity rji of each particle mo- 
mentum ki is defined relative to the axis that minimizes the specific function So(n) 
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for State n. The variable a takes on any value less than two. The particular choice 
a = 0 is So = 1 — T with T the thrust, while a = 1 is known as the jet broadening 

[19]. 

For these dijet event shapes, sensitivity to long times can be adjusted by 
studying the Laplace transform [13, 19], 




Choosing N large, we select states with small 5a, which are sensitive to long-time 
behavior (roughly, times of order \/SaQ), corresponding to low jet masses in dijet 
events. As illustrated by Fig. 2, however, power corrections are important at low jet 
masses. We thus ask what the reasoning described above for the operator product 
expansion can tell us about the transformed cross sections, especially at large N 
in Eq. (17). 

At low values of 5a, these differential event shape cross sections show the 
same double logarithmic behavior discussed above for threshold resummation, and 
obey a similar resummation in Laplace transform space [18, 19], which is given to 
leading logarithm, accompanied by effects of the running coupling, by 

{ dr? cVtIQ 

àa{N,Q) ~ exp J 2 / {a ^{ pt)) / — 

[ Jo pt jpiiQ^ y 

X *''■“-!] I , (18) 



in terms of the same Sudakov anomalous dimension as above, Aq{as) = 
Cpicts/Tr) + . . .. 

We can use Eq. (18) to suggest the structure of power corrections in differen- 
tial cross sections, as described above for the total annihilation cross section. To 
do so, we split up the pr integral into ‘perturbative’ and ‘soft’ ranges, separated 
at a factorization scale k [20]. If we expand the exponent of (18) in the soft range, 
we find 



\nàa{N,Q) = ln<Ja,PT(A',(5, k) 

nn\ 



2 ^ 1 

n—l 



= lnàa,PT(A^, Q, k) + In {N/Q) + O 



l-(^ 



Q 



n(l — a) j 



N 



(19) 



In the second line we organize the entire contribution from the soft region into 
a single function fsai^lQ)^ with corrections relatively suppressed by powers in 
1/(3^“^ (assuming for now that a < 1). Taking the inverse transform of (19) 
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we arrive at an expression for the physical cross section as a convolution of the 
perturbative cross section with what is known as an event shape function [20], 



da 

dSa 







dag^PTiSg - e/Q) 
dSg 



1 



(20) 



The event shape function fs^ is independent of Q, so that a fit to data at Q = mz 
is sufficient to predict the differential cross section at any Q, including all integer 
powers of N/Q in moment space [9], which translates to all integer powers ofl/SgQ 
at fixed Q. This is the kind of formalism necessary to confront data like those in 
Fig. 2 above. Analogous functions find important applications in the physics of 
b quarks [21]. A recent example appled to the heavy jet mass, closely related to 
a — 0, is shown in Fig. 5 from Ref. [22]. Beyond a = 0, the shape functions derived 
in this fashion for choices of a, 6 < 1 possess a surprisingly simple relation, 



In fsAN/Q) = In fs,(N/Q) , 



( 21 ) 



with corrections as in (20). For 2 > a > 1, on the other hand, we encounter the 
interesting situation noted first in Ref. [23], where the leading power correction 
becomes fractional, and larger than l/Q. 
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Figure 5: Event shape-based fit to the heavy jet mass at various energies, from 
Ref. [22]. 
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The next question we should ask, however, is whether Eqs. (18) and (21) 
hold beyond leading and (suitably generalized [19]) next- to- leading logarithmic 
resummation, and in particular whether these event shapes can be internally re- 
summed to all orders in 1/SaQ^ Rather than try to answer this question for the 
full theory, we will follow Ref. [20], and consider the ‘eikonal’ cross section, in 
which the primary quark-antiquark pair and the partons collinear to them are all 
replaced by nonabelian phase operators. These are Wilson lines, represented by 
products of exponentials of the gauge field ordered along lightlike paths in the jet 
directions. Such cross sections have the advantage that many of their properties 
may be analyzed to all orders in perturbation theory using combinatoric methods 
[24]. Here I will simply quote the result of such an analysis. 

As in Eq. (18), the all-orders eikonal approximation to the cross section can 
be written in transform space in terms of a function in which as runs with an 
integration variable, which we may think of as a transverse momentum, 

cPvrr 

In àa{N,Q) = / —^Asa {pT/fJ^,Q/fi,as{fi),N) , (22) 

Jo Pt 

where the function Asa is infrared safe and renormalization group invariant 
{dAsa/dii = 0). As an example, consider the case a — 1, jet broadening, for 
which we can write 



.451 (Pt/I^,Q/iJ',0:s{h),N) 



Ç ipS(n) 



Wn 



ki -kj ki- pikj ■ fh. , . 



IPS(n) \ • /?2 

xS'‘‘ (pT - ^ S (r]„) ln{Q/pT) 



exp{-^PT}-l 



(23) 



where the ki denote particle momenta in final state n, and where r}n is the rapidity 
of the total momentum of state n. Boost invariance requires that the functions Wn 
[24] depend only on the arguments shown. For other event shapes, such as the 
thrust, slightly more elaborate expressions are necessary. In each case, however, a 
meaningful factorization of the infrared, strong-coupling dynamics from the per- 
turbative region is possible. Eq. (23), although valid for all orders and logarithms, 
applies only to the eikonal cross section. Nevertheless, it appears that the way is 
open for the all-orders analysis of the relation between perturbation theory and 
power corrections in event shape functions and related parameterizations, beyond 
the limitations of next- to- leading logarithm. 



4 Summary 

Short-distance, single-scale observables in QCD are reasonably well understood, 
which is one reason why we have such confidence that QCD is a true theory of the 
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strong interactions. At this time, however, we are still far from a full understanding 
of multi-scale observables, which lead us from perturbative to nonperturbative 
dynamics. Power corrections and event shape functions for infrared safe differential 
distributions may be identified by an extension of the reasoning that leads to 
the operator product expansion for the total e^e“ annihilation cross section. By 
developing the concept of internal resummation for infrared safe cross sections, we 
may find a perturbative window to the formation of final states, and gain insight 
into the transition between short- and long-distance degrees of freedom in quantum 
chromodynamics . 
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The Cosmic Microwave Background 

Joseph Silk 



Abstract. I review the discovery of the temperature fluctuations in the cosmic mi- 
crowave background radiation. The underlying theory and the implications for cos- 
mology are described, and I summarize the prospects for future progress. 



1 Introduction 

The discovery of cosmic microwave background temperature fluctuations in 1964 
has revolutionized cosmology. Prior to 1992, one could only speculate about the 
initial conditions for structure formation. Primordial acausal curvature fluctua- 
tions were first detected by an experiment on the COBE satellite that mapped the 
sky on angular scales in excess of 5 degrees, much larger than the causal horizon 
of approximately one degree at last scattering. This provided the first glimpse of 
what was to come over the next ten years. Provided one adopted the inflation- 
ary prescription of scale- invariant fluctuations, one could make the connection to 
the long sought seeds of large-scale structure formation. But the connection was 
tenuous, and purely theoretical. 

There were two stages to the discovery. As early as 1967, the prediction was 
made [1] that intrinsic horizon-scale density fluctuations could be as large as 10 
percent without exceeding the current limits on the cosmic microwave background 
isotropy at that time. These fluctuations were considered to be on scales in excess 
of the last scattering horizon and were imprinted acausally at the beginning of the 
universe. The connection with galaxy formation arising from primordial irregular- 
ities of infinitesimal amplitude, the current paradigm for structure formation, was 
also made in 1967 [2]. The relic temperature fluctuations are causally generated 
from sound waves in the primordial baryon-photon plasma on the last scattering 
surface of the cosmic microwave background at redshift 2 : ~ 1000. 

The search for cosmic microwave background temperature fluctuations was 
long. Results from the COBE far infrared spectrophotometer (FIRAS) showed in 
1990 that the spectrum of the cosmic microwave background did not deviate from 
a blackbody by more than 0.01% of its peak brightness. Large-angular scale fluctu- 
ations in the CMB were eventually discovered in 1992 using the COBE differential 
microwave radiometer (DMR) [3] at a level ST/T ~ 10~^, to within a factor of 
two of the expectation from the twin hypotheses of structure formation by gravi- 
tational instability and a scale- invariant spectrum of primordial adiabatic density 
fluctuations. However, the extrapolation from the COBE observations to scales of 
direct interest to structure formation, from £ 20 to £ 200, where the spherical 
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harmonic wave- number i ^ 180°/^ for angular scale 9, was large. It took nearly a 
decade after the COBE launch before direct detection of the elusive fluctuations 
that gave rise to large-scale structure was achieved. 

Sub-degree scale temperature fluctuations from the surface of last scattering 
were finally confirmed, and in particular the first peak was measured, by a series of 
balloon and mountain- top experiments [4], [5], [6], [7], [8], [9], [10], that spanned the 
range ^ ^ 20 to ^ ~ 2000. The causal adiabatic fluctuations were amplified from 
primordial curvature fluctuations by subhorizon growth in the matter-dominated 
era. This enhancement was detected as a series of so-called acoustic peaks, the first 
and best-measured feature (known to better than ten standard deviations by the 
end of 2002) corresponding to the angular scale of the horizon at last scattering. 
The peaks are driven by sound waves in the primordial baryon-photon plasma, and 
the sound wave amplitude measures the seeds of present-day structure formation. 

The theory of gravitational instability in the expanding universe as the driver 
of growth of structure from infinitesimal primordial density fluctuations, pioneered 
in a remarkable paper dating from 1946 [11], was dramatically confirmed. Gravita- 
tional instability accounts for the origin of large-scale structure and in particular 
for the formation of the galaxies. The primordial structure was inhomogeneous at 
a level of around one part in 100,000. On comoving scales in excess of 100 Mpc, 
the universe is still uniform today and approaches this level on the horizon scale. 
However subhorizon growth means that the comoving scales of galaxy clusters 
(approximately 10 Mpc) were first nonlinear at a redshift of order unity, massive 
galaxies at a redshift of around 3, and dwarf galaxies at a redshift of order 10. 
These inferences come from measurement of the power spectrum of temperature 
fluctuations, from which the density fluctuation spectrum can be recovered. The 
discovery of the fossil fluctuations that seeded structure formation is a stunning 
intellectual achievement of the 20th century. 

Complementary information comes from deep galaxy redshift surveys and 
from the line-of-sight correlations of intergalactic clouds viewed via Lyman al- 
pha absorption against high redshift quasars. It is remarkable that the fluctuation 
spectrum measured at a redshift of 1000 agrees in shape and normalization with in- 
dependent measurements of large-scale structure [12], [13]. These are performed at 
redshifts of 0.3 and 3, corresponding to the typical depths of the 2DF/SDSS galaxy 
samples and the Lyman alpha forest [14]. Even the bias factor, which measures 
the ratio of the variance in galaxy counts to dark matter density fluctuations, can 
now be inferred via a combination of any two of several independent techniques: 
large-scale clustering [15], CMB data [16], weak lensing [17], [18], and the galaxy 
cluster abundance [19]. A bias of approximately 4/3 is inferred on galaxy cluster 
scales. This means that the rms fluctuations in dark matter density are 75 percent 
of the variance in luminous galaxy counts sampled over randomly placed spheres 
of 10 Mpc diameter. Galaxy formation is only weakly biased. 
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2 Some history 

The cosmic microwave background and the temperature fluctuations therein are 
two of the major predictions of the Big Bang theory. Others are the expansion 
of the universe and the light element abundances. The latter motivated the cos- 
mic microwave background prediction. Friedmann and independently Lemaître are 
credited with the first predictions of the expansion of space. Indeed Lemaître pre- 
dicted a linear relation between galaxy redshift and distance, verified by Hubble 
in 1929. Curiously, Hubble never accepted the Big Bang interpretation of his red- 
shift law as the expansion of space. It has even been speculated that Hubble’s 
reluctance to accept that the eponymous law was a fundamental contribution to 
the confirmation of the theory of the expanding universe may explain why Nobel 
recognition eluded him. 

Gamow pioneered the predictions of element nucleosynthesis in the first few 
minutes of the Big Bang [20], but the important recognition by Hayashi in 1950 
that the neutron abundance was determined by thermal equilibrium in the pri- 
mordial plasma set the scene for the later precise predictions of the light element 
abundances. Gamow’s associates Alpher and Herman [21] realized in 1949 that the 
inference the universe necessarily was hot at the epoch of primordial nucleosyn- 
thesis implied that, in order to get a reasonable helium abundance, the present 
epoch radiation background was characterized by a temperature of 5 K. However 
they failed in any published work over the following decade to make the connection 
with an observable microwave background. Indeed, later papers speculated about 
a radiation temperature today as high as 50K. 

The connection with observations was first mentioned in 1963 by Zeldovich 
[22], and in more detail in 1964 by Doroshkevich and Novikov [23], who even 
interpreted a Bell Labs measurement of the atmospheric temperature as setting a 
limit of only 1 K on the cosmic radiation background to be evidence for an initially 
cold universe. But in effect an extraterrestrial signal was lurking in the measured 
antenna temperature. Evidence of absence did not guarantee absence of evidence, 
and the cosmic microwave background was discovered within a year via the epochal 
detection by Penzias and Wilson of what they described as an excess antenna 
temperature, isotropic in the sky. Penzias and Wilson did not initially realize the 
significance of their discovery, and the connection with the Big Bang was made 
in a pioneering companion paper by Dicke and collaborators at Princeton, who 
had been searching, simultaneously and independently, for the cosmic microwave 
background. The Princeton group realized that the early universe would have 
been an ideal furnace for generating blackbody radiation. Dicke in particular, an 
experimentalist who was apparently unaware of Gamow’s work and was convinced 
that the Big Bang was cyclic, had long realized that light element synthesis in the 
Big Bang implied a present day blackbody radiation held at a few K, peaking at 
microwave frequencies. 

Observations of the blackbody-like spectral dependence of the CMB accu- 
mulated over the next two and a half decades, but it was not until the launch 
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of the COBE FIRAS interferometer that a definitive spectral measurement was 
forthcoming. FIRAS confirmed the CMB spectrum was that of a blackbody at 
2.725 zb 0.002K, with a precision limited only by that of the on-board calibration 
source [24]. The cosmic blackbody spectrum is comparable to that of the best 
laboratory blackbody spectral determination. 

Parallel theoretical developments focussed on angular variations in the CMB 
temperature. In 1967, Sachs and Wolfe had predicted that primordial curvature 
fiuctuations could exist on horizon scales, ~ 1000 Mpc, generating temperature 
fiuctuations on large angular scales with an amplitude of order 1 percent. These 
would have been acausal and attributable to the initial conditions of the Big Bang. 
Subsequent studies of the first instants of the universe led to the development in 
1981 of inflationary cosmology, which predicted adiabatic, gaussian fiuctuations 
that were boosted from quantum fiuctuations to macroscopic scales during an 
early de Sitter phase. The universe expanded exponentially during a brief moment, 
triggered by the spontaneous breaking of the GUT symmetry at a temperature 
of about 10^^ GeV. The theory also predicted that the fiuctuations should be 
approximately scale- invariant, although the amplitude was not specified, in agree- 
ment with earlier conjectures that have become known as the Harrison-Zeldovich 
spectrum ([25], [26], [27]). Many versions of inflation also subsequently predicted 
this, but perhaps the most noteworthy prediction of inflation is that the universe 
should be fiat ([28], [30], [29]). 

Another pre- inflationary prediction, and one that inflation did not, and could 
not, provide, was that of the strength of the primordial density fiuctuations. This 
yielded the first quantitative estimate of the fiuctuations that are required to have 
seeded structure by gravitational instability-induced growth. A simple prediction 
based on adiabatic fiuctuations required ST jT to be a few parts in ten thousand at 
last scattering on angular scales of a few arc minutes in order to have formed galax- 
ies and clusters by the present epoch [2]. The prediction simply utilized the adia- 
batic relation that, if recombination were instantaneous, would give STjT^ ^Spj p. 

The current prediction is about a factor of 10 smaller than implied by this 
simple expression. What causes the diminution? The realization that decoupling 
of the two fluids was not instantaneous [31] results in a substantial decrease of 
the relic temperature fiuctuations [36]. Modern treatments, which incorporated 
cold dark matter, were first performed in 1984 ([32], [33]), and allow fluctuation 
growth after the epoch of matter- radiation equality but prior to last scattering, 
anticipating a peak value ST jT ~ 3 x 10“^ on an angular scale of about half a 
degree. This corresponds to the horizon scale at last scattering. 

The predictions refer to causal fiuctuations generated by gravitational insta- 
bility on subhorizon scales during the era of matter domination, and apply on the 
largest scales where structure is seen, namely galaxy clusters and superclusters, 
and hence to angular scales of order tens of arc minutes. The coherent peaks in the 
radiation density are due to compressions and rarefactions of sound waves in the 
primordial plasma as viewed at last scattering. The occurrence of multiple peaks in 
the radiation power spectrum was first described in 1976 [34], and were only accu- 
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lately described as a generic description developed from a multipole expansion of 
the CMB sky in 1981 [35]. This approach incorporated the first application to tem- 
perature fluctuation predictions of the coupling of the primordial photon-baryon 
plasma by the Boltzmann equation, originally given in a classic paper in 1970 [26]. 
The latter paper indeed reported the existence of coherent baryon oscillations of 
the adiabatic mode in the matter-radiation plasma at matter-radiation decoupling, 
also independently noted by [36]. These two papers quantify the so-called Sakharov 
oscillations [37] in the matter power spectrum. These matter power spectrum os- 
cillations have recently been detected, albeit with modest significance, in the 2DF 
galaxy survey [38]. 

Progressively more detailed discussions in the 1990s describe a series of acous- 
tic peaks, interpreted in the context of inflationary amplification of quantum 
fluctuations. These result in temporally coherent fluctuations on the surface of 
last scattering ai z ^ 1000 that are driven by the growing adiabatic mode of linear 
theory. 

The connection between ST /T and structure formation presented a definitive 
experimental target in the 1970s and 1980s, although the goalposts were moved in 
1984 when cold dark matter was first included into the computational recipe. The 
prevalence of weakly interacting dark matter that dominates the baryon density 
by a factor ~ 10 means that substantial fluctuation growth occurs between the 
epoch of the onset of matter domination at 2 ~ 4 x 10'\ prior to last scattering 
at 2 ~ 10^. This reduces the predicted fluctuation strength by a factor ~ 4, and 
significantly modifies the detailed structure of the radiation power spectrum. 

Upper limits were successively improved. Only with the COBE DMR detec- 
tion in 1992 of large angular scale fluctuations attributed to the acausal conditions 
at the beginning of the universe was it realized that the elusive, causally amplified, 
subhorizon, fluctuations on the last scattering surface that seeded the observed 
large-scale structure might be within reach once foregrounds could properly be 
accounted for. 

A series of experiments led to multifrequency measurements that could elim- 
inate dust and synchrotron galactic foregrounds. These had plagued much of the 
earlier work, but bolometer sensitivities improved and microwave receiver noise 
temperatures were reduced while bandwidths were increased. The backgrounds 
could be modeled with sufficiently high sensitivity, wide spectral frequency and 
sky coverage. The definitive measurements came in at the beginning of the new 
millennium, from the BOOMERANG and MAXIMA bolometric balloon experi- 
ments and the DASI, VS A and CBI mountain-based interferometers. 

The conclusions have revolutionized cosmology. The hypothesis of structure 
formation by gravitational instability of primordial infinitesimal inhomogeneities 
has been verified. The flatness of the universe is measured: the first peak occurs at 
£ = 215±10 and requires Hq = 1.02±0.06. The power spectrum of the fluctuations 
is described by a constant index power law fit by n = 0.96 dz 0.06, essentially the 
scale- invariant expectation. The baryon abundance is confirmed to be = 

0.021 ± 0.04. All of these quantities are cited at 68% confidence (e.g. [10], [39]). 
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3 Some theory 

Detection of the CMB fluctuations is a triumph of the simplest Big Bang cos- 
mology: confirmation of a linearly perturbed expanding universe. To describe the 
anisotropies in a Euclidean geometry, one expands the temperature field on the 
celestial sphere in spherical harmonics 

/VT ^ 

= E! °‘(my(m{n) . (1) 

e^2 m^-e 

By definition the mean value of the a£rn is zero. 

The correlation function (or 2— point function) of the temperature field C{9) 
is the average of AT /T {hi) AT /T{fi 2 ) across all pairs of points in the sky (n 1 , 722 ) 
separated by an angle 0 ( cosO = ni • n 2 ). One usually assumes statistical isotropy 
and Gaussian statistics. One can now write down [40] a multipole expansion 

C(0) = A Y^{2ê+l)CePe{cose) (2) 

e 

where Pi{cos0) are the Legendre polynomials and the are the multipole mo- 
ments. 



Ci — ^ ^ (^) 

and where i oc 6~^. 

Each Ce comes from averaging over + 1 modes, and the sample variance 
error on Ce is 

^Ce _ I 2 ^- 1/2 

where a fraction /sky of the sky is observed [41], [42]. The contribution to the 
logarithmic power is usually written as ^(^-h 1)C^, this combination being constant 
at low £ for scale- invariant fluctuations. 

The radiation power spectrum shows two characteristic angular scales. A 
prominent peak occurs diX, i ^ 220 or about 30 arc-minutes. This is the angular 
scale that corresponds to the horizon at the moment of last scattering of the 
radiation. The corresponding comoving scale, corresponding to the wavelength of 
a wave that just spans this horizon, is approximately 100 Mpc. The second scale 
is the damping scale of about 6 arc- minutes, where the finite thickness of the last 
scattering surface just spans a wave of comoving scale around 10 Mpc and damps 
out temperature fluctuations on smaller scales [43] . 

The principal features that separate these scales are the Sachs- Wolfe plateau 
at low £, the series of peaks between i = 200 — 2000, and the damping tail. The 
peaks are the relic of coupled photon-baryon oscillations, in effect sound waves 
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in the primordial plasma, at the epoch of last scattering of radiation and matter. 
One finds a series of peaks because the growing mode of fiuctuations is the only 
surviving mode, and are phase- locked on super- horizon scales most likely due to 
their infiat ionary origin. Waves that just crest at last scattering produce the first 
adiabatic peak in the CMB. Waves of half the last horizon scale also crest, as 
do smaller waves until damping sets in at about 10 percent of the wavelength 
of the first acoustic peak. The peaks in the power spectrum are associated with 
successive maxima and minima of density waves in the coupled photon-baryon 
plasma, alternately in and out of phase with the dark matter potential wells that 
dominate the gravity and correspond to compressions and rarefactions on the last 
scattering surface that in turn generate hot spots and cold spots in the radiation 
on the last scattering surface. In quadrature, these are represented as a series of 
peaks in the power spectrum. 

Coherent peaks are generated because an inflationary, or more generally, a 
very early, origin means that only the growing mode of fluctuations are present, 
and since no growth occurs on super horizon scales, the phases of the waves can 
only evolve once growth commences after entering the horizon. Hence for exam- 
ple any wave that just spans the horizon at last scattering has the same phase. 
Any wave that spans half the horizon at last scattering has had exactly the same 
amount of subhorizon growth, so these too are in phase. Now the wave amplitude 
is proportional to cos{kt/a), where k/a is the wave-number, a{t) is the cosmo- 
logical scale factor, and the initial conditions are such that the amplitude must 
be constant (and independent of wave-number for a scale- invariant spectrum) on 
super- horizon scales 27ra/A; ^ ct. This is the Sachs- Wolfe effect. The wave crest- 
ing at last scattering Ils gives the first peak, and there is a series of compres- 
sion/rarefaction peaks at kihsl^^ = (n + 1)7t, n = 0, 1, 2 . . . , and Doppler peaks 

at ktLstd — {n 1/2)7t, n = 0, 1,2 The compression mode is found to be 

dominant. 

The Doppler effect [36] generates smaller peaks that are 90 degrees out of 
phase with the compression mode, and the effect of enhancing the baryon compo- 
nent is to strengthen the compressions relative to the rarefactions because of the 
baryonic contribution to the gravity [44] . The baryon density controls the rarefac- 
tion amplitude relative to the compression amplitude, so the odd-even alternation 
of peak strengths enables to be read off the sky. 

The influence on the radiation power spectrum of spatial curvature with ac- 
curate matter-radiation coupling was first computed in 1981 [45]. In curved space, 
a generalized multipole expansion is necessary. One finds that spatial curvature 
modifies the angular distribution but its effects are readily disentangled. In the 
modern language of acoustic peaks, the peaks are found to be shifted by a factor 
of order towards smaller angular scales in hyperbolic spaces, for progressively 
lower O [46], [47]). Confirmation of the flatness of the universe via measurement 
of the peak location represents a major triumph of inflationary cosmology. 

There are many complications, some welcome, some less so. The overall am- 
plitude of the peaks is largely controlled by or more precisely the ratio of 
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matter to radiation densities and by There is a degeneracy between Çlrn and 
ÇIa that cannot be resolved by the CMB alone [48]. Reionization suppresses the 
total power, and in particular all of the peak heights, and can be constrained [49]. 
In fact, Üa results in lensing of the peaks and redistributes power to small scales, 
so only very high i measurements can help break this degeneracy [50] In practice, 
the degeneracy is best broken by correlation with deep redshift surveys since the 
effects of Üa are most important at 2 : 1 [51]. 

4 Implications 

One frontier has recently been breached in the CMB. The first detection of po- 
larization has been reported by the DASI interferometer. The polarization in an 
FRW universe is due to the anisotropy of the Thompson scattering cross-section 
combined with the intrinsic quadrupole component of the radiation fluctuations 
[52]. This results in polarization that typically peaks at the angular scale of the 
last scattering surface. Detection provides additional confirmation of the standard 
model. 

The cosmic microwave background fluctuations provide an important probe of 
inflation. This arises in two ways. Firstly fluctuations on angular scales larger than 
a degree are acausal at last scattering and so are generated at GUT scales during 
inflation or even earlier as part of Planck era physics. One is directly viewing the 
impact of quantum fluctuations. Secondly, gravity waves are generically generated 
as a tensor mode in inflationary models. These contribute only to the large angular 
scale anisotropies, since the tensor mode is incompressible and redshifts away on 
subhorizon scales once the universe is matter-dominated [53]. One can constrain 
the tensor/scalar ratio from the ratio of the low i amplitude to the peak heights, 
combined with the 2DF power spectrum, to be less than about 70 percent [55]. 

Polarization of the temperature fluctuations will eventually provide a much 
more constraining probe of both the ionization history and primordial gravity 
waves [54] . In general, the polarization can be separated into two modes that can 
be expressed as the gradient and the curl of a potential. Electron scattering of 
a radiation held with an intrinsic quadrupole component generates the gradient 
mode of polarization. Gravitational lensing couples this gradient to the curl mode. 
However only gravity waves give an intrinsic curl component of polarization. This 
provides a unique signature that can in principle be disentangled from the induced 
curl polarization, and should be detectable on degree scales or larger at the 0.5pK 
level for typical inflationary models [56]. 

Reionization further complicates matters, since late electron scattering also 
polarizes the CMB. For typical models, the detailed Ci spectrum of scalar- induced 
polarization should be distinguishable at a level of 5-10/iK on small angular scales 
and around l^K on large scales {£ ^ 200) [57]. Late reionization produces the 
gradient mode of polarization and has a characteristic low £ bump corresponding 
to the horizon size at reionization [58] , [59] . In fact the gradient terms contribute to 




Vol. 4, 2003 The Cosmic Microwave Background 



S283 



polarization at a level of about 0.1(5T/T, and this has been detected by the DASI 
interferometer on 5-30 arc-minute scales [60]. The MAP satellite should be capable 
of seeing the characteristic peak associated with reionization, if this occurred at 
2 > 10 . 

The tensor contribution to the curl component of polarization is only of order 
O.OIJT/T, and detection will require dedicated experiments that must await the 
post-Planck era, 2008 or later. Detection of the gravity wave mode would provide 
another powerful confirmation of inflation, which generically predicts a mixture 
of scalar and tensor relic fluctuation modes with approximately scale- invariant 
spectra. 

In contrast, pre-Big Bang and ekpyrotic models usually predict blue spectra 
of primordial gravity waves that would not leave any detectable CMB relic polar- 
ization. Intrinsic non-gaussianity is an aspect of generic very early universe models 
that could also help constrain inflation. There are hundreds of variants of inflation- 
ary cosmology, and it is difficult to come up with any definitive tests, other than 
perhaps the prediction of flatness of space. Multiple scalar fields that drive several 
phases of inflation, or interact during the primary phase, will lead to non-gaussian 
fluctuations. Topological defects such as cosmic strings or textures that are gen- 
erated during post-inflationary reheating are a promising source of non-gaussian 
fluctuations on small angular scales, although these must be subdominant with 
regard to structure formation. 

At high i, the observed damping provides an independent probe of 0&, and 
confirmation of the physics of last scattering. Beyond i ~ 2000, non-linear effects 
play an important role. The most prominent of these is the Sunyaev-Zeldovich 
(SZ) effect, which is the cumulative contribution of hot gas in distant unresolved 
clusters along the line of sight, and is uniquely distinguishable by its spectral 
characteristics as an apparent deficit below 218 GHz in the rest frame and excess 
at higher frequency, relative to the CMB. There is a reported 2a detection at 
t ~ 2500 by the CBI experiment which however only covers a narrow frequency 
range near 30 Ghz [61]. 

The Ostriker- Vishniac [62] and Rees-Sciama [63] effects are also of interest at 
i ^ 3000, due respectively to correlations of linear compressions and rarefactions 
with Doppler contributions along the line of sight and to traversal by CMB photons 
across the time-dependent potential wells associated with forming galaxy clusters, 
respectively. These anisotropies are expected to amount to only a percent or so of 
the SZ effect, as also is the kinematic SZ effect. However the challenge of detecting 
secondary fluctuations is important, since they probe the epoch of reionization. 
Another aspect that is characteristic of secondary fluctuations is that they are 
expected to be intrinsically non- Gaussian, as well as being anti- correlated with 
the primary fluctuations. In general, reionization damping of primary fluctuations 
generates secondary fluctuations and enhances the polarization: a recent discussion 
is given in [64]. 

The acausality of large angular scale fluctuations ( ^ 1°) means that they 
were generated either as initial conditions or by an early de Sitter phase of inflation. 
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The latter occurred in the first 10“^^ second, and amplified quantum fluctuations 
onto macroscopic scales. An example of a novel approach to initial conditions 
comes from the ekpyrotic cosmology. This model dispenses with inflation and sets 
up the fluctuations in a pre-Big Bang phase that can be identified with the past 
history of two colliding branes. The brane intersection signals the initial moment 
of the Big Bang at the Planck time. This is also a singularity in space-time, and 
the calculations of the emergent density fluctuations are controversial. 

The synthesis of baryons occurs at electroweak (100 GeV) or GUT (10^^ GeV) 
scales, and is responsible for the specific entropy of the universe, that is, the number 
of photons per baryon observed in the GMB. It is tied in with lepton synthesis and 
generation of lepton number L , since B — L is generally conserved in subsequent 
phase transitions that may dilute any primordial baryon number B. The blackbody 
spectrum itself was generated in the first hours of the Big Bang. Double Compton 
scattering and bremsstrahlung are the key processes responsible for thermalization. 
An essentially perfect blackbody is generated. No trace of thermalization processes 
is left, other than by possible late energy injection after the thermalization epoch. 

The matter and radiation remain coupled until a redshift of about 1000. 
Hence the temperature fluctuations trace the degree of matter inhomogeneity over 
the first 300,000 years of the universe. One mostly measures the imprint of density 
fluctuations on the last scattering surface. These dominate the acoustic peaks. 
There is also a contribution at lower i from the integrated Sachs- Wolfe effect, 
both before and after last scattering, due to the decay of the ratio of the radiation 
to the matter densities. 

The temperature fluctuation mapping has led to an era of precision cosmol- 
ogy. One measures in addition to Hq and n, as well as a combination of Qa 
and Hq. With independent determinations of from large-scale structure, the 
standard model is well defined and well constrained. 

5 Towards the Future 

There are some 10 parameters that describe the GMB fluctuations, in addition to 
the 6 or so that characterize the standard moodel of cosmology. What is remarkable 
is that precise measurements of the GMB fluctuations can potentially measure 
almost all of these parameters with unrivaled precision. There are degeneracies, 
most notably between flrn and ÇIa, but these can largely be resolved by cross- 
correlating with other data such as wide area weak lensing surveys. These are 
exciting times for the cosmic microwave background, especially in combination 
with other data sets. The precision of the measurements is vastly superior to that 
of a decade ago. The overall picture is one of consistency with what is rapidly 
emerging as the standard model of cosmology. 

There are less conventional issues that the GMB may eventually tackle. Grav- 
ity could change on very large scales, and the GMB uniquely probes the largest 
scales via the integrated Sachs- Wolfe effect, thereby providing a potentially unique 




Vol. 4, 2003 The Cosmic Microwave Background 



S285 



probe of exotic gravity models [65] . The topology of the universe is not predicted 
by general relativity, or for that matter by quantum gravity. The CMB may pro- 
vide the only means of ever detecting a topological signature of space-time. A flat, 
compact topology can undergo inflation from a quantum origin [66]. If the topo- 
logical scale is on the order of the horizon scale, a detectable imprint in the form 
of a non-Gaussian pattern is generated on the CMB sky [67]. The existing COBE 
data eliminate all toroidal topologies if the topological scale is of order the present 
horizon scale or less [68] . The universe is locally isotropic but globally anisotropic 
in many of the topological representations, which, in the case of a flat universe, 
only amount to a handful of more general possibilities. It will be interesting to 
eventually explore some of the resulting patterns with more sensitivity and better 
resolution [69]. 

One can hope to probe the beginning of the universe; indeed, one already 
does this with large angular scale fluctuations. There is considerable support for 
inflation from the observed flatness of the universe, as well as from the approximate 
scale invariance of the power spectrum and the coherence of the acoustic peaks, 
although one hesitates to cite these latter two predictions as robust inferences from 
a theory which is neither unique nor able to account for the amplitudes of the 
peaks. The acausality at the epoch of last scattering on large angular scales could 
equally be created in the quantum chaos of the Planck epoch or even earlier, via 
pre-Big Bang physics, all that is required being phase coherence on superhorizon 
scales of matter and radiation fluctuations. The polarization signal of a relic gravity 
wave background presents the ultimate challenge, and the ultimate probe. 

Most of this is for the future. At present, one has some noteworthy highlights. 
Gravitational instability theory of structure formation has been verifled. The uni- 
verse has been found to be spatially flat. The CMB radiation power spectrum has 
provoked the death of topological defects as a signiflcant source of seeds for struc- 
ture formation. A purely baryonic universe can be discarded. This is remarkable 
progress in modern cosmology. 
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Abstract. 

We briefly review two aspects of string cosmology: (1) the presence of chaos 
in the generic cosmological solutions of the tree-level low-energy effective actions 
coming out of string theory, and (2) the remarkable link between the latter chaos 
and the Weyl groups of some hyperbolic Kac-Moody algebras. 

A crucial problem in string theory is the problem of vacuum selection. It 
is reasonable to believe that this problem can be solved only in the context of 
cosmology, by studying the time evolution of generic, inhomogeneous (non-SUSY) 
string vacua. Let us recall that Belinskii, Khalatnikov and Lifshitz (BKL) have 
discovered [1] that the generic solution of the four- dimensional Einstein’s vac- 
uum equations has a very complex structure, characterized by a non-monotonic, 
never ending oscillatory behaviour near a cosmological singularity. The oscillatory 
approach toward the singularity has the character of a random process, whose 
chaotic nature has been intensively studied [2]. [See [3] for a summary of the ev- 
idence supporting the BKL conjectural picture.] The qualitative behaviour of the 
generic solution near a cosmological singularity depends very much: (i) on the field 
content of the system being considered, and (ii) on the spacetime dimension D. 
For instance, it was surprisingly found that the chaotic BKL oscillatory behaviour 
disappears from the generic solution of the vacuum Einstein equations in space- 
time dimension D > 11 and is replaced by a monotonie Kasner-like power-law 
behaviour [4]. Second, the generic solution of the Einstein-scalar equations also 
exhibits a non-oscillatory, power- law behaviour [5], [6] (in any dimension [7]). 

In superstring theory [8, 9] there are many massless (bosonic) degrees of 
freedom which can be generically excited near a cosmological singularity. They 
correspond to a high-dimension (Z) = 10 or 11) Kaluza-Klein-type model contain- 
ing, in addition to Einstein’s Zl- dimensional gravity, several other fields (scalars, 
vectors and/or forms). In view of the results quoted above, it is a priori unclear 
whether the full field content of superstring theory implies, for the generic cos- 
mological solution, a chaotic BKL-like behaviour, or a monotonie Kasner-like one. 
It was found in [10, 11, 12] that the massless bosonic content of all superstring 
models {D = 10 IIA, IIB, I, hetE, hetso)? a,s well as of M-theory {D = 11 su- 
pergravity), generically implies a chaotic BKL-like oscillatory behaviour near a 
cosmological singularity. [The analysis of [10, 11, 12] applies at scales large enough 
to excite all Kaluza-Klein-type modes, but small enough to be able to neglect the 
stringy and non-perturbative massive states.] It is the presence of various form 
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fields (e.g. the three form in SUGRAn) which provides the crucial source of this 
generic oscillatory behaviour. 

Let us consider a model of the general form 

S = l d^x^[R{g)-d^^æ^-^ (1) 

Here, the spacetime dimension D is left unspecified. We work (as a convenient 
common formulation) in the Einstein conformal frame, and we normalize the ki- 
netic term of the “dilaton” (p with a weight 1 with respect to the Ricci scalar. The 
integer p > 0 labels the various p- forms Ap = present in the theory, with 

field strengths Fp^i = dAp, i.e. F^xo/xi-.-^p = permutations. The 

real parameter Ap plays the crucial role of measuring the strength of the coupling 
of the dilaton to the p-form Ap (in the Einstein frame). When p = 0, we assume 
that Ao ^ 0 (this is the case in type IIB where there is a second scalar). The 
Einstein metric is used to lower or raise all indices in Eq. (1) {g = — det g^y). 
The model (1) is, as it reads, not quite general enough to represent in detail all 
the superstring actions. Indeed, it lacks additional terms involving possible cou- 
plings between the form fields (e.g. Yang-Mills couplings for p = 1 multiplets, 
Chern-Simons terms, {dC 2 — Co d B 2 )^-type terms in type IIB). However, it has 
been verified in all relevant cases that these additional terms do not qualitatively 
modify the BKL behaviour to be discussed below. On the other hand, in the case 
of M-theory (i.e. D = 11 SUGRA) the dilaton cp is absent, and one must cancel 
its contributions to the dynamics. 

Let us sketch the derivation of the structure of the general cosmological so- 
lution of Eq. (1). [See [13] for a more detailed derivation.] The leading Kasner-like 
approximation to the solution of the field equations for g^y and derived from 
(1) is, as usual [1], in the Einstein-frame 

d 

gpy dx^ dx^ —dt^ -f (cj^)^ , ~ p^p (x) In t -h (x) , (2) 

where d = D — 1 denotes the spatial dimension and where co^ (x) = e'j (x) dx^ 
is a time- independent d-bein. The spatially dependent Kasner exponents pi (x). 
Pep (x) must satisfy the famous Kasner constraints (modified by the presence of 
the dilaton): 

( 3 ) 

2=1 \ 2=1 / 2=1 

The set of parameters satisfying Eqs. (3) is topologically a (d — l)-dimensional 
sphere: the “Kasner sphere”. When the dilaton is absent, one must set p<p to zero 
in Eqs. (3). In that case the dimension of the Kasner sphere is d — 2 = D — 3. 

The approximate solution Eqs. (2) is obtained by neglecting in the field equa- 
tions for gpy and (p: (i) the effect of the spatial derivatives of g^y and p, and (ii) 
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the contributions of the various p-ïoTm fields Ap. The condition for the “stabil- 
ity” of the solution (2), i.e. for the absence of BKL oscillations at t 0, would 
be that the inclusion in the field equations of the discarded contributions (i) and 
(ii) (computed within the assumption (2)) be fractionally negligible as f 0. As 
usual, the fractional effect of the spatial derivatives of (f is found to be negligible, 
while the fractional effect (with respect to the leading terms, which are oc t~^) of 
the spatial derivatives of the metric, i.e. the quantities (where R^j denotes 

the d-dimensional Ricci tensor) contains, as only “dangerous terms” when ^ 0 a 

sum of terms oc ^ where the gravitational exponents gijk (i 7 ^ j, i ^ k, j ^ k) 
are the following combinations of the Kasner exponents [4] 

3yfc(p) = 2p,+ ^ pe = l+p^-pj -pk- (4) 



The “gravitational” stability condition is that all the exponents gijk (p) be positive. 
In the presence of form fields Ap there are additional stability conditions related 
to the contributions of the form fields to the Einstein-dilaton equations. They are 
obtained by solving, à la BKL, the p-form field equations in the background (2) 
and then estimating the corresponding “dangerous” terms in the Einstein field 
equations. When neglecting the spatial derivatives in the Maxwell equations in 
first-order form dF = 0 and Ô (e^p"^ F) =0, where d = * d * is the (Hodge) dual of 
the Cart an differential d and Fp+i = dAp^ one finds that both the “electric” com- 
ponents y/g e^p"^ , and the “magnetic” components are constant 

in time. Combining this information with the approximate results ( 2 ) one can es- 
timate the fractional effect of the p-form contributions in the right-hand-side of 
the gpi^- and (^-field equations, i.e. the quantities F and F where 
denotes the stress-energy tensor of the p-form. [As usual [ 1 ] the mixed terms 
which enter the momentum constraints play a rather different role and do not need 
to be explicitly considered.] Finally, one gets as “dangerous” terms when t ^ 0 a 



sum of “electric” contributions oc t 



p(p) 



and of “magnetic” ones oc t 



2 6 ^' 



Here, the electric exponents ■ (where all the indices in are different) are de- 
fined as 



Ap) 



ip) =Ph+Pi2+---+Pip- 2 ^PPV > 

l ( p ) 



( 5 ) 



while the magnetic exponents FF (where all the indices jn are different) are 



• (p) 

Jl • • -Jd — p— 1 ' 



■ Pji + 4 + Pjd-p-i + ^ . 



( 6 ) 



To each p-form is associated a (duality-invariant) double family of “stability” ex- 
ponents The “electric” (respectively “magnetic”) stability condition is 

that all the exponents e^F (respectively, be positive. 

In [10], it was found that there exists no open region of the Kasner sphere 
where all the stability exponents can be simultaneously positive. This showed 
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that the generic cosmological solution in string theory was of the never-ending 
oscillatory BKL type. A deeper understanding of the structure of this generic 
solution wais then obtained by mapping the dynamics of the scale factors, and 
of the dilaton, onto a billiard motion. Let us recall that the central idea of the 
BKL approach is that the various points in space approximately decouple as one 
approaches a spacelike singularity {t ^ 0). More precisely, the partial differential 
equations that control the time evolution of the fields can be replaced by ordinary 
differential equations with respect to time, with coefficients that are (relatively) 
slowly varying in space and time. The details of how this is done are explained in 
[1, 10, 13]. Let us review the main result of [12], namely the fact that the evolution 
of the scale factors and the dilaton at each spatial point can be be viewed as a 
billiard motion in some simplices in hyperbolic space which have remarkable 
connections with hyperbolic Kac- Moody algebras of rank 10. 

To see this we generalize the previous Kasner-like solution by expressing it 
in terms of some local scale factors, a^, without assuming that these scale factors 
behave as powers of the proper time (as was done in (2) which had assumed 
Oi oc t^^). In other words, we now write the metric (in either the Einstein frame 
or the string frame) as = —N‘^{dx^Y -h where d = 

D — 1 denotes the spatial dimension, and where, as above, üü'^(x) = ej{x)dx^ is 
a d-bein whose time-dependence is neglected compared to that of the local scale 
factors Oi. Instead of working with the 9 variables , and the dilaton (/?, it is 
convenient to introduce the following set of 10 field variables: /3^, /Li = 1, . . . , 10, 
with, in the superstring (Einstein- frame) case, = — In (i = 1,...,9), and 
(3^^ = —(p where is the Einstein-frame dilaton. [In M-theory there is no dilaton 
but /Li = z = 1, . . . , 10. In the string frame, we define /3g = — ln(y^e“^^) and 
label )Li = 0, . . . , 9.] 

We consider the evolution near a past (big-bang) or future (big-crunch) space- 
like singularity located at t = 0, where t is the proper time from the singularity. 
In the gauge N = —y/g (where g is the determinant of the Einstein-frame spa- 
tial metric), i.e. in terms of the new time variable dr = —dtj^Jg^ the action (per 
unit comoving volume) describing the asymptotic dynamics of as t ^ 0"^ or 
T ^ -\-oo has the form 



V{p) 

A 

In addition, the time reparametrization invariance (i.e. the equation of motion of N 
in a general gauge) imposes the usual “zero-energy” constraint E = Gf^i,{df3^ /dr) 
{d/3^ /dr) -h V{0^) = 0. The metric in field-space is a 10-dimensional met- 
ric of Lorentzian signature — h H h. Its explicit expression depends on the 

model and the choice of variables. In M-theory, dj3^ d/S"/^ = {d(3^)‘^ - 









-2wa{I3) 



( 7 ) 

(8) 
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(E^ii . while in the string models, d0^ = (d0^s)'^ - {d!5%f 

in the string frame. Each exponential term, labeled by A, in the potential V{/3^), 
Eq. ( 8 ), represents the effect, on the evolution of of either (i) the spatial 

curvature of gij (“gravitational walls”), (ii) the energy density of some electric- 
type components of some p-form ( “electric p-form wall” ), or (iii) the energy 

density of some magnetic- type components of (“magnetic p-form wall”). 

The coefficients Ca are all found to be positive, so that all the exponential walls 
in Eq. ( 8 ) are repulsive. The Cas vary in space and time, but we neglect their 
variation compared to the asymptotic effect of wa{P) discussed below. Each expo- 
nent —2wa{P) appearing in Eq. ( 8 ) is a linear form in the (3^ : wa{P) = wa^iP^- 
The wall forms wa{P) are exactly the same linear forms as the “stability expo- 
nents” which appeared above; one just need to replace the variables pi by 
For instance, one of the “electric” wall forms for a 3-form coupled with A = 0 is 
'^i 2 s{P) = ^i 23 (/^) = P^ 3- P“^ P^ . The complete list of “wall forms” wa{P), was 
given in [10] for each string model. The number of walls is enormous, typically of 
the order of 700. 

At this stage, one sees that the r-time dynamics of the variables P^ is de- 
scribed by a Toda-like system in a Lorentzian space, with a zero-energy constraint. 
But it seems daunting to have to deal with ~ 700 exponential walls! However, the 
problem can be greatly simplified because many of the walls turn out to be asymp- 
totically irrelevant. To see this, it is useful to project the motion of the variables 
P^ onto the 9-dimensional hyperbolic space (with curvature —1). This can 
be done because the motion of P^ is always time- like, so that, starting (in our 
units) from the origin, it will remain within the 10-dimensional Lorentzian light 
cone of This follows from the energy constraint and the positivity of V . 

With our definitions, the evolution occurs in the future light-cone. The projection 
to is performed by decomposing the motion of P^ into its radial and angular 
parts (see [14, 15, 16, 13]). One writes P^ = +P 7 ^ with = -Gfj,u p^ P^ ^ p> Q 
and = — 1 (so that 7 ^ runs over , realized as the future, unit hyper- 

boloid) and one introduces a new evolution parameter: dT = kdrj (? . The action 
(7) becomes 







Vr^pn) 



( 9 ) 



where = G^u d^^ d'y^ is the metric on and where Vr = k~‘^ p^V = 
k~‘^CAp^ exp(— 2 pu’^( 7 )). When t 0“^, i.e. p — » + 00 , the transformed 
potential Vt{p^^) becomes sharper and sharper and reduces in the limit to a set 
of p-independent impenetrable walls located at WA^'y) = 0 on the unit hyper- 
boloid (i.e. Vt = 0 when WAij) > 0, and Vr = +00 when WAij) < 0). In this 
limit, d In p/dT becomes constant, and one can choose the constant k so that 
dlnp/dT = 1. The (approximately) linear motion of P^(r) between two “col- 
lisions” with the original multi-exponential potential V(P^) is thereby mapped 
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onto a geodesic motion of 7 (T) on interrupted by specular collisions on sharp 
hyperplanar walls. This motion has unit velocity (d'y/dT)‘^ = 1 because of the 
energy constraint. In terms of the original variables (3^, the motion is confined to 
the convex domain (a cone in a 10-dimensional Minkowski space) defined by the 
intersection of the positive sides of all the wall hyperplanes wa{(3) 0 and of the 

interior of the future light-cone (3^ (3^ = 0. 

A further, useful simplification is obtained by quotienting the dynamics of j3^ 
by the natural permutation symmetries inherent in the problem, which correspond 
to “large diffeomorphisms” exchanging the various proper directions of expansion 
and the corresponding scale factors. The natural configuration space is therefore 
which can be parametrized by the ordered multiplets < • • • < 

(3^^, This kinematical quotienting is standard in most investigations of the BKL 
oscillations [1] and can be implemented in by introducing further sharp walls 
located at These “permutation walls” have been recently derived from 

a direct dynamical analysis based on the Iwaaawa decomposition of the metric 
[13]. Finally the dynamics of the models is equivalent, at each spatial point, to a 
hyperbolic billiard problem. The specific shape of this model-dependent billiard is 
determined by the original walls and the permutation walls. Only the “innermost” 
walls (those which are not “hidden” behind others) are cisymptotically relevant. 

The final results of the analysis of the innermost walls are remarkably simple. 
Instead of the 0(700) original walls it was found, in all cases, that there are only 
10 relevant walls. In fact, the seven string theories M, IIA, IIB, I, HO, HE and the 
closed bosonic string in = 10 [17], split into three separate blocks of theories, 
corresponding to three distinct billiards. The first block (with 2 SUSY’s in D = 10) 
is B 2 = {M, IIA, IIB} and its ten walls are (in the natural variables of M- theory 

/3" = Æ), 

4o(-S) = (10) 

The second block is Bi = {I, HO, HE) and its ten walls read (when written in 
terms of the string- frame variables of the heterotic theory = P's ^ = Ps'-> 

defined in [12]) 

Bi : iCi^^(o;) = w;f^(o:) = -0;'“^ + a\i = 2 , ... ,9), 

^io(^) — — — (11) 

The third block is simply Bq = {D = 10 closed bosonic} and its ten walls read (in 
string variables) 

Bo : wf\a) — , wf\a) = —a^~^ a^(i = 2, . . , ,9), 

'^lok^) = — oP . (12) 

In all cases, these walls define a simplex of which is non-compact but of finite 
volume, and which has remarkable symmetry properties. 
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The most economical way to describe the geometry of the simplices is through 
their Coxeter diagrams. This diagram encodes the angles between the faces and 
is obtained by computing the Gram matrix of the scalar products between the 
unit normals to the faces, say • wY^ where Wi = Wij y/wi • wi^ i = 

1, . . . , 10 labels the forms defining the (hyperplanar) faces of a simplex, and the 
dot denotes the scalar product (between co-vectors) induced by the metric : 
Wi • Wj = Wi/j, Wjy for Wi{f3) = Wi^ j3^. This Gram matrix does not depend on 
the normalization of the forms Wi but actually, all the wall forms wi listed above 
are normalized in a natural way, i.e. have a natural length. This is clear for the 
forms which are directly associated with dynamical walls in D == 10 or 11, but this 
can also be extended to all the permutation-symmetry walls because they appear 
as dynamical walls after dimensional reduction [12, 13] . When the wall forms are 
normalized accordingly (i.e. such that oc exp(— 2 ii;i(/3)), they all have a 

squared length w^^ = 2, except for = 1 in the Bi block. We can then 

compute the “Cartan matrix” , = 2 corresponding 

Dynkin diagram. One finds the diagrams given in Fig. 1. 

123456789 
52 

10 

123456789 





Bo 



DE\q 



10 




Figure 1: Dynkin diagrams defined (for each n = 2,1,0) by the ten wall forms 
wY\p^)^i — 1 , . . . , 10 that determine the billiard dynamics, near a cosmological 
singularity, of the three blocks of theories B 2 = {M, IIA, IIB}, B\ = {I, HO, HE} 
and B{) = {D = 10 closed bosonic}. The node labels 1, . . . , 10 correspond to the 
form label i used in the text. 

The corresponding Coxeter diagrams are obtained from the Dynkin diagrams 
by forgetting about the norms of the wall forms, i.e., by deleting the arrow in BEio. 
As can be seen from the figure, the Dynkin diagrams associated with the billiards 
turn out to be the Dynkin diagrams of the following rank- 10 hyperbolic Kac-Moody 
algebras (see [18]): Eio, BEio and DEio (for H 2 , B\ and Bq^ respectively). It is 
remarkable that the three billiards exhaust the only three possible simplex Coxeter 
diagrams on with discrete associated Coxeter group (and this is the highest 
dimension where such simplices exist) [19]. This analysis suggests to identify the 
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10 wall forms i = 1, . . . , 10 of the billiards 82 , B\ and Bç) with a basis of 

simple roots of the hyperbolic Kac-Moody algebras Eio, BEio and DEio, so that 
the cosmological billiard can be identified with a fundamental Weyl chamber of 
these algebras. Note also that the 10 dynamical variables p = 1, . . . , 10, can 
be considered as parametrizing a generic vector in the Cartan subalgebra of these 
algebras. 

It was conjectured some time ago [20] that Eio should be, in some sense, 
the symmetry group of SUGRAn reduced to one dimension (and that DEio be 
that of type I SUGRAio, which has the same bosonic spectrum as the bosonic 
string). Our results, which indeed concern the one-dimensional “reduction”^, à la 
BKL, of M/string theories exhibit a clear sense in which Eio lies behind the one- 
dimensional evolution of the block B 2 of theories: their asymptotic cosmological 
evolution as t 0 is a billiard motion, and the group of reflections in the walls of 
this billiard is nothing else than the Weyl group of Eio (i*e. the group of reflections 
in the hyperplanes corresponding to the roots of Eio, which can be generated by 
the 10 simple roots of its Dynkin diagram). It is intriguing - and, to our knowledge, 
unanticipated (see, however, [21])- that the cosmological evolution of the second 
block of theories, B\ = {I, HO, HE}, be described by another remarkable billiard, 
whose group of reflections is the Weyl group of BEio- The root lattices of Eio 
and BE 10 exhaust the only two possible unimodular even and odd Lorentzian 
10- dimensional lattices [18]. 

A first consequence of the exceptional properties of the billiards concerns the 
nature of the cosmological oscillatory behaviour. They lead to a direct technical 
proof that these oscillations, for all three blocks, are chaotic in a mathematically 
well-defined sense. This is done by reformulating, in a standard manner, the bil- 
liard dynamics as an equivalent collision- free geodesic motion on a hyperbolic, 
finite-volume manifold (without boundary) M obtained by quotienting by an 
appropriate torsion-free discrete group. These geodesic motions define the “most 
chaotic” type of dynamical systems. They are Anosov flows [22], which imply, in 
particular, that they are “mixing” . Furthermore, the two Coxeter groups of Eio 
and BEio are the only reflective arithmetic groups in [19] so that the chaotic 
motion in the fundamental simplices of Eio and BEio will be of the exceptional 
“arithmetical” type [23]. We therefore expect that the quantum motion on these 
two billiards, and in particular the spectrum of the Laplacian operator, exhibits 
exceptional features (Poisson statistics of level-spacing,. . .), linked to the existence 
of a Hecke algebra of mutually commuting, conserved operators. 

Recently [13], it has been possible to go beyond the (leading- wall) billiard 
approximation, and to compare the cosmological dynamics of SUGRAn with an 
Eio-invariant one-dimensional model (null geodesic in the infinite dimensional 
coset space Eio/K(Eio) where K(Eio) is the maximal compact subgroup of the 
hyperbolic Kac-Moody group Eio(M)). It was found that one could identify, up 

^Note again that the analysis above concerns generic inhomogeneous solutions depending 
upon D variables. The strict one-dimensional reduction (one variable only) of M-theory has also 
been considered, and has been shown to still contain the Weyl group of Eio [H]- 
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to “height 30”^, the first rungs of the infinite tower of Eio coset fields with the 
values of geometric quantities constructed from SUGRAn fields and their spatial 
gradients taken at some comoving spatial point. 

The discovery that the chaotic behaviour of the generic cosmological solution 
of superstring effective Lagrangians was rooted in the fundamental Weyl chamber 
of some underlying hyperbolic Kac-Moody algebra prompted us to reexamine the 
case of pure gravity [24]. It was found that the same remarquable connection 
applies to pure gravity in any dimension D = d + 1 > 4. The relevant Kac-Moody 
algebra in this case is AE^. It was also found that the disappearance of chaos in 
pure gravity models when D > 11 dimensions [4] becomes linked to the fact that 
the Kac-Moody algebra AEd is no longer “hyperbolic” for d > 10 [24]. 

An interesting aspect of the above “billiard” analysis is to provide hints for 
a scenario of vacuum selection in string cosmology. If we heuristically extend our 
(classical, low-energy, tree-level) results to the quantum, stringy (t ~ tg) and/or 
strongly coupled (ps ~ 1) regime, we are led to conjecture that the initial state of 
the universe is equivalent to the quantum motion in a certain finite volume chaotic 
billiard. This billiard is (as in a hall of mirrors game) the fundamental polytope of 
a discrete symmetry group which contains, as subgroups, the Weyl groups of both 
Eio and BEio [25]. We are here assuming that there is (for finite spatial volume 
universes) a non-zero transition amplitude between the moduli spaces of the two 
blocks of superstring “theories” (viewed as “states” of an underlying theory). If we 
had a description of the resulting combined moduli space (orbifolded by its discrete 
symmetry group) we might even consider as most probable initial state of the 
universe the fundamental mode of the combined billiard, though this does not seem 
crucial for vacuum selection purposes. This picture is a generalization of the picture 
of Ref. [27] and, like the latter, might solve the problem of cosmological vacuum 
selection in allowing the initial state to have a finite probability of exploring the 
subregions of moduli space which have a chance of inflating and evolving into our 
present universe. 
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Solar Neutrino Production 

Douglas Gough 



Abstract. It is commonly and quite plausibly presumed that neutrino flavour tran- 
sitions are responsible for essentially all of the discrepancy between the observed 
flux of neutrinos from the Sun and the theoretically calculated neutrino production 
rate. Indeed, the comparison between the detection rates by SuperKamiokande and 
by the Sudbury Neutrino Observatory are consistent with this presumption. Helio- 
seismological analysis has set quite tight constraints on the conditions in the Sun’s 
core, where the neutrino-emitting nuclear reactions take place. These constraints 
have been obtained subject to certain assumptions which need to be investigated 
before secure precise conclusions concerning neutrino production can be drawn. 



1 The standard solar model 

The so-called standard solar model is not a true standard. It is a standard only 
so far as the general principles of its construction are concerned - and even those 
vary with time. Therefore, when discussing differences between measured neutrino 
fluxes and a standard solar model, one must always be meticulous in recording 
which standard model is being referred to. 

A standard solar model is spherically symmetrical, in hydrostatic equilibrium. 
It is evolved either from some early stage of its existence as a star, such as the 
base of the probably fully convective Hayashi track, or, more frequently, from only 
the putative zero-age main sequence: a homogeneous state in thermal balance, the 
rate of generation of thermal energy by nuclear reactions being balanced by 
the luminosity L radiated by photons at the surface r — R. The only process that 
modifies the structure (on the main sequence) after the zero age is the change in 
chemical composition. This is brought about principally by nuclear transmutation 
in the core, and, to a much lesser extent, by the gravitational settling of heavy 
elements. The gravitational energy liberated by these processes is tiny compared 
with the nuclear energy. No internal motion is explicitly considered, with the ex- 
ception that in the convection zone (which occupies the outer 29 per cent by radius 
of the model) the chemical composition is taken to be homogenized, and a simple 
mixing-length formula is adopted to relate the flux of heat and, rarely, momentum 
to the superadiabatic temperature gradient. Thus there is no macroscopic mixing 
of the products of the nuclear reactions in the core. The mass of the model is con- 
served - there is neither mass loss nor accretion - and rotation and the magnetic 
field are ignored. It seems that all of these simplifications are quite reasonable. 
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Moreover, they render it possible, in principle, for anyone to reproduce the results 
(within numerical error). 

Given the mass Mq of the star, the model is evolved for a time to its 
current state, defined according to its radius R and its luminosity L, and the 
surface abundance ratio ZsJXs, which is determined spectroscopically; here X 
is the relative hydrogen abundance (by mass) and Z is the abundance of heavy 
elements (i.e. all elements other than H and "^He), the abundance of ^He being 
Y = 1 — X — Z. That current state is achieved by choosing appropriate values for 
the initial composition {Xq,Yq, Zq), and also for a parameter a which relates the 
mixing length in the prescription for convection to the pressure scale height. The 
age to is believed to be about 4.55 Gy (e.g. von Hippel, Simpson and Manset, 
2001), but the precise value adopted for modeling differs from model to model. 
In order to reduce predictions of different models to a common age it is adequate 
to extrapolate linearly, using partial derivatives that have been quoted in the 
literature (e.g. by Bahcall and Ulrich, 1988), some of which I reproduce in §4, 

Standard solar models have been computed with a variety of equations of 
state, opacity formulae and nuclear reaction rates. To assess the dependence of 
the neutrino fluxes on the values of the cross-sections that have been adopted, one 
can again use linear extrapolation with published partial derivatives; to assess the 
dependence on equation of state and opacity is more difficult, and usually requires 
numerical computation. However, Bahcall and his collaborators in several publi- 
cations (e.g. Bahcall and Ulrich, 1988) have explicitly addressed the sensitivity of 
the neutrino fluxes to a host of the uncertainties in the theory, and Christensen- 
Dalsgaard (1996) has considered the sensitivity of other properties of the models. 

As is well known, the fluxes of solar neutrinos measured at the Earth are 
substantially less than the corresponding theoretical values computed from stan- 
dard solar models under the assumption of no neutrino flavour transitions. The 
discrepancies have triggered an enormous amount of research into solar model- 
ing, resulting in keenly honed models which can be relied upon to reproduce the 
consequences of the physics that has been put into them. 

2 The principal nuclear reactions 

To appreciate how neutrino production depends on conditions in the solar core it 
is necessary to consider the balance of the thermonuclear reactions. I shall con- 
centrate on the principal neutrino-emitting reactions of the pp chain: 

^He(p, e'^i/)^He hep (2 x 10“^%) 

I 

p(p,e+j/)^H(p, 7 )^He(^He, 2 p)^He ppl (85%) 

I I 

p(pe ^He('*He, 7 )^Be(e , i')’'Li(p,^ He)^He ppll (15%) 



^Be(p,7)«B(e+î/)«Be*(4He)4He ppIII(0.02%) . 
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The figures in parentheses are approximately the percentage terminations via the 
corresponding branch of the chain at the centre of a standard model; about 0.25% 
of the deuterium is produced by the pep reaction. 

Because the temperature does not vary by a very large amount through the 
energy- generating core, it is adequate to approximate the temperature dependence 
of the reaction rates by power laws. All but the pep reaction and the spontaneous 
decays of and ^Be are two-body reactions, between species i and ji, say, whose 
rates per unit mass may be written rij RijXiXjpT^^^ , where Xk is the abun- 
dance of species k, p is density, and Rij and rjij are constants. For the reactions 
of interest I shall use atomic number unambiguously for the nuclear-species la- 
bel /c, for I shall not need to consider explicitly the details of the relatively rapid 
terminating branches of the chain. I shall use the label e for electrons. Since the 
abundant elements are almost fully ionized, the electron abundance is approx- 
imately ^(1 + X). The three-body pep reaction rate can therefore be written 
rpep^i^pep(l + ^)^V^"-^- 

Except in the outermost reaches of the core, where the reactions are slow, 
all the components of the chain of reactions are in balance, the overall reaction 
rate being determined by the slow pp and pep reactions which gradually reduce 
the abundance X of hydrogen fuel. Since the reaction rates decrease with radius, 
r, through the core, as also do p and T, hydrogen exhaustion decreases outwards, 
leaving a positive gradient of X (Figure 1). The proton capture by deuterium is 
essentially instantaneous, so the entire process creating ^He from H is controlled 
by the pp reaction. 




r/R 

Figure 1: Neutrino production in a standard solar model. The ^Be and neutrino fluxes, 4>i 
with i = 7 and 8 respectively, are proportional to Jq (j)idr, where r is a radius variable and R is 
the radius of the Sun. The pp neutrino production rate and, approximately, the thermal energy 
generation rate are proportional to </>pp; the pep production rate varies similarly (although not 
identically). The functions that are plotted are the normalized production rates Ax = ^>x Vx- 
The temperature T and the hydrogen abundance X (whose ordinate scale is on the right of the 
diagram) are included for comparison. 
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For a good simple first approximation to the sensitivities of the neutrino- 
producing reactions to conditions in the core it is adequate to balance the ^Re- 
producing reaction against ^He destruction by only the ppl chain, and similarly 
to balance the ^Be- producing reaction against ^Be destruction by only the ppR 
chain. Thus 



AuX^pT^^' ~ AssXipT^^^ ( 1 ) 

and, because Z <C 1, 

^34(1 - ~ ^ 7 e(l + , ( 2 ) 

from which the abundance X7 of ^Be can be calculated. Finally, the abundance 
Xs of ^B can be calculated from the p,7 reaction with ^Be: 



r^=AsX^p = A,,X,XpT^^^ . (3) 

In the solar core, r^pep ~ pn ~ 4, 7731 ~ 8, 7733 - 16, 7734 ~ 17,777e - and 
7771 ~ 13. Therefore the neutrino- producing reaction rates are given approximately 
by 



rpep ~ i?pep(l + X)X‘^p^T^ , 


(4) 


m-RuX^pT^ , 


(5) 


ri3 Risy/Rii/2Rs3X‘^pT‘^ , 


(6) 


rze ^ i?34 Vi?ll/2J^33(1 ~ X)XpT^^ , 


(7) 


R 34 R 7 I 1 Rn i-X 2„rr24.5 

R,. + ^ 


(8) 



The exact functional forms of the neutrino production rates per unit radius r in 
the Sun, 0pp = 47rr^/9rn, 0hep = 47rr^pri3, </)7 = Airr'^prje and </)8 = 47rr^pr8, 
are plotted in Figure 1. Because most of the terminations are via ppl, the thermal 
energy generation rate is essentially, although not exactly, proportional to <^ppQ, 
where Q is the total energy released per p-p reaction in converting 4 protons and 2 
electrons to ^He (it is not exactly proportional because there are losses associated 
with neutrino emission, and because the branching ratios between ppl and ppll 
and, much less importantly, between ppll and ppIII vary with temperature); the 
functional form of 0pep is similar, but not identical, to that of 0pp. 

In addition to the reactions I have discussed, there are also those of the CNO 
cycle. Like the ^B rate, the neutrino production rates are quite sensitive to tem- 
perature - they are roughly proportional to XZpT^^ - but the total neutrino flux 
is quite small, so I shall not discuss them explicitly. They must, of course, be taken 
into account when making detailed comparison between theory and measurement. 
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3 Helioseismology 

Some aspects of the structure of the Sun have been determined seismologically. 
Observable seismic modes are resonant acoustic waves, which sense, above all, the 
sound speed c. In addition, but to a lesser extent, they sense the density, p, through 
both buoyancy and the self-gravity of the waves. From accurate measurements of 
thousands of resonant frequencies it has been possible to determine certain prop- 
erties of both c(r) and p(r) with remarkable precision. (Deviations from spherical 
symmetry are tiny, and are hardly detectable; in any case they appear to be con- 
fined to the near-surface layers of the Sun, and are therefore unlikely to have a 
substantial impact on studies of neutrino production.) 

Given a stellar model (usually referred to as a reference model) whose struc- 
ture is pointwise close to that of the Sun, it is adequate to estimate the small 
deviation of the model from the Sun by linearizing the frequency differences 6ui of 
modes i in the differences Sc and Sp between the sound speed and density of the 
Sun and the corresponding values in the model. Thus Siy-i is the sum of weighted 
integrals of Sc and Jp, the weight functions Kd, Kp^ being different for differ- 
ent modes. By taking an appropriate linear combination of the data, Oi(ro)^^'i, 
it is possible to design corresponding weight functions Ac{r;ro) = and 

-^/ 9 (^;^o) = ^OLiKpi which make the frequency differences much easier to inter- 
pret. For example, Ac(r; ro) can be designed to be localized about r = ro with Ap 
everywhere small; if the coefficients ai are normalized to make Ac unimodular (i.e. 
having an integral of unity), then considered as a functional of the model, 

represents a localized average of Sc. Similarly one can construct a localized average 
of Sp, but that is harder because the density kernels are smaller than the sound- 
speed kernels (in the sense that they project, without the combinations aiSi/i 
being too seriously contaminated by data errors, into an accessible function space 
of lower dimension than that effectively spanned by {Kd})^ Alternatively, one can 
seek combinations of the data that generate functions Sc, Sp which, when added 
to the sound speed and density pm of the model, reproduce the observations 
precisely (or, more usefully, merely reproduce the observations to within the obser- 
vational errors; functions that satisfy the real, necessarily erroneous data precisely 
tend to have highly oscillatory components which are extremely sensitive to the 
data, and therefore to their errors, and which therefore have no useful physical 
meaning. It is more prudent to try to select functions that are robust to data er- 
rors yet which reproduce the data adequately. How that selection is made, formally, 
and often really, depends on prejudice). The resulting representations of the Sun 
constitute an extremely valuable data set with which to compare the theoretical 
solar models. 

To illustrate how closely a modern standard model represents the Sun, I 
illustrate in Figure 2 the relative deviations S(? j(? — 2Scfc and Sp/ p between the 
Sun and the standard model S of Christensen-Dalsgaard et al. (1996). The reason 
for considering and not c is that for a perfect gas, which roughly represents 
the solar material, is proportional to temperature (and is inversely proportional 
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Figure 2: Relative differences (a) 6c? jc? and (b) àpj p between the spherically averaged squared 
sound speed and density in the Sun (inferred seismologically) and the squared sound speed and 
density in the standard model of Christensen- Dalsgaard et al. (1996). The seismic data are spatial 
averages, weighted by localized kernels whose characteristic widths (approximately full widths 
at half maxima) are denoted by the horizontal bars; the vertical bars represent formal standard 
errors (which are correlated). The theoretical model was computed with Zq/Aq = 0.0245, and 
Yq ~ 0.27 (from Takata and Gough, 2001). 



to the mean molecular mass), so is perhaps a more natural variable to think 
about. The quantities plotted are averages of the pointwise differences, weighted by 
localized kernels each of whose characteristic spreads (roughly 1.04 of the full width 
at half maximum) is denoted by a horizontal bar. Each vertical bar represents ± 
one standard error arising from the random errors (assumed to be independent) in 
the frequency data. Care must be taken in interpreting small-scale undulations in 
any curve one might draw by eye through the data plotted in the figure, because 
the errors in those data are correlated. If one is interested in trends and does 
wish to imagine such a curve, then as a rule of thumb one might first multiply 
the errors by a factor 3, but even that should be taken with a large pinch of salt 
because the only published analysis of such error correlation (Gough, Sekii and 
Stark, 1996) was carried out on a somewhat different problem with a different (and 
rather smaller) frequency data set. 

A remarkable property of the functionals plotted in Figure 2 is that their val- 
ues are so small. That almost certainly implies that the reference model is quite 
a good representation of the Sun. However, the model differs from the Sun by 
a significant amount, because the deviations jc? and bpj p exceed the formal 
standard errors by a large factor, more than 10 in places. The situation is qualita- 
tively similar if other standard models are used as the reference, such as the model 
of Bahcall and Pinsonneault (1992), the model of Brun, Turck-Chièze and Zahn 
(1999), or that of Bahcall, Pinsonneault and Basu (2001). It is important that in 
the fullness of time the cause of these deviations be understood. 

The functionals plotted in Figure 2 each appear to have a component that 
varies gradually with r, and a prominent localized feature (in the values of à(? j(? 
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immediately beneath the base of the convection zone at rjR ~ 0.71, and also 
in the apparent derivative of àpj p in the same location if one does not heed my 
warning about error correlation). The large-scale component is probably a product 
of small errors that have a global impact on the model, such as in one or more of the 
nuclear reaction rates, or the chemical composition; these have a direct influence 
on the temperature or pressure, which are directly constrained by the equations of 
stellar structure through their spatial derivatives. The somewhat larger- amplitude 
localized feature is the product of an error in a local variable, whose derivative is 
not directly so constrained. It is probably a result of rotationally induced mixing 
in a thin layer beneath the convection zone called the tachocline. Material that 
would otherwise have been richer in helium than the convection zone as a result 
of gravitational settling is homogenized with the convection zone; consequently 
the mean molecular mass, and also the density, is decreased in the tachocline, and 
the sound speed is thereby increased (Elliott and Gough, 1999). Such a localized 
anomaly far out in the envelope has a negligible direct effect on the neutrino ffuxes. 
The smoothly varying component, however, is likely to have some repercussions, 
albeit perhaps small, on our inferences about conditions in the core. 

4 Neutrino fluxes 

In the days when modelers were trying to adjust their models to reproduce the 
measured neutrino fluxes without admitting neutrino transitions, it was essential 
to appreciate how the theoretical fluxes respond to perturbations to a model. That 
requires one to know where in the core neutrinos of different energy are produced, 
and how the fluxes vary with p, T and X. Much of that information is contained 
in Figure 1 and equations (4)-(8), together with the energy information contained 
in Figure 3. At present, however, with the many new parameters characterizing 
neutrino flavour transitions that need to be determined, reliable flne tuning of the 
models against neutrino data is hardly possible. Only the high-energy produc- 
tion rate, inferred from SuperKamiokande and the Sudbury Neutrino Observatory 
(SNO), is currently available (Ahmad ei al.^ 2001, 2002). Fortunately, that is con- 
sistent with the models, as I explain below. The lower-energy neutrinos contribute 
somewhat to the radiochemical detections by ^^^Cl at Homestake (Cleveland et al.^ 
1998) and, more importantly, to the ^^Ga detections by SAGE (Abdurashitov et 
al.^ 1999) and GALLEX (Hampel et al. 1999). By combining all the measurements, 
estimates of values of neutrino parameters such as those indicated in Figure 4 have 
been constructed on the basis of transitions between electron neutrinos and neu- 
trinos of some one other flavour, but we must await new neutrino data of different 
kinds before it is possible to determine all the parameters. 

One can estimate the neutrino production rates in the Sun by adjusting the 
fluxes of a standard model, using differences 6c^ and such as those illustrated 
in Figure 2, and the scaling laws (4)-(8). The results depend, of course, on the 
equation of state and the putative nuclear reaction rates that are adopted. To close 
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Figure 3: Spectrum of neutrinos from a standard solar model. The energy ranges in which the 
various solar neutrino detectors are sensitive are indicated above the graph (from J.N. Bahcall). 



the scaling it is necessary also to estimate X and T, which are related to and 
p by the equation of state but cannot be inferred directly in a radiative zone by 
seismology alone. How one estimates X depends on what is assumed about the 
state of the radiative zone. If only the seismically more precisely determined 
is used, then perhaps the closest approximation to the standard- model aissump- 
tions is to presume that the zone is in hydrostatic and thermal balance, with no 
macroscopic material motion to transport any of the products of the nuclear re- 
actions. These assumptions were used for this purpose by Gough and Kosovichev 
(1990): they are prescribed by the energy equation r~^d{r‘^ F) /dr = Anr'^pe, where 
F — —{AacT^/SKp)dT/dr is the radiative heat flux; here e is the nuclear heat- 
generation rate, k is the opacity, and a and c are respectively the radiation density 
constant and the speed of light. These constraints, together with the equation of 
state relating c^, p, T and X, enable one to determine X and T. Similar proce- 
dures have been adopted by Shibahashi (1993, 1999) and Shibahashi and Takata 
(1996). (It is important to realize that in determining the opacity k, it is neces- 
sary to specify the distribution of the heavy elements in the Sun; it is perhaps 
most natural to adopt the distribution of the standard reference model, although 
taking a uniform distribution has been more common.) If, on the other hand, the 
seismological determination of both p and are used, then it is more robust to 
estimate X by adding to the reference abundance, after due homogenization of 
the tachocline, a (small) constant whose value is determined by demanding that 
the luminosity be unchanged; that procedure also leaves essentially unchanged 
the amount of hydrogen consumed by nuclear reactions during the main-sequence 
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Detector 




Capture rates (snu) 


Flux 

(10®cm-2s-i) 


pep 


hep 


PP 


^Be 


^B 


CNO 


total 


^B 




BPB 


0.22 


0.04 


0 


1.15 


5.76 


0.42 


7.6±1.2 


5.05 




BTCZ 














7.0 


4.99 




JCD 


0.22 


0.04 


0 


1.13 


5.56 


0.41 


7.4 


4.87 




Sun 


0.22 


0.04 


0 


1.11 


5.43 


0.40 


7.2 


4.82 




Obs. 














2.56±0.23 




'■"Ga 


BPB 


2.8 


0.1 


69.7 


34.2 


12.1 


9.0 


128±8 


5.05 




BTCZ 














127 


4.99 




JCD 


2.8 


0.1 


69.8 


33.8 


11.7 


8.8 


127 


4.87 




Sun 


2.8 


0.1 


69.7 


33.2 


11.4 


8.6 


126 


4.82 




Obs. 














74.6±5 




SK+SNO 


Obs. 






5.44±0.99 



Table 1: Neutrino fluxes produced by various representations of the Sun, and the measured 
fluxes on Earth. BPB, BTCZ and JCD are the standard solar models of Bahcall, Pinsonneault 
and Basu (2001), Brun, Turck-Chièze and Zahn (1999) and Christensen-Dalsgaard et al. (1996); 
‘Sun’ represents the predictions of the seismic representation of the Sun presented by Gough and 
Scherrer (2002), with T and X determined from and p as described in the text, using the 
nuclear reaction rates adopted by Bahcall, Pinsonneault and Basu (2001). The ‘^^Cl observations 
are from Cleveland et al. (1998), SAGE and GALLEX from Abdurashitov et al. (1999) and 
Hampel et al. (1999) respectively, and SuperKamiokande and SNO from Pukuda et al. (2001) 
and Ahmad et ai (2001). The ages of the standard models from the zero- age main sequence (onset 
of hydrogen burning) are 4.57 Gy (BPB), probably 4.55 Gy (BTCZ) and 4.52 Gy (JCD). The 
fluxes can be reduced to a common age with the help of the logarithmic derivatives d\n^/d\ntQ 
quoted by Bahcall and Ulrich (1988); 0.00, -0.11, -0.07, 0.69, 1.28 and 1.13 for pep, hep, pp, ^Be, 
^B and CNO neutrinos respectively. 



evolution. The neutrino fluxes that result when the reaction rates of Bahcall, Pin- 
sonneault and Basu (2001) are adopted are listed in Table 1. Included in the table, 
for comparison, are the fluxes of the standard models of Bahcall, Pinsonneault and 
Basu (2001), Brun, Turck-Chièze and Zahn (2001) and Christensen-Dalsgaard et 
al (1996). The differences are, at present, less than the uncertainties in the infer- 
ence that can be drawn from the neutrino flux measurements. 

There are three kinds of neutrino flux measurements that have been under- 
taken to date, two radiochemical and two scattering. The first of the radiochemi- 
cal measurements (Homestake) involved neutrino capture by ^^Cl, the first results 
from which were announced nearly four decades ago (Davis, 1964); the others 
(SAGE and GALLEX) involve capture by ^^Ga. The scattering measurements use 
water: ordinary water in the case of Kamiokande (Fukuda et al, 1996), and subse- 
quently the larger SuperKamiokande (Fukuda et a/., 1998), and heavy water in the 
case of SNO. All the detectors are more sensitive to neutrinos of higher energy, but 
the thresholds differ (Figure 3). SAGE and GALLEX have the lowest threshold; 
they are the only detectors in operation that are sensitive to the pp neutrinos, and, 
because there are so many pp neutrinos, these neutrinos dominate the counts. The 
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Homestake threshold is too high to detect pp neutrinos; its counts are a combi- 
nation of principally pep, ^Be and neutrino captures, together with a small 
contribution from the CNO cycle, with ^B neutrinos dominating. The scattering 
detectors are sensitive essentially to only the ^B neutrinos (there is only a very 
small contribution from hep), and therefore provide a cleaner, and more sensitive, 
diagnostic. 

The only observational inference that can be compared directly with the 
inferred solar fluxes at present is that of Ahmad et al (2000, 2001). It concerns only 
the high-energy ^B neutrinos, and was drawn initially by combining the electron- 
scattering data from SuperKamiokande (Fukuda et al 2001), which provide a 
nonuniformly weighted average ^es of the fluxes of neutrinos of all flavours: $es = 
(where is the actual electron- neutrino flux at the detector, is 
the combined flux of /i and, presumably, r neutrinos which have been created by 
transitions from electron neutrinos, and /3 = 0.154 was calculated theoretically) 
with the charge-current data from SNO, which are sensitive to only $e- The data 
are plotted on a diagram in Figure 5; they intersect at = 1.75 ± 

0.14cm“^s“^ and = 3.69 ± 1.13 x 10^cm~^s“\ from which one obtains a total 
neutrino flux $ = -p = 5.44 ± 0.99cm“^s“^ at the Earth (actually at a 
distance of 1 AU from the Sun) . This result has been confirmed recently by Ahmad 
et al in their second paper, in which they report neutral-current data from SNO 
which are also sensitive to all neutrino flavours, although with a relative weighting 
which is somewhat different from the electron-scattering value. The newer SNO 
measurement is not as precise as the SuperKamiokande measurement because there 
has not yet been time enough to accumulate the data. 

The total neutrino flux $ corresponds to the production rate of electron neu- 
trinos in the Sun. The agreement with the theoretical predictions recorded in Table 
1 is a remarkable scientific achievement. It suggests very strongly that the solar 
models actually provide a good representation of the Sun. This is especially the 
case because the ^B-neutrino production rate is so very sensitive to temperature. 
One anticipates, therefore, that the theoretical predictions of the less- temperature- 
sensitive ^Be and particularly the pp and pep neutrinos are much more robust. 

5 Concluding remarks 

The remarkable agreement between the neutrino production rates predicted by 
the solar models and the direct neutrino-flux measurements is partly fortuitous. 
Although, as I have explained, the models are in quite good agreement with he- 
lioseismological inference, there are significant discrepancies. These need to be 
understood, and not merely explained away, before one can have confidence in our 
view of the state of the solar interior. 

It is not uncommon for solar modelers to compare their models with just the 
seismologically determined sound speed through the Sun, for that is the quantity 
we know the most accurately; sometimes only a particular simple feature of the 
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Active 

C1 + Ga + SNO 
+ SKsp(D/N) 



10 ^ 10 '^ 10 “ 10 ' 1 10 
tan^ S 

Figure 4: An MSW neutrino-oscillation solution for Am^ (difference between the (principal) 
squared mass eigenvalues associated with the electron-neutrino transitions) and the mixing angle 
(5, obtained from the standard solar model of Bahcall, Pinsonneault and Basu (2001) and the 
neutrino flux measurements (from Bahcall, Gonzalez-Garcia and Pëna-Garay, 2002). 



sound speed is considered, such as the inferred depth of the convection zone. 
Such scant comparison is dangerous, for it can give one a false impression of 
the faithfulness of the models. This was highlighted recently by Watanabe and 
Shibahashi (2001). By artificially varying the heavy-element abundance Z(r) in 
the core, they constructed a range of low-flux solar models, with ^ci varying from 
7.2 to 2.45 snu, ^ca from 126 to 101 snu, and from 4.8 to 1.3 x 10^cm“^s"^; 
models with higher fluxes could equally easily have been constructed. These were 
not standard models, evolved from zero age, but were seismic models in hydrostatic 
and thermal balance. Unlike a similar set of standard models, their sound speeds 
all agree with those inferred for the Sun. But their densities do not agree, so many 
of the models can be ruled out. However, there is some leeway, because density is 
not determined with high accuracy. Although Watanabe and Shibahashi did not 
consider how their Z profiles might have come about, their results provide a stark 
warning that processes currently ignored in standard models, such as putative weak 
macroscopic circulatory flow in the radiative zone, or material and possibly heat 
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Figure 5: Flux ^8 of neutrinos in the form of ji and r flavours, plotted against the flux of 
electron neutrinos. The charge-current data from SNO and the electron-scattering data 

from SuperKamiokande are depicted by bands of width two standard errors (±cr from the 
most likely value); their centres intersect at the spot. The diagonal bands bounded by solid lines 
inclined at -45° represent the total flux to within ±cr; the dashed lines represent the predictions 
of the standard solar model of Bahcall, Pinsonneault and Basu (2001) (after Ahmad et ai, 2001). 



transport by waves, could be present; such processes might influence the neutrino 
production rates. Additionally, some of the parameters influencing microscopic 
processes, such as nuclear reaction rates or opacity, could also be in error. 

As a concluding example of the latter, it is interesting to note that recent work 
by Grevesse and Sauvai (2002) and Prieto, Lambert and Asplund (2002) has led to 
substantial decreases in the spectroscopically estimated photospheric abundances 
of several heavy elements. This effectively reduces Z^fX^. What are the implica- 
tions? One cannot determine that without a detailed calculation, but one can make 
an estimate. Although the abundance revisions modify the relative abundances of 
the heavy elements, one can, as a first approximation, ignore that. From the stand- 
point of stellar structure, the effective change in opacity is roughly equivalent to 
reducing Z^jX^ by some 16 per cent. It is interesting that the change is broadly con- 
sistent with the spectroscopic error estimates, but it is rather greater than the pre- 
sumably overoptimistic estimates of Basu (1998), which are based on a helioseismo- 
logical analysis in which the OPAL opacity was accepted, with due warning, as also 
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was the OPAL equation of state, without comment, despite evidence that it is de- 
ficient at the helioseismological level of precision (Kosovichev et ai, 1992; Baturin 
et al, 2000). With the help of the partial derivatives dln^pep/d\n{Zs/Xs) ^ —0.1, 
ain$hep/^ln(Zs/As) - -0.2, d\n^7/dln{Zs/Xs) ^ 0.6, d\n^8/d\n{Zs/Xs) - 1.3 
and Mn$CNo/^ln(Zs/Xs) ~ 1.9, which were computed from the grid of theoreti- 
cal solar models of Gough and Novotny (1990), one can estimate that the fluxes 
^ci, ^Ga and ^8 from standard solar models are reduced by 1.4 snu, 8.3 snu, 
and 1.0 xl0^cm“^s“^, respectively. Seismic models with given c^(r), on the other 
hand, suggest that the reductions are 0.8 snu, 3.3 snu and 0.62 xl0^cm“^s“L In 
either case, the deviation from the values inferred by Ahmad et al (2001, 2002) 
are increased. Evidently, there remains more work to do to increase the reliability 
of the predictions of the neutrino production rates for the purpose of constraining 
neutrino-transition parameters more tightly. 
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Quantum Fields on dS and AdS 

Ugo Moschella 



Abstract. We give a short account of general approach to de Sitter and anti-de Sitter 
Quantum Field Theories that is based on the assumption of analyticity properties 
of the n-point correlation functions. We briefly discuss the status of the AdS/CFT 
and dS/CFT correspondences in this context. 



1 Introduction 

Einstein’s cosmological equations in vacuo are easily solved. When the cosmologi- 
cal constant A is positive one gets the de Sitter (dS) universe while for negative A 
the solution is the so-called anti-de Sitter (AdS) universe. The two-sided limiting 
case A = 0 is the Minkowski spacetime. The Minkowski, de Sitter and anti-de 
Sitter universes belong therefore to a single one-parameter family of general rela- 
tivistic spacetimes, the parameter being the cosmological constant, which, in turn, 
is proportional to the (constant) scalar curvature. While there are similarities be- 
tween these spacetimes, the most important being their maximal symmetry, they 
are however very different either geometrically or in their physical meaning. Let us 
start by discussing the geometry. The simplest way to visualize the de Sitter man- 
ifold is to think of it as a quadric embedded in a (d + l)-dimensional Minkowski 
spacetime Md+i (here indices are Lorentz tensor indices in the ambient space): 

dSd = {x°^ = -R^}. (1) 

The anti-de Sitter manifold can also be visualized as a quadric: 

AdSd = {x^"^ -x^"^ = R'^} (2) 

embedded in a linear space Ed-\-i endowed with the metric = diag(+l, —1, . . . , 
— 1, +1). The metric is induced on both dSd and AdSd by their respective ambient 
spaces: 



ds‘^ = [(dx^)^ — (dx^)^ — ... — (dx^)^] de Sitter, (3) 

ds^ = [(dx^)^ — (dx^)^ — ... — (dx^“^)^ + (dx^)^] anti-de Sitter . (4) 

These expressions provide solutions of the Einstein’s equations with suitable values 
of the cosmological constant. The physical picture described by the de Sitter space- 
time with the coordinate system plotted at the left side of Fig. 1 is that of a closed 
universe exponentially contracting and then re-expanding. The other coordinate 
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Figure 1: This figure represents the two-dimensional dS universe as an hyper- 
boloid embedded in a three-dimensional space, with closed and flat coordinates. 
The asymptotic cone, that is the light cone of the origin in the ambient space, 
plays a major role in our constructions. Turning the figure by 90° the same figure 
represents also the AdS manifold with covering and Poincare coordinates. The 
asymptotic cone in the ambient space can be regarded as a representation of the 
boundary at spacelike infinity of the AdS manifold and carries a natural action 
of the conformal group that is the basis for our treatment of the AdS-CFT corre- 
spondence. 



systems describes one half of dSd as an exponentially expanding flat universes. 
This interpretation lies at the basis of the central role that the de Sitter geometry 
plays in the inflationary paradigm, where the universe is thought to have under- 
gone a phase of exponential expansion in the very early epochs of its life (see e.g. 
[1]). Recent observations [2] have also pointed towards an accelerated expansion of 
the universe and the existence of a positive cosmological constant (or of some form 
of ’’dark energy”) now. The de Sitter physics may play again an important role. 
The anti-de Sitter universe has a completely different physical meaning. AdSa is 
an example of a non-globally hyperbolic spacetime in that it admits closed timelike 
curves and has a boundary at spacelike infinity. The existence of that boundary is 
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source of difficulties in quantizing fields on the AdS manifolds; on the other side 
it offers the geometrical basis for the famous AdS/CFT (Conformal Field Theory) 
correspondence [3]. 

2 Quantum Field Theory 

Quantum fields are systems with infinitely many degrees of freedom; Von Neu- 
mann’s theorem does not apply and there exist inequivalent “quantizations” (i.e. 
representations of the field algebra) of the same field theory. To illustrate this point 
let us briefly reconsider canonical quantization. The following remarks apply to 
general curved spacetimes but also in particular to the Minkowski manifold. Given 
a field equation, say the Klein-Gordon equation, one introduces a scalar product in 
the space of its solutions, and looks for a complete set of mode solutions Ui{x), The 
field 0 is then given the mode expansion (j){x) = [aiUi{x) + a\ui{x)] and canoni- 
cal quantization is achieved by assuming the canonical commutation relations and 
by choosing the corresponding vacuum. The procedure is inherently ambiguous: 
the previous mode expansion is based on an arbitrary choice of local coordinates. 
Different choices may produce inequivalent quantizations. Moreover, Bogolubov 
transformations also provide different vacua and again inequivalent quantizations. 
One needs therefore criteria to select, among the infinitely many inequivalent re- 
alizations of the (same) field algebra, those that are physically meaningful. This 
choice has striking consequences even in the case of free fields: the Hawking thermal 
radiation [4, 5], the Unruh effect [6] and the Gibbons and Hawking temperature 
[7] are the most famous examples. 

Let us take now the viewpoint of the general theory of quantized fields. A 
theory is characterized by specifying the n-point ” vacuum” expectation values of 
the field (the word expresses here a mathematical concept, the GNS vacuum): 
Wn(xi, . . . , Xn) = (ri, (j>{xi ) . . . 0(xn)f^). In the Minkowskian case the general re- 
quirements of quantum mechanics and of special relativity are translated into 
mathematical properties of the n-point functions as follows (see [8] for a classic 
treatment of this subject): 

1. Covariance: for any Poincaré transformation g 

Wn[gx-[,...gxn) =Wn(xi,...Xn). (5) 

2. Local Commutativity: for space-like separated 

yVn{Xl, . ..Xj,Xj+i,.. .,Xn)= Wn(xi, . ..Xj-^^Xj, . . . ,Xn). (6) 

3. Positive Definiteness: this property warrants a quantum theoretical interpreta- 
tion. 

These properties are not sufficient to select among the vacua; an insight into the 
dynamical properties of the theory is needed. In Wightman’s QFT one demands 
the existence of a complete set of states having positive or zero energy. This can 
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be reformulated as a property of the n-point functions by using their Fourier rep- 
resentation. By translation invariance Wn(xi , . . . ,Xn) = • • • ^Cn-i) where 

= Xi — XiJ^i, and the spectral condition becomes: 

4. Spectral Property: 

SUpp VFrz(çl,Ç2,--.,Çn-l) C F+ X . . . X F+ (7) 

where is the closed forward light-cone (in momentum space) . 

At this point the crucial step is to introduce the complexification of the 
Minkowski spacetime; the spectral property then implies that the n-point function 
Wn{xi ^ . . . , Xn) is the distributional boundary value of a function Wn( 2 :i, . . . , Zn) 
that is analytic in the tube 

T- = {{zu- ■■,Zn)e : lm{z,+i - Zj) e V+}. (8) 

(see [8]). To put our results in perspective it may be useful to review the role of 
the spectral condition within the usual Klein-Gordon QFT. A free field theory is 
characterized by the knowledge of the two-point function; in our case it must also 
solve the KG equation w.r.t. both variables: 

+m‘^)W{x,y) = 0, (Dy + m^) W(a;, 2 /) = 0. (9) 

By Fourier transforming the previous equations w.r.t. the different variable ^ = 
X — y one gets the KG equation in momentum space: (p^ + m^)W{p) = 0. The 
general solution can be written W{p) = a0{p^)6{p^ — m^) + b0{—p^)6[p^ — m^), 
and the spectral condition (7) sets 6 = 0. The inverse Fourier transform finally 
gives the two-point function of the field: 

W(a;, y) = W{x - y) = f e-^P-^^~y'>0(p°)S(p^ - m^) dp. (10) 

2(27t)2 J 

The structure of the two-point function is that of a superposition of the plane 
waves exp(— zp • x) and their complex conjugate exp(zp • p), taken respectively at 
the points x and p; the momentum variable p is integrated w.r.t. the Lorentz 
invariant measure dp = 0{p^)6{p‘^ — m?)dp satisfying the spectral condition [8]. 
As a consequence of the spectral condition (or by direct inspection of the integral 
at the r.h.s. of Eq. (10)) the distribution yV{x\,X 2 ) is the boundary value of a 
function W(zi, 2 : 2 ) analytic in the backward tube Tff . A further application of 
the spectral property shows that W(zi,Z 2 ) is maximally analytic, i.e. it can be 
analytically continued to the cut-domain A = x {{zi,Z 2 ) 6 x 

■ (^1 “ = a > 0 }. 

When the background is curved, it is generally impossible to characterize the 
physically relevant vacuum states as the fundamental states of the energy operator 
in the previous sense and the analogue of the spectral condition is lacking. This 
happens in de Sitter case. For the anti-de Sitter case the situation is better in 
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the sense that a global energy operator can be identified. On the other hand the 
situation is complicated by the lack of global hyperbolicity. 

We describe in the following a general framework for the study of dS and AdS 
QFT which mimics as closely as possible the Minkowskian case [9, 10, 11, 12]. 
The requirements of dS or AdS covariance, local commutativity and positive- 
definiteness are easily adapted from the Minkowski case, but, as in the Minkowski 
case, these properties are not sufficient to characterize a QFT: we need a spectral 
condition. We have found that a general spectral condition for dS and AdS QFT’s 
can be conveniently formulated in terms of the properties of the analytic continu- 
ation of the correlation functions in the complexified dS and AdS manifolds. Let 
us see how it works. 



3 de Sitter 

The complex de Sitter spacetime is identified with the complex hyperboloid 

= {z = x + iye : 2 °^ - 2 ^^ - ... - 2 <^^ = -R^}, (11) 

equivalently characterized as the set = {{x,y) e Md+i x M^+i '■ x^ — = 

— X • y = 0}. The relevant one-point tubes are obtainable by intersecting the 
complex dS manifold with the future and past tubes of the ambient complex 
Minkowski space (see Fig. 2): 

= T+ n dS^^\ Tf = t;- n dS^^\ (12) 

where Tf = Md+i + and are the future and past cone of the origin in 
the complex ambient space (see Fig. 1). dSdU'T^ contains the “euclidean 

sphere” Sd = {z ^ ~ 

Let us consider the de Sitter Klein-Gordon equation {dds = 0, where 

m is a mass parameter. For any ^ G = dV ^ (in the ambient space, see Fig. 
1)) one can introduce plane- wave solutions: 



(^) , Aec. (13) 

These waves are indeed holomorphic for z in or in . Physical values of the 
parameter s, corresponding to w? > 0, are given by A = ~ with either 
i/ G R or 1 / G ?'R with \u\ < We are now able to construct the two-point 

function of a dS KG field as a superposition of holomorphic plane waves, precisely 
as in Eq. (10): 

W^(2i,22) = / {Z ■ dfl~/{^~^), 



(14) 
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Figure 2 : Sections of the future tubes of the complex dS and AdS manifolds. In 
the dS case the real part x of a point x-{-iy oi the tube T-^ belongs to a de Sitter 
manifold of smaller radius {x^ = —[E? — o?)}, depending on on the length a of the 
imaginary part y: y^ = o? < E?. The imaginary part y is represented as a future 
directed tangent vector. The AdS future tube is characterized by its chirality. Now 
the real part of a point x iy oi the AdS future tube belongs to a quadric 
{x^ = E? whose radius is larger than E with no restriction on the length 

y^ = o? of the imaginary part y. 



where Zi G , Z 2 G . The integration is performed along any basis submani- 
fold 7 of the future cone C~^ w.r.t. a measure dfi^ induced by the invariant measure 
on the cone; actually the integrand in (14) is the restriction to 7 of a closed dif- 
ferential form. Wjy is a solution in both variables of the (complex) de Sitter Klein- 
Gordon equation which is analytic in the domain 

G 7^ , 2:2 G }. It is possible to show that it is actually a function of the single 
de Sitter invariant variable ( 2 : 1 — 22 )^ = —2E^—2zyZ2 and that, as in the Minkowski 
case, it enjoys the maximal analyticity property: it can be analytically continued in 
the “cut-domain” A = xdS^[\ {{zi,Z 2 ) G dS^f"^ xdS^[^ : ( 2:1 -^ 2 )^ = o; > 0}. 
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What is the physical basis for these analyticity properties in the de Sitter 
case? It turns out that they can be understood in terms of a thermal spectral 
condition [7, 10]. For free fields this is an equivalence and can be proven easily by 
using the maximal analyticity property of the two point function. Consider indeed 
the geodesical observer through the base point xq = (0, . . . , 0, i?) contained in the 
x^)-plane. This choice singles out the region U = {x ^ dSd : x^ > \x^\} 

together with its event horizons x^ = ±x^, x^ > 0. The region U is foliated by 
hyperbolic trajectories according to the following parametrization: x{r, = [x^ = 
\/i ?2 _ 7*2 ^ ^ ^ _ y/B? — cosh ^); r is the proper time of the 

observer. These curves are the orbits of the one-parameter group T of isometries 
of U\ x^ = T{t)[x{T,r^] = x{t + r, f), t G R. The complexified orbits of T have 
(2z7ri?)-periodicity in t and all their non- real points in the tubes and T~ . 

If xi and X 2 are arbitrary events in U, the maximal analyticity of 
implies that ^^[xi.x^) is a 2z7ri?-periodic function of t analytic in the strip {t G 
C, 0 < Im^ < 2i7rR} and satisfies the following K.M.S. relation at temperature T 
= l/27Ti?: 

= lim < € R, (15) 

e -^0 + 

where (Q, 0(xi)0(x2)r2) = Wiy{xi,X 2 ) ($2, 0(x2)</>(xi)Q) = Wy{x 2 ,xi), The 
“energy operator” S associated with the observer is obtained by the spectral 
decomposition of the unitary representation of the time translation group T in 
the Hilbert space H of the theory, namely which yields 

^ The previous K.M.S. condition is then equivalent to the fact 

that energy measurements performed by an observer at rest at the origin on states 
localized in U are exponentially damped by a factor exp(— 27ri?o;) in the range of 
negative energies. In the limit of flat spacetime R ^ oo this factor will kill all 
negative energies, so that one recovers the usual spectral condition of positivity of 
the energy. 

In the general interacting case one cannot expect maximal analyticity proper- 
ties to hold for n-point functions (n > 2). Let us consider the case of a theory that 
can be characterized by a sequence of Wightman distributions Wn {xi, . . . ,Xn) = 
{Q,(j)(xi) . . .(j) (xn) on dS^^ . The first three properties the Wightman functions 
should satisfy are literal transcriptions to the dS spacetime of the corresponding 
Minkowskian properties: 1. de Sitter Covariance, 2. Local Commutativity and 3. 
Positive Definiteness. As regards the spectral condition we cannot directly formu- 
late it in terms of an energy operator but we can propose analyticity properties 
similar to those equivalent the usual spectral condition in the Minkowskian case. 
Let us introduce the open set 

{z = {zi,. . . ,z„); Zj = Xj + iyj e yj+i - yj G V+ , l<j<n-l}. (16) 

Since is a domain of which is moreover a tuboid (see [10] for a precise 

definition) above we can state the following axiom: 
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4. (Weak Spectral Condition). For each n the distribution >Vn(xi, . . . ,Xn) is the 
boundary value of an analytic function Wn{zi, . . . ,Zn), defined in the tuboid 
of the complex manifold 

There are obviously other possible choices. But it is also true that our choice 
allows a (not so easy as in the free field case) proof of the thermal interpretation of 
the theory. Another feature that this approach assures the possibility of the tran- 
sition to the euclidean sphere. In the case of the two-point functions these axioms 
are equivalent to the maximal analyticity property (see [11] for more details). 

4 Anti-de Sitter 

The complex anti-de Sitter spacetime is identified with the complex quadric 

= {z = x^iye 4+1 : - . . . + (17) 

equivalently characterized as the set AdS^^ = {{x,y) E F^d+i x ^d+i • x"^ - 
y‘^ = R‘^ ^ X ■ y = 0}. The Euclidean submanifold of AdS^f^ is defined by putting 
real, x^ > 0. By changing z^ to iz^ for 0 < /u < d, AdS^f^ 
becomes a complex sphere in which has the same homotopy type as the real 

unit sphere. In particular AdS^^ is simply connected for d > 2. However the real 
manifold AdSd admits a nontrivial covering space AdSd whose “physical” role is 
to suppress the time- loops of pure AdS. This is visualized in the easiest way by 
using “covering par ametrizat ion” x = x(r, r, e) (see Fig. 1) : 

{ x° = sin r 

= re^ z = l,...,d-l (18) 

x^ = y/r‘^ + R‘^ cos r 

with e^ = e^^ -h . . . + = 1 and r > 0. For each fixed value of r, the cor- 
responding “slice” Cr = AdSd H {x^^ + x^^ = of AdSd is a Lorentz 

manifold Si x Sd- 2 - The relevant coverings AdSd and Cr are obtained by unfold- 
ing the 27 t— periodic coordinate r. The notion of space-like separation can now 
be extended and both these coverings admit a global causal ordering while in the 
pure AdS case this is possible only locally. 

To introduce the relevant one-point tubes [12, 13, 14] let us consider the 
infinitesimal generator of the (covering of) the subgroup of rotations in the (0, d)- 
plane Mod] such transformations can be interpreted as time translation relative to 
the covering coordinate system represented in Fig. (1). The set Ci = {A A“^ : 
AE50o(2,d-l)}is the subset of all the elements of the AdS Lie algebra Ç that 
are conjugate to ^1+ denote the cone generated in Ç by Ci , i.e. C+ = Up>o P • 

The future tuboid and the past tuboid Zf of AdS^^ are then introduced as 
follows: 



Z+ = {exp(rM) x : M eCi, x E AdSd, r E C+} = (Zf )* . (19) 
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where C-t_ = {(^ e C : ImC > 0} = — C_. The tuboids (and their universal 
coverings Z^) are domains of holomorphy. Z^ are equivalently charaterized by 
their chirality (see Fig. 2): 

Z^ = {z = x-hiy e -y‘^ ^ 1, {x - y) = 0, > 0, e(z) = ±}, (20) 

where for z = x iy G have defined e{z) = y^x^ — x^y^. The form 

(20) shows that Zf are open subsets of AdS^^ with empty intersection with each 

other, while their definition shows that they are connected. As it happens in the 

(S) 

Minkowski and dS cases, the “Euclidean” AdS spacetime AdS^ is contained in 
AdSd U .^1+ U Zi_. 

An anti-de Sftterian QFT theory is characterized (as before) by a set of AdS 
invariant rz-point functions {Wn(a:i, . . . , x^)} that satisfy the local commutativ- 
ity in the (global) sense of the covering AdSd^ The positive definiteness property 
allows the reconstruction of a Hilbert space 7i carrying a continuous unitary rep- 
resentation g U{g) of (the covering of) the AdS group Gq = *SOo(2,d — 1). 
To every element M of the Lie algebra G one can associate the one-parameter 
subgroup t exp^M of and a self-adjoint operator M acting in H such that 
exp itM = U (exp tM) for all ^ G R. In the AdS case the a spectral condition can 
be formulated as follows: 

Spectral Condition: for every M G the operator M has its spectrum contained 
in R+. 

Let us consider now in particular a free field characterized by a two-point 
function W{xi,X 2 ). The above spectrum condition gives rise to the following 
normal analyticity condition: the two-point function yV(xi,X 2 ) is the boundary 
value of a function W (z\,Z 2 ) which is holomorphic in the domain Z^ x Z^ of 

(c) (c) 

AdS^ X AdS . AdS invariance and the normal analyticity condition together 
imply again the maximal analyticity property: V^{zi,Z 2 ) can be continued in the 
covering A of the cut-domain A = AdS^^^ x AdS^\ {(zi,Z 2 ) ^ AdS^^ x AdS^^ : 
C — ' ^ 2 ) IE? G [—1, 1]}. For special theories which are periodic in time W is in 

fact analytic in A itself. The simplest example of the previous structure is again 
given by the AdS Klein-Gordon fields. The corresponding two-point functions are 
expressed in terms of generalized Legendre functions [15] 

( d\ poo 

+ (21) 

V V 2 / 

by the following formula: 

p — iTld / rj 1 \ 

= «,+^(0 = ^r (^) C = {ZUZ2), 

( 22 ) 
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where the parameter A is related to the maiss by the formula m? = A(A + d — 1 ) = 
^2 _ (^~^) normalization can be obtained by imposing the local Hadamard 

condition, that gives 



hd+iW 



(2A + d- l)r(A + d- 1) 
f(d) r(A + i) 



(23) 



Theories with i/ > —1 are acceptable in the sense that they satisfy all the axioms 
including the posit ive^definiteness property. The previous two-point functions are 
holomorphic in A or A according to whether A is an integer or not. It is worthwhile 
to point out that also in the anti-de Sitter case we are able to provide a spectral 
representation of the two-point functions in terms of AdS plane waves, that is 
closely similar to that produced in Eq. (14). However, the construction of such 
representation is more tricky in the AdS case. We limit ourself to writing the 
formula without comments in the two-dimensional integer (i.e. periodical) case: 



W{zi,Z2) = Cif [21 • ^1'’ K • 22 ] ‘ (24) 

where z\ G and Z 2 G and the integration cycle 7 ( 2 : 1 , Z 2 ) that belongs to a 
certain class of cycles taken in the complexification of the asymptotic cone C. 

To study interacting theory we do need to generalize the previous results and 
introduce certain n-point future tuboids and past tuboids 7^“. The n-point 
functions are then assumed to be the the boundary value, in the sense of tempered 
distributions, of functions holomorphic and of tempered growth in such tubes. 
The construction is however rather elaborate an we refer the reader to [ 12 ] for the 
relevant details. 



5 Correspondences 

The AdS-CFT correspondence [3] is one of the most important advances in the 
recent panorama of theoretical high energy physics. Our framework allows for 
a very simple understanding of how, given an AdS QFT satisfying the general 
properties briefly discussed in the previous section, one can extract a Conformal 
Field Theory living on the boundary. 

The heuristic idea is as follows: given an AdS QFT let us consider its restric- 
tion to the Lorentzian manifold Cr. Alternatively one may consider its restriction 
to a Poincaré slice H^ = {x G AdSd : = exp(n)} (see Fig. 1). The problem 

is that, generally speaking, distributions cannot be restricted to lower dimensional 
manifolds. It is the AdS spectral condition that in our framework guarantees that 
the above restrictions are well-defined [12, 13]. In particular one can show that 
the restriction of an AdS QFT to the branes H^ defines a one parameter family 
(the parameter being i;) of Poincaré invariant QFT’s satisfying all the Wightman 
axioms. If it is possible to perform the limit v 00 , this automatically gives rise 
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to an acceptable Poincaré invariant Wightman’s QFT on the boundary, identified 
here with a copy of a Minkowski spacetime. It remains to show the conformal 
invariance of the so-obtained QFT. 

Similarly, it is n^aningful to consider the restrictions of the distributions >Vn 
to the submanifolds These theories are obviously invariant under the product 
of the time translation group w.r.t. the r parameter times the orthogonal group 
SO{d— 1) of space transformations acting on the sphere Sd-2- Moreover, the local 
commutativity property of the original theory on AdS d implies that the restricted 
Æeories also satisfy local commutativity in the sense of the spacetime manifolds 
Cy. Finally, the spectrum of the generator of time-translations is positive. 

The limit r — ^ cxd, if it is possible, generates a QFT on the boundary, identified 
here with the covering of the compactification of the Minkowski spacetime (this 
manifold is the right place where to study conformal invariance; on the contrary 
the conformal group acts on the Minkowski spacetime only infinitesimally). The 
invariance group of the limiting QFT is by construction M x SO{d — 1) and it 
remains the task of showing its conformal invariance. 

To actually perform the previous limit some control on the power-decrease 
at infinity of the Wight man functions is needed; with the help of the coordinates 
(18) we assume the following dimensional boundary condition at infinity: a scalar 
QFT on AdSd is said to be of asymptotic dimension A if the following limits exist 
in the sense of distributions: 

lira (r)”^W„(a;i(r,ri,ei), . . . , a;„(r, r„, e„)) = W^((ri,ei), . . . , (r„,en)) (25) 

r— >-(-oo 

In a basic work by Liischer and Mack [16] the concept of global conformal 
invariance in Minkowskian QFT has been associated with the general framework 
of QFT on the covering of a quadratic cone with signature (+,+,—,•'* 5 ~) 
one dimension more, which is precisely the asymptotic cone of the AdS quadric 

C 2 ,d-i = {v = (^^5 • • • 7 77 ^^^); 77 ^“^^ + = 0} (see Fig. 1) . The 

covering coordinates 77 = rj{r^ r, e) of the asymptotic cone are now 

{ 77 ^ = r sin r 

rf — re^ z = l,...,d-l (26) 

_ r cos r 

with e^^-h. . .Te*^^ = 1 and r > 0. The parametrization (26) allows one to introduce 
the covering C 2 ,d-i and of C 24 by again unfolding the 27 t— periodic coordinate r. 
Let us now consider a general QFT on AdSd whose Wightman functions Wn satisfy 
AdS invariance together with the other properties described in the previous section 
and the dimensional boundary condition. In view of Eq. (25), by using the covering 
coordinates of the cone we can construct the following set of ti— point distributions 
OnC 2 ,d-T 

WniVi,---,Vn) = (ri •••r„)“'^W^((Ti,ei),...,(T„,e„)). 



(27) 
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The set of distributions satisfy the required positive-definiteness condition for 
defining a QFT on 624-1 because the positivity of the QFT’s on the spacetimes 
Cr is preserved in the limit. It follows from the reconstruction procedure [8] that 
the set of distributions define a quantum field 0{t]) on C24-1. 0{t]) enjoys 
the local commutativity and the spectral condition as those of the Liischer-Mack 
field theories [16]. The important result on which our AdS/CFT correspondence 
is based is that invariance of the AdS n— point functions, together with the other 
assumption we made, implies the conformal invariance of the field 0{r])] more 
precisely, the Wightman functions VV^ of this field are invariant under the action 
on C24-1 of the conformal group [16]: 



W„{gvi,---,9Vn) (28) 

for all conformal transformations g. A part of this invariance is trivial in view of 
how the limiting procedure is constructed: it is the invariance under the rotations 
in the (0, d)— plane (i.e. the translations in the time variables r) and the invariance 
under the spatial orthogonal group of the subspace of variables (77^, . . . , 77^) (acting 
on the sphere Sd-i)- For the proof of the nontrivial part we refer the reader to 
[13]. We have therefore a general AdS/CFT correspondence for QFT’s: 

(j){x) 0(77) (29) 

between a scalar quantum field (/>(x) on AdSd+i satisfying our axioms and a confor- 
mally invariant local field 0{rj) on C24-1 enjoying the Liischer-Mack axioms; the 
degree of homogeneity (dimension) A of 0{r]) is equal to the asymptotic dimension 
of the AdS field ^{X). The relationship of this approach with the standard way 
of understanding the AdS/CFT correspondence [17] has been elucidated in [18]. 

It has been also proposed a duality between a quantum theory on de Sitter 
space and a euclidean theory on its boundary [19] which should encode the de 
Sitterian quantum gravity degrees of freedom [19]. We have shown [20] that one 
can associate with a general (scalar) de Sitter quantum field theory satisfying a 
dimensional boundary condition a conformal Euclidean field theory on the bound- 
ary, again identified with a copy of the cone asymptotic to the de Sitter manifold 
in the embedding spacetime (see Fig. 1). However, the field theory that one gets 
this way does not in general satisfy refiection positivity, which is required to admit 
a physical interpretation. Therefore, the proposed construction can have in general 
a technical interest but no obvious holographic physical interpretation seems to 
be available. 
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6 Conclusions 

We have shown that there exists a common way to undertake the study of 
Minkowski, de Sitter and anti-de Sitter QFT’s that is based on the complexifi- 
cations of those manifolds and the study of complex domains associated with the 
conditions on the spectra of the possible energy operators. In this context we are 
able to characterize general structural properties of the theories but the method is 
also useful to perform concrete calculations. The next step would be to test these 
ideas in perturbation theory, with important possible consequences for high energy 
physics and cosmology. 
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The New Cosmology: 

Mid-term Report Card for Inflation 

Michael S. Turner 



Abstract. Inflation has been the driving idea in cosmology for two decades and 
is a pillar of the New Cosmology. The inflationary paradigm has now passed its 
first round of significant tests, with two of its three basics predictions confirmed 
at about the 10% level (spatially flat Universe and density perturbations produced 
from quantum fluctuations with |n — 1| ~ 0(0.1)). The Inflationary Paradigm has 
some of the truth. Over the next decade the precision of these tests, most of which 
involve measurements of CMB anisotropy and polarization, will improve 30 fold 
or more(!), testing inflation more sharply and possibly elucidating the underly- 
ing cause. Especially important in this regard is detecting the inflation-produced 
gravitational waves, either directly or through their CMB polarization signature. 
While inflation has by no means been verified, its successes have raised the bar for 
competitor theories: Any alternative must feature the two hallmarks of inflation: 
superluminal expansion and entropy production. 



1 Introduction 

Today, cosmology has a comprehensive and self-consistent mathematical model 
that accounts for all the observed features of the Universe. However, unlike its 
predecessor, the hot big-bang model or standard cosmology, there is no standard 
name. For now, I will refer to it as the New Cosmology [1]. 

The New Cosmology incorporates the hot big-bang model (every successor 
theory eats its predecessor whole!), as well as an early inflationary epoch and 
the present stage of accelerated expansion. The New Cosmology includes a full 
accounting of the shape and composition of the Universe today: spatially flat, 
with the critical density distributed as follows: baryons (4 zb 1%), nonbaryonic 
dark matter (29 ± 4%) and dark energy (67 ± 6%) [2]. 

While we can now say that massive neutrinos account for between 0.1% and 
5% [3] of the nonbaryonic dark matter - comparable to the 0.5% contributed by 
stars - most of the dark matter is thought to be slowly moving elementary particles 
(cold dark matter), with the leading candidates being the axion and neutralino [5]. 

Dark energy is the name I use for the mysterious “energy stuff” that dom- 
inates the mass-energy budget and whose large negative pressure {p < —p/2) is 
responsible for the accelerated expansion [4] . Dark energy could be as “mundane” 
as the energy of the quantum vacuum, or something so exotic that it has thus far 
eluded the minds of the most creative theorists [6] . Dark energy is truly one of the 
great mysteries in all of science. 
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The standard cosmology can properly claim to give a reliable and tested 
description of the Universe from a fraction of a second onward. While the New 
Cosmology cannot yet make a similar claim for extending our understanding back 
to an early, inflationary epoch, inflation is nonetheless a pillar of the New Cos- 
mology, and, as I will describe, precision CMB observations are beginning to test 
its basic predictions (so far, so good). This is a remarkable development that be- 
lies the prediction made by some astronomers (and the fear of many inflationary 
theorists) that inflation would never be tested. 



2 The Inflationctry Paradigm 

Few ideas in theoretical physics have had as much impact as inflation. Introduced 
by Alan Guth in a 1980 paper [7] that explained the cosmological virtues of expo- 
nential expansion as well as why his version of it (based upon a symmetry-breaking 
phase transition) did not work(!), inflation has been the driving idea in cosmology 
since. 

The virtues of inflation trace to its ability to lessen the dependence of the 
present state of the Universe upon initial conditions (though quantifying how suc- 
cessful it is at achieving this is difficult [8]). In addition to the predictions discussed 
below, it explains the high degree of homogeneity and isotropy observed in our 
Hubble volume, the heat of the big bang, and the absence of super heavy magnetic 
monopoles. 

In a flurry of activity during the early 1980s the basic inflation paradigm [9] 
was worked out [10, 11, 12, 13]. In brief, essentially all models of inflation can be 
described in terms of the classical evolution of a single scalar field (dubbed the 
inflaton) initially displaced from the minimum of its potential [U((/>)]: 



0 + -h U' 




0 

87t r V{<t))d(l) 

^ J ~vW 



( 1 ) 

( 2 ) 



where N is the number of e- folds of inflation, prime denotes d/d0 and dot denotes 
d/dt. Models of inflation differ only in the form of their scalar potential and their 
motivations. 

Inflation occurs while the inflaton is slowly rolling: during the “slow roll”, 
the nearly constant potential energy density associated with the inflaton drives a 
nearly exponential expansion (“superluminal expansion”). At the end of the slow 
roll, the inflaton is left oscillating about its potential energy minimum; these oscil- 
lations correspond to a condensate of zero-momentum 0 particles. Their decay (by 
coherent and/or incoherent processes) produces lighter particles which thermalize; 
this “reheats” the Universe and exponentially increases its entropy (per comov- 
ing volume). (Note: The Universe need not be hot before inflation.) The standard 
hot big bang phase commences thereafter, with the tremendous entropy release 
accounting its heat. 
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Quantum fluctuations in 0 (A0 ~ Hl2'ir) give rise to energy density fluc- 
tuations {ôp = which ultimately result in inflation’s signature adiabatic 

density perturbations. They are of astrophysical interest because of their phys- 
ical size is stretched from the subatomic to the astrophysical during the pe- 
riod of exponential expansion. Similarly, quantum fluctuations in the metric itself 
{h ^ Sg ^ H/mp\) give rise to a spectrum of gravitational waves with wave- 
lengths of astrophysical interest (fluctuations in other light scalar fields can result 
in particle production or isocurvature perturbations). 

Inflation is a paradigm and not a model because there is no agreed upon 
identity for the inflaton field. Many models exist, with the inflaton playing a variety 
of roles, from inducing electroweak symmetry breaking to breaking supersymmetry 
to the compactification of extra dimension(s), and the energy scale of inflation 
ranging from 1 TeV to 10^® GeV [14]. While there is no standard model, each model 
makes its own set of precise predictions. 

That being said, the inflationary paradigm does make a set of generic pre- 
dictions that can be used to test - and even falsify - it. The baisic predictions of 
inflation are; 

• Spatially flat Universe 

• Not quite scale- invariant, almost power-law spectrum of Gaussian, adiabatic 
density perturbations 

• Not quite scale- invariant, almost power-law spectrum of gravitational waves 

The first two of these predictions are now being tested by precision measurements 
of cosmic microwave background (CMB) anisotropy on sub-degree angular scales, 
with early results consistent with inflation. The third prediction, which may hold 
the key to definitively testing inflation and shedding light upon the underlying 
cause, is inspiring a new generation of very challenging experiments. 

3 Specific models make specific predictions [15] 

While all models of inflation are based upon speculative physics that goes beyond 
(usually well beyond) the standard model of particle physics, each model makes 
predictions that sharpen the basic predictions of the paradigm. The reason for 
this is simple: The physics is speculative but the rules are well defined - the semi- 
classical evolution of a scalar field. 

While models have been constructed where two or more fields evolve during 
inflation, or where the kinetic term for the inflaton is not canonical, I will discuss 
the predictions for single-field inflation, assuming that the kinetic term is canonical. 
More complicated models also make definite predictions, though the relationship 
of observable quantities to the potential(s) can be different. 

The prediction of a flat Universe is not tied to the form of the potential; 
only that inflation lasts sufficiently long to explain the homogeneity and isotropy. 
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Generally speaking, the duration of inflation far exceeds that needed to produce 
a flat Universe (although models of inflation have been tuned to give less than 
1). The inflationary prediction of a flat Universe corresponds to density parameter 
0.0 = 1, where Oq is the ratio of the total matter/energy density to the critical 
density. 

The other two predictions of inflation involve the metric perturbations that 
arise from the quantum fluctuations associated with deSitter space: adiabatic den- 
sity (or scalar) perturbations from fluctuations in the inflaton potential energy and 
gravity waves (tensor perturbations) from fluctuations in the metric itself. Their 
amplitude and variation with scale do depend upon the properties of the scalar- 
field potential, and this is the basis for the belief that observations may someday 
pin down the underlying model of inflation [16] 

Both the scalar and tensor perturbations have an approximately - but not 
exactly - scale-invariant spectrum. This fact fundamentally traces to the approxi- 
mate deSitter space associated with the inflationary phase. In physical terms, that 
means that the dimensionless strain amplitude of gravity waves when they re- 
enter the horizon after inflation is almost independent of scale: huoR ~ H/mp\ ~ 
For density perturbations, it is the amplitude of the density perturba- 
tion at horizon crossing that is almost independent of scale: {Sp/p)uop ^ /(t> ^ 

Because the post-horizon-crossing evolution of both density pertur- 
bations and gravity waves depends upon the time elapsed since horizon crossing - 
which is longer for longer wavelength perturbations - the spectra of density pertur- 
bations and gravity waves is not scale invariant today. The Fourier components of 
both scalar and tensor perturbations are approximately power-law in wavenumber 
k of the fluctuations; at horizon crossing: 

(<5p/p)hor oc 
huoR oc 



where n {nr) are the power-law indices for scalar (tensor) perturbations. 

Both density perturbations and gravity waves lead to fluctuations in the 
temperature and polarization of the CMB across the sky. Predictions for observable 
quantities can be expressed in terms of the inflationary potential. For instance, the 
scalar (S) and the tensor contributions (T) to the CMB quadrupole anisotropy are 



~ 4n ~ ' (mpiV'/V)^ 
T = - 0.56(y/mpi^) 



( 3 ) 

( 4 ) 



where Cf and Cj are the contribution of scalar and tensor perturbations to the 
variance of the I = 2 multipole amplitude ((| 02 mP) = C 2 +C 2 ) and V is the value 
of the inflationary potential when the scale k = Hq (present horizon scale) crossed 
the Hubble radius during inflation. The numerical coefficients in these expressions 
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depend slightly upon the composition of the Universe; the numbers shown are for 
ÜM = 0.35 and Oa - 0.65 [17]. 

The power-law indices that characterize the scalar and gravity- wave spectra 
can be expressed in terms of the inflationary potential and its derivatives; 



J_ / mpy' V mpi / mp|V Y 

SttV V J 47t [ V J 

_ f mp,V' \ 

8ir [ V J 



(5) 

( 6 ) 



For typical inflationary potentials the deviations from scale invariance are expected 
to be of order 10%: |n - 1| ~ 0(0.1) and nr ~ —0(0.1) [18]. 

Note that the ratio between the gravity-wave contribution to the CMB 
quadrupole anisotropy and the density-perturbation contribution to the CMB 
quadrupole anisotropy provides a consistency test if the tensor spectral index can 
be measured: 

T/S = -5tit (7) 

for Qm — 0.35 and Qa = 0.65 [17]. 

The fluctuation spectra are not exact power laws (except in the case of power- 
law inflation) ; variations in the power-law indices with k may be expressed in terms 
of higher derivatives of V((/>) [19]. 



dn 

d\nk 



dfiT 

dink 



l_ (rnplYlA ( mp,Vi \ 

327t 2 I, v; ) [ K ) 

1 /rnpi^Vj') fmpiK'Y 

^ 87 t 2 14 J [ K J 

^ v "^ J ay 

1 l'mp\V'\^ 1 

3^ V y V u ; “3^ 





(8) 

(9) 



Finally, once the form of the density perturbations and the composition of 
the Universe are specified, the initial conditions for the formation of structure in 
the Universe are set. Inflation specifies the form of the density perturbations and 
the composition of the Universe is known - 29% cold dark matter (with a dash 
of it in massive neutrinos), 4% baryons and 66% dark energy. This means that 
the large (and growing) body of observational data that probes the formation of 
structure in the Universe provides an additional test of the inflationary paradigm. 
In particular, they have significant potential to determine n and test the Gaussian 
nature of the underlying density perturbations. 
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4 Mid-term report C 2 ird and future expectations 

The three basic predictions of inflation blossom into a series of 9 testable conse- 
quences, most of which can also probe the underlying inflationary model. 

The cosmic microwave background will play the leading in testing these pre- 
dictions because its anisotropy and polarization are such a clean probe of the 
gravity waves and density perturbations produced by inflation. The predictions of 
inflationary models can be stated in terms of the predicted variances of the multi- 
pole amplitudes that characterize CMB anisotropy and polarization [20]; because 
there are only 2£-hl multipole amplitudes for multipole even an ideal experiment 
is limited by sample size (often referred to as cosmic variance) in estimating the 
true variance. 

The discovery of CMB anisotropy on angular scales of order 10 degrees by the 
COBE Differential Microwave Radiometer in 1992 [21] opened the door for testing 
inflation. The measurement of CMB anisotropy on sub-degree angular scales by 
balloon-borne and ground-based experiments allowed the serious testing to begin 
(most of the features and probative power lies in the anisotropy on sub-degree 
scales) [20]. Presently, it is the results of the BOOMERanG, DASI, CBI, Maxima, 
CAT, and Archeops experiments that define the state-of-the-art in our knowledge 
of the CMB angular power spectrum. Soon, the MAP all-sky satellite-based exper- 
iment will report its first results and really clean up the anisotropy power spectrum 
out to ^ ~ 900. The ESA/NASA Planck satellite is scheduled for launch in 2007; it 
should provide the definitive power spectrum out to ^ ~ 3000 as well as significant 
results on polarization. 

The following are the nine predictions, the present status report and prospects 
for the future. A Table summary is given at the end of this section. 

1. Spatial flatness 

This prediction is the most straightforward; it simply implies Qq = 1.0 ± 0.00001. 
The ‘±0.00001’ arises because fluctuations on the current Hubble scale will led 
to the knowable part of the Universe appearing slightly open (underdensity) or 
slightly closed (overdensity). 

Now: Measurements of the position of the first acoustic peak in the CMB power 
spectrum indicate that Oq = 1-03 ± 0.03, consistent with spatial flatness [22]. 
Further, a direct accounting of the amount of matter and energy leads to an 
independent, though less precise, determination of the total mass/energy that is 
reassuringly consistent with spatial flatness, = 1 ± 0.25. 

Future: The precision testing of this prediction lies with MAP, Planck and other 
future CMB experiments that will probe the smallest angular scales and fix the 
positions of the acoustic peaks with high precision. Since the positions of the 
acoustic peaks also depend upon the composition of the Universe, information 
from large-scale structure measurements that constrains the matter density is also 
critical. The ultimate precision to which Uq can be probed will likely be in the 
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range açi^ ~ 0.005 — 0.001 and will depend upon how well the composition can be 
fixed by other independent methods [23, 24]. 

2. Density Perturbations from Quantum Fluctuations 

The most striking prediction of inflation may be that the density perturbations 
that seeded structure on scales of millions of light years and larger arose from 
quantum fluctuations on subatomic scales; if this is true, the CMB is a picture 
of quantum noise(!). This prediction breaks down into 5 separate testable conse- 
quences. 

a. Acoustic peaks: Since the density perturbations are impressed at very early 
times (<C sec), by the time of last scattering {t ~ 400,000yrs) all perturbations 
are purely “growing mode,” leading to a synchronizing of the perturbations on 
different scales. Some modes were caught at maximum compression or rarefaction, 
leading to the prediction of a series of “acoustic peaks” in the angular (multipole) 
power spectrum [20]. 

Now: Current CMB experiments have probed the angular power spectrum out to 
^ ~ 2000; at least three and perhaps as many five peaks have been resolved [22]. 
There is no question that the acoustic peaks associated adiabatic perturbations 
have been seen. 

Future: Planck and other future CMB experiments that probe the smallest angular 
scales should resolve six or more acoustic peaks (the damping of anisotropy on 
very small angular scales due to the finite thickness of the last scattering surface 
exponentially diminishes the amplitude of successive peaks). These experiments 
will also be able to separate out any small admixture of isocurvature perturbations 
(which could arise during inflation or later on). 

b. Gaussianity: The inflation-produced density perturbations arise from quantum 
fluctuations in a very weakly coupled (essentially free) scalar field and hence should 
be Gaussian to a high degree of precision. The CMB has the greatest power to 
test this prediction since it probes the density perturbations when they were linear 
(nonCaussianity automatically develops when gravity drives the amplitude of the 
perturbations into the nonlinear regime). 

Now: There is no evidence for nonCaussianity. 

Future: The all-sky CMB mapping experiments (MAP and Planck) which are de- 
signed to control systematics have the greatest the potential to test this prediction. 
Important to quantifying how well the Gaussianity prediction is faring is the con- 
struction of a realistic model with nonCaussianity to compare with. 

c. Abnost scale- invariant spectrum: Inflation predicts an almost, but not quite 
scale invariant spectrum [10]. Typically, the deviations from scale invariance are 
of order 10%: \n — 1| ~ 0(0.1), with indeterminate sign [18]. This is not only a 
key test of inflation, but a window to the underlying physics. 

Now: Measurements of CMB anisotropy at small angular scales can probe this 
prediction most sharply. Current observations are beginning to significantly con- 
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strain n: n = 1.05 ± 0.09 [22], consistent with inflation but not precise enough to 
test the inflationary prediction sharply. 

Future: The Planck Mission should be able to determine n to a precision of ±0.008 

[23] . An ideal CMB experiment could achieve a precision of almost ten times better 

[24] ; whether or not a future ground-based and space-based experiment with such 
capability is carried out remains to be seen. 

d. Almost power- law: Inflation predicts that the power-law index varies with scale, 
with dn/d\nk ~ ±10“^ for many models and ten times larger for some [19]. 

Now: Current CMB measurements, dn/d\nk = —0.02 ±0.04 [25], are consistent 
with the inflationary prediction, but lack the precision to measure a variation. 
Future: It is likely that the CMB offers is the most powerful probe, with Planck 
projected to do a factor of ten better than the current limit [23]; an ideal CMB 
experiment might reach the expected level of variation in n. 

e. CDM scenario for structure formation: Nearly scale- invariant, Gaussian adia- 
batic density perturbations is one of the two pillars of the highly successful CDM 
paradigm for structure formation (the other being slowly moving, weakly inter- 
acting dark matter particles). As such, tests of the CDM paradigm are tests of 
inflation. In particular, the study of large-scale can constrain n and the abundance 
of rare objects, such as clusters, can be used to test the Gaussianity prediction. 

3. Gravity Waves from Quantum Fluctuations 

a. Amplitude: The amplitude of the gravity waves is directly proportional to the 
energy scale of inflation and does not depend upon the shape of the potential. Un- 
fortunately, theory gives very little advice about the amplitude, and unlike density 
perturbations which are de rigueur (to seed structure formation) the Universe can 
live just fine without gravity waves. Huterer and I have explored what can be said 
about T/S without regard to specific potentials by reformulating the equations 
of motion for inflation. We concluded that if n > 0.9 and the potential has no 
unnatural features (the key here is the definition of unnatural ) , T/S must exceed 
10-3 [26]. 

Now: There is no evidence for inflation-produced gravity waves; current limits from 
the CMB imply T/S < 0{1) [27]. 

Future: There are two basic ways to get at the gravity waves. The first is to use their 
signature in CMB anisotropy and polarization. Unlike the series of acoustic peaks 
associated with density perturbations that extend out to £ ^ 3000, gravity waves 
produce a rather featureless angular power spectrum that dies off at around £ ^ 
100. Because the precision of even a perfect CMB experiment is limited by sampling 
variance, CMB anisotropy alone can separate the gravity- wave signature only if 
T/S > 0.1 [28]. However, the polarization signature is not so limited: gravity waves 
excite a mode of polarization (B-mode or curl) that density perturbations cannot 
[29]. The only limitations to detecting gravity waves through their polarization 
signature are sensitivity and foregrounds. It appears that a dedicated polarization 
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Table 1: TESTING INFLATION: YEAR 2002 SUMMARY 



Prediction 


Mid-Term 


Expectation 


1. Flatness 

a. Oo = 1 


Oo = 1.03 ±0.03 


±0.001 


2. Density perturbations 

a. Adiabatic: acoustic peaks 

b. Gaussian 

c. |n-l| -(9(0.1) 

d. dn/d\nk — (9(10~^) 

e. CDM Paradigm 


at least 3 

no evidence against 
n- 1.05 zb 0.09 
dn/dlnk = —0.02 zb 0.04 
many successes 


6 or 7 
?? 

±0.001 

±10"^ 


3. Gravity waves 

a. Amplitude 

b. Tlx — 0.1) 

c. Consistency: T/S = —Stit 


T/S < 0(1) 


T/S > 10-3 
±0.03 ?? 



experiment might be able to achieve a sensitivity to T/S as good as 10“^ [30] 
(recall, that if n > 0.9 and the potential is “natural”, T/S is expected to be this 
large). By comparison, Planck is expected to achieve T / S 0.02 [23]. 

The CMB is only sensitive to the longest wavelength gravity waves, A ~ 
10^^ cm to 10^^ cm; however, the spectrum extends to wavelengths as short as 1 km. 
Direct detection of inflation-produced gravity waves will be very challenging as 
Ügw is at most 10~^^ at the mHz-kHz frequencies where the planned detectors will 
operate: the projected sensitivities of LIGO and LISA to a stochastic background 
of gravitational waves are 10“^^ and 10~^^ respectively [31], far short of what is 
expected. 

b. Spectrum: The predicted spectral index for one-field models is related to the 

amplitude, tit = which allows a consistency test if tit can be measured. The 

combination of direct and CMB detections could measure nr accurately: owing to 
the long lever-arm between the Hz frequencies of gravity-wave detectors and the 
very long wavelength of gravity waves that affect the CMB, a factor of 2 precision 
in each measurement would result in a few percent measurement of nr- 

c. Consistency: For single-field inflation models rtr = ~ ^ ? as noted above. 

5 How much of the truth does inflation have and who is 0? 

A key to testing inflation and getting at the underlying physics is the detection of 
gravitational waves. Not only is the amplitude of the gravitational waves directly 
proportional to the energy scale of inflation, T (x V/mpi"^, but this third prediction 
of inflation is an undeniable smoking gun for inflation. 
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Let me elaborate and comment. Some would claim that the first two predic- 
tions of infiat ion - fiat Universe and scale- invariant density perturbations - have 
long been considered features of any sensible cosmological model. Certainly both 
were discussed well before infiation (e.g., flatness by Peebles and Dicke [32] and 
scale-invariant density perturbations by Harrison and Zel’dovich [33]). Thus, there 
is some truth to this point of view. However, because inflation provides a mech- 
anism for actually producing a flat Universe with almost scale-invariant density 
perturbations it also makes a prediction about how close to scale invariant they 
should be. As I have emphasized, an important test of inflation is its prediction 
that |n — 1| ~ 0(0.1) and not n — I, 

In addition to providing a smoking gun for inflation, the detection of gravity 
waves instantly reveals the epoch and energy scale of inflation: 

2x10“^^ sec 

‘ " VtJs 

= 3 X 10“^mpi {T/Sy^* ~ 3 X 10‘® GeV(T/5)^/^ 



Further, the values of T/5 and n — 1 can be used to solve for the inflationary 
potential and its first two derivatives: 



V 



V' 



V" 



1.8T TTlpi 



Sn T 



47T 



5 

3T' 

(..-!) + 55 



VImpy 



( 10 ) 

( 11 ) 

( 12 ) 



where the numerical factors depend upon the composition of the Universe and are 
given for fiM = 0.35 and IIa == 0.65 [17]. Measurements of T, S, and {n — 1) can 
be used to shed light on the underlying inflaton potential. 



6 If it smells like a rose, is it a rose? 

Inflation has some of the truth, but it has by no means been verified to the degree 
that we can safely include it as part of “a new standard cosmology.” Further, 
while inflation seemed like a bold and expansive step forward twenty years ago, 
by today’s standards it seems more modest: It only explains the isotropy and 
homogeneity on a temporary basis (albeit an exponentially long temporary basis) ; 
It does not address the question of the initial singularity (or why there is a universe 
at all); and It still stands disconnected from string theory or any other fundamental 
theory of elementary particle physics. For all of these reasons we should be open to 
new ideas. At the moment there are several intriguing ideas - e.g., ekpyrotic/cyclic 
model [34] and variable speed of light theories [35] - but none have reached the 
point of making sharp predictions like inflation. 
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The cosmological data that we have seen to date have not only provided the 
first tests of inflation, but they have also raised the bar for its alternatives. In fact, 
a die-hard inflationist might well argue that any theory that is able to account for 
the observable facts without appealing to initial conditions will have to look very 
much like inflation. 

To be specific, the existence of adiabatic density perturbations on superhori- 
zon scales at the time of last scattering and the enormous heat of the big bang 
(quantified by the entropy of 10^° within our current Hubble volume) require any 
scenario that does not simply appeal to initial conditions to incorporate superlu- 
minal expansion (accelerated expansion or accelerated contraction) and entropy 
production, the two hallmarks of inflation [36, 37]. 

The necessity of each is easy to explain. First, to causally create a density 
perturbation in an expanding Universe requires that a sub- bubble- scale sized re- 
gion at very early times {I < H~^) must grow to a size much, much greater than 
the Hubble length at last scattering. This translates into a kinematic requirement: 
There must be an epoch where the scale factor grows faster than t. (More precisely 
and more generally, the condition is that R and R have the same sign, which is 
called super luminal expansion.) 

Today’s Hubble volume contains an entropy of about 10^® (largely in the form 
of photons and neutrinos). At early times, the entropy within a Hubble volume is 
at most (mpi/T)^ (equality pertaining for a radiation dominated phase). In the 
absence of massive entropy production the superluminal phase will produce very 
large, but very empty (low entropy) density perturbations. Entropy production 
is needed to create enough photons and eventually enough matter to account for 
the 10^^ baryons (and even more dark matter particles) per galaxy. (This same 
argument could have been worded in terms of producing a smooth region of the 
Universe corresponding to our present Hubble volume.) 

To summarize, an alternative to inflation must involve superluminal expan- 
sion and mcLSsive entropy production; in addition, if it can be described in 4- 
dimensions by a the evolution of a single degree of freedom (which could be called 
a scalar field) , an inflation hegemonist might be tempted to call it a realization of 
inflation, rather than a fundamentally new paradigm. She might have a good case, 
since inflation is still a paradigm in search of a model. 

7 Concluding remarks 

The chance that any major new idea in theoretical physics has something to do 
with the truth is very small; in cosmology the odds are even longer. Nonetheless, 
inflation has passed its first round of major tests and it does appear that inflation 
has some of the truth. 

Precision measurements of CMB anisotropy on sub-degree angular scales by 
ground-based and balloon-borne experiments have led the way in testing inflation. 
Over the next decade or so, results from satellite-borne experiments with even 
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greater precision and control of systematics - MAP, Planck and possibly a new 
satellite mission dedicated to polarization - will sharpen the tests by more than 
30 fold and the crucial gravity-wave signature of inflation may be detected. 

The challenge and importance of verifying the third prediction of inflation 
cannot be overstated. Discovery of inflation- produced gravity waves is a smoking 
gun signature of inflation, immediately identifies the epoch and scale of inflation 
and reveals information about the inflationary potential. If its spectral index can 
be measured, the consistency of the single-field inflation paradigm can be checked. 

While it is too early to bring on the champagne - plenty could still go wrong 
- the bar has been raised for any new competitor to inflation. Accounting for 
the superhorizon-sized adiabatic density perturbations whose existence has been 
confirmed by detection of acoustic peaks in the CMB angular power spectrum 
requires both superluminal expansion and entropy, the two hallmarks of inflation. 

While the successes of inflation are gratifying, we are very far from under- 
standing the underlying cause. Single-field models require very weakly coupled 
scalar fields (dimensionless couplings of order 10“^^ or so) and the potentials seem 
hopelessly contrived. It could be that our 4-dimensional scalar- field formulation of 
inflation makes things look unnatural, and that when viewed in higher dimensions 
or when the scalar field is properly interpreted, the inflaton and its potential will 
make perfectly good sense. 

In addition to the “who is (j) question”, there are other questions: the de- 
tails of reheating, the possibility of quantum fluctuations during inflation excite 
isocurvature modes or produce particle relics, the search for a connection between 
planck scale physics and CMB anisotropy (the modes seen on the CMB sky today 
were subPlanckian in size during inflation), and the multiverse. 

Perhaps the most daunting challenge facing inflation is convincing even our- 
selves that inflation really happened. Even if inflation passes all of its tests and 
its gravity waves are detected both by their CMB signature and directly by Hz- 
frequency laser interferometers, will we be able to say, with the same confidence 
that use in discussing big-bang nucleosynthesis or the quark/hadron transition, 
that the Universe really did inflate (rather than inflation provides a consistent 
way of describing what we see today)? Our confidence in big-bang nucleosynthesis 
derives from laboratory-based nuclear physics and in the quark/hadron transi- 
tion from computer simulations and accelerator experiments. Will we be to find a 
laboratory crosscheck for inflation that will gives us similar confidence? 

Finally, there is a downside to inflation: If inflation does succeed in making 
the present state of the Universe insensitive to its initial state, it will place a veil 
between us the beginning of the Universe. 
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Tests of Gravity on Astrophysical Scales 
and Variation of the Constants 

Jean-Philippe Uzan 



Abstract. We summarize a series of observational tests of the law of gravity on large 
astrophysical scales. These tests account for testing both the Poisson equation (in- 
verse square law) using weak lensing and the Einstein equivalence principle through 
the test of the constancy of the constants of Nature. We emphasize the need to test 
general relativity on cosmological scales in light of the cosmological constant prob- 
lem and of recent observational claims concerning the variation of fine structure 
constant. 



1 Testing gravity on large scales 

Cosmological observations have lead to the “evidence” that the universe is un- 
dergoing a late time acceleration [1]. The interpretation of this conclusion is still 
a matter of debate. At the lowest level, the conclusion to be reached is that the 
Friedmann equations for a universe filled only with normal matter (i.e. radiation 
and dust) cannot explain the current data. There are different ways of facing this 
fact. Either one can conclude that the interpretation of the cosmological data is 
not correct (i.e. we do not accept the evidence for the acceleration of the universe, 
see Ref. [1] for a recent critical review and Ref. [2] for an interesting proposal) or 
one tries to introduce new degrees of freedom in the cosmological model at hand. 
In this latter case, these extra degrees of freedom, often referred to as dark energy^ 
can be introduced as a new kind of matter (including a cosmological constant, 
quintessence,. . . ) or as a new property of gravity. 

In the first, and most common, approach one keeps general relativity un- 
changed while introducing new forms of gravitating matter (dark matter to ex- 
plain galaxy rotation curves and matter with negative pressure to explain the 
acceleration of the universe) beyond the standard model of particle physics. But, 
one is still left with the cosmological constant problem [3] (why is the density of 
vacuum energy expected from particle physics so small?) unsolved as well as the 
time coincidence problem (why does the dark energy starts dominating today?). 

Another route is to allow for modifications of gravity in the infrared, i.e. 
at large distances, in order to explain why a large vacuum energy density does 
not give rise to a large curvature. Such alternatives have recently received more 
attentions in the framework of braneworld models in which the standard model 
fields are localized on a 3-dimensional brane embedded in a higher dimensional 
spacetime. All higher dimensional models predict that gravity should depart from 
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its standard Newton behavior on small scales and up to now this scale is con- 
strained to be smaller than 100 — 500 /xm [4]. Among braneworlds models, a class 
of models have also the feature to allow for deviation from 4-dimensional Einstein 
gravity on large scales. This is for example the case of some multi- brane models [5], 
multigravity [6], brane induced gravity [7] or simulated gravity [8]. In such mod- 
els, gravity is not mediated only by a massless graviton (hence breaking one of the 
hypothesis of Weinberg’s theorem [3]), so that one expects to have deviations from 
Newton inverse square law on large scales. From a cosmological point of view, it 
was shown that a common feature of these modifications of gravity was to lead to 
an accelerated expansion of the universe without introducing matter with negative 
pressure [9, 10]. Other phenomenological approaches such including a modification 
of inertia [11] or a non-local modification of gravity [12] have also been proposed. 

Prom a more theoretical point of view, string theory seems to be the only 
known promising framework that can reconcile quantum mechanics and gravity, 
even though it is not yet fully defined beyond the perturbative level. One defini- 
tive prediction drawn for the low-energy effective action is the existence of extra- 
dimensions and of a scalar field, the dilaton, that couples to matter [13] and whose 
expectation value determines the string coupling constant. It follows that the low- 
energy coupling constants are in fact dynamical quantities. When the dilaton is 
massless (or almost) it leads to 3 effects: (i) a scalar admixture of a scalar com- 
ponent inducing deviations from general relativity in gravitational effects, (ii) a 
variation of the couplings and (iii) a violation of the weak equivalence principle. If 
the dilaton were to remain massless, it would induce a violation of the universality 
of free fall seven order of magnitudes larger than the actual bounds [14]. To avoid 
such a catastrophe, it has either to suddenly take a mass larger than a few meV 
(so that gravity will be compatible with Einstein gravity above a millimeter) or 
decouple from matter [14]. Both mechanisms have different implication concerning 
the variation of the coupling constants. As a conclusion, testing for the constancy 
of constants may reveal the existence of further gravitational fields or of extra- 
dimensions and it opens an observational window on the low-energy limit of string 
theory and on the stabilization of the dilaton and extra-dimensions. 

All these considerations are driving to develop new tests of gravity on cosmo- 
logical scales that will probe both the inverse square law behavior and the Einstein 
equivalence principle. Up to now, the observational status is the following. 

1. On Solar System size, the Newton law as well as the universality of free 
fall are tested with a very good accuracy (see Fig. 1 for a summary of the 
constraints on a Yukawa type fifth force and Ref. [26]) . 

2. On galactic scales, there are a number of astrophysical constraints that a 
successful modification of gravity will have to face (see e.g. Ref. [15] where 
most of the constraints have been discussed in details), specially if the con- 
sidered modification aims at explaining the galaxy rotation curves. Note that 
this is in general not the case of the models trying to explain the acceleration 
of the universe by a modification of gravity on large scales. It is not clear 
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yet that a unique model will be able to explain both the flat rotation curves 
and the acceleration of the universe, as it is not clear that a unified model of 
dark matter-energy exists. If the modification of gravity has some relevance 
on galactic scales then it will have to explain the flattening of the rotation 
curves and to account for the dependence of the galaxy rotation curve on the 
luminosity of the galaxy. This dependence is encapsuled in the Tully-Fischer 
relation relating the luminosity of a spiral galaxy to its asymptotic rotation 
velocity L oc with a ^ A. This sets severe constraints on theories in 
which the cross-over scale with standard gravity is fixed (see e.g. Ref [16]) 
and favored theories where this cross-over scale depends on the considered 
galaxy. The compatibility between X-ray and strong lensing observations 
tends to show that the Poisson equation holds up to roughly 2 Mpc [17]. 

3. On cosmological scales, there is at the moment no direct tests of gravity. 
Indeed the growth of cosmological structure can put some constraints but 
usually the observations entangle the properties of the matter and gravity. 
Recently, it has been claimed that observations of quasars absorption spectra 
were in favor of a lower fine structure constant in the past. If real, this will 
be an indication of the break-down of the Einstein equivalence principle. 

In this proceedings, we first recall in Section 2 a proposal the inverse square law on 
large (astrophysical) scales that I recently proposed with Francis Bernardeau [18]. 
Then I review the tests of the constancy of the constant of nature (Section 3) and 
give a summary of the extensive review [19]. In particular, I focus on the con- 
straints on the fine structure constant and discuss some recent claims concerning 
its possible variation. 

2 Testing Newton’s law on large scales 

The deflection of light by a gravitational potential, first observed during the Solar 
eclipse of the 29th May 1919 by the expeditions lead respectively by Eddington 
and Cottingham in Principia island, and Davidson and Crommelin in the Nordeste 
of Brazil, was at the heart of the first test of general relativity. 

This test checks that the deflection of light by the Sun gravitational field is 
the one predicted by the theory of general relativity. Indexed, to be conclusive, these 
experiments have to assume that the mass of the Sun is known. What is really 
tested is thus the consistency between the mass of the Sun and the gravitational 
field it creates, i.e. the weak field limit of the Einstein equations. Recently, we 
proposed that such a test can be extended to astrophysical scales where there is, 
at the moment, no test of the gravitational law. The bending of light by a matter 
distribution is intrinsically a relativistic effect which enables to test gravity at 
extragalactic scales. 

To sketch, the effect of a modification of gravity on large scales and the 
method we proposed, let us assume that the background spacetime can be de- 
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scribed by a Priedmann-Lemaître spacetime. As long as we are dealing with sub- 
horizon scales, we can take the metric to be of the form, assuming flat spatial 
sections, 

ds^ - -(1 - 2^)dt^ + a^(l -h 2$) (dx^ -h (1) 

where t is the cosmic time, a{t) the scale factor, x the comoving radial coordinate 
and dn^ the unit solid angle. In a Newtonian theory of gravity, ^ is the Newtonian 
potential ^ n determined by the Poisson equation 

= AwGpa'^S ( 2 ) 

where G is the Newton constant and A the three dimensional Laplacian in co- 
moving coordinates, p the background energy density and 8 = 8p/ p is the density 
contrast. If the Newton law is violated above a given scale then we have to change 
Eq. (2) and the force between two masses distant of r derives from $ = ^^/(r/rs) 
where f{x) 1 when x ^ 1. For instance, choosing f{x) = 1/(1 + x) will de- 
scribe a gravity which is four dimensional on small scales and that becomes flve 
dimensional of large scales. Using Eq. (2) it leads, with r — ax, to 



#(x) 





(3) 



If the Poisson equation is satisfled then the power spectra of the density contrast 
and gravitational potential have to satisfy 

V^ibNik) = {^T^Gpa^Ÿ Ps{k), (4) 



that is 

VA'b{k)= {iwGpa^Ÿ Vs{k)fc{krs)'^ (5) 

where fc can be expressed in terms of the Fourier transform of / (see flg. 1). 

A way to test the validity of the Newton law is thus to test the validity of 
Eq. (2) which is possible if one can measure 6 and ^ independently. 

Prom galaxy catalogs, one can extract a measurement of the two-point cor- 
relation function of the cosmic density field, ^(r) = (^(0)<^(r)). It leads to a mea- 
surement of 

On the other hand, weak lensing surveys offer a novel and independent win- 
dow on the large scale structures. Weak lensing measurements are based on the 
detection of coherent shape distortions of background galaxies due to the large 
scale gravitational tidal forces. The deformation of a light bundle is characterized 
by the amplification matrix 



A 



1- K-Ji 

72 



72 

1 - K -h 7i 



(7) 
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Figure 1: Summary of the constraints on a Yukawa type deviation to the Newton’s 
law on Solar System scales. The gravitational potential is parameterized by V — 
(1 + VNewton- From Fischbach and Tamadge [20]. 

7, the shear, can be measured from galaxy ellipticities [21] from which one can 
reconstruct the convergence n. The convergence is generated by the cumulative 
effect of large scale structures along the line of sight [22, 23]. In a direction 6 it 
reads, 

«W = / 9ix)^2^{T>ix)0,x)<^X (8) 

Jo 

where V is the comoving angular diameter distance and where A2 is the two 
dimensional Laplacian in the plane perpendicular to the line of sight. The function 
g depends on the radial distribution of the sources by 

k,{6) is a function on the celestial sphere that can be decomposed, in the small 
angle approximation, in Fourier modes 

«( 1 ) = / " (10) 

so that, using the expression (8) and the definition of the angular power spectrum 
of n as {k{\)k,*{V)) = (27r)“^7^/^(/)J^^^(l - 1'), we obtain 




( 11 ) 
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It clearly appears that cosmic shear measurements are a direct probe of the gravi- 
tational potential. By comparing weak lensing and galaxy catalogs measurements, 
we have a test of the Poisson equation. 

So far cosmic shear has been detected up to a scale of about 2h~^ Mpc [21] {h 
being the Hubble constant in units of lOOkm/s/Mpc). This method is in principle 
applicable to any scale up to 100 h~^ Mpc. With galaxy surveys such as SDSS that 
will measure Vs up to 500 Mpc [24] it makes possible comparisons of Vs and 
V^^ on cosmological scales therefore enabling direct tests of the gravity law up to 
roughly 100^“^ Mpc. 

To illustrate this discrepancy we consider the growth of the perturbations on 
scales from ten to some hundreds of Mpc in a modified gravity scenario. For that 
purpose, we assume that the standard behavior of the scale factor is recovered 
(i.e. we have the standard Friedmann equations). Note that this is the case in 
most of the scenarios in which such modifications of gravity occur and in which 
the evolution of the scale factor is modified in the same way £is by a cosmological 
constant. 

In the weak field limit and for a pressureless fiuid, taking advantage of the 
fact that the relation between 6 and ^ is local in Fourier space, 6 can be shown to 
satisfy the evolution equation 



4 - - \h^ m U 4 = 0 ( 12 ) 

where a dot refers to a derivative with respect to H = à/ a is the Hubble 
parameter. 

Looking for a growing mode as (5 ^ ex in a Einstein-de Sitter matter 

dominated universe (0 = 1, H = 2/3t) gives a growing solution such that z/+(/c) ^ 
2/3 for kxs > 1 and 0 for kxg 1. At large scales the fluctuations 

stop growing mainly because gravity becomes weaker and weaker. On Fig. 2, we 
depict the numerical integration of the growing mode and the resulting power 
spectrum assuming that f{x) = 1/(1 -h x). Note that since Xg and the comoving 
horizon respectively scale as and y/a (in an Einstein-de Sitter universe) Xg 
enters the horizon at about 760 Mpc if Tg = 50 Mpc. Thus, all the modes 
with comoving wavelengths smaller than 760 Mpc feel the modified law of 
gravity only when they are subhorizon. As a consequence, it is well justified for 
all the observable modes (i.e. up to 500 Mpc) to consider the effect of the 
non-Newtonian gravity in the subhorizon regime only. 

Let us emphasize that, on Fig. 2, the deviation from the standard behavior 
of the matter power spectrum is model dependent (it depends in particular on 
the cosmological parameters), but that the discrepancy between the matter and 
gravitational potential Laplacian power spectra is a direct signature of a modified 
law of gravity. Note that biasing mechanisms (i.e. the fact that galaxies do not 
necessarily trace faithfully the matter field) cannot be a way to evade this test since 
bias has been found to have no significant scale dependence at such scales [25] . 
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wavelength, 



Figure 2: In a theory in which gravity switches from a standard four dimensional 
gravity to a five dimensional gravity above a crossover scale of Vs = 50h~^ Mpe, 
there are different cosmological implications concerning the growth of cosmologi- 
cal perturbations. Since gravity becomes weaker on large scales, fluctuations stop 
growing [left panel], [right panel] It implies that the density contrast power spec- 
trum (thick line) differs from the standard one (thin line) but, more important, 
from the gravitational potential power spectrum (dash line). 
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3 Testing gravity with the constants of Nature 

At the heart of general relativity is the Einstein equivalence principle that states 
that (i) the weak equivalence principle (also referred to as the universality of free 
fall) is valid, i.e. that any electrically neutral test body with negligible gravitational 
self-energy falls identically, independently of its mass and chemical composition, 
(ii) the local Lorentz invariance holds, i.e. that the result of any non-gravitational 
experiment is independent of the freely falling referential in which it is performed, 
and (iii) the local position invariance also holds, i.e. that the result of any non- 
gravitational experiment is independent of where and when it is performed. 

If the Einstein equivalence principle is valid then gravity can be described 
as the consequence of a curved spacetime and is a metric theory of gravity, an 
example of which are general relativity and the Brans-Dicke theory. This state- 
ment is not a “theorem” but there are a lot of indications to back it up [26] . Note 
that superstring theory violates the Einstein equivalence principle since it intro- 
duces additional fields (e.g. dilaton, moduli. . . ) that have gravitational-strength 
couplings which violate of the weak equivalence principle. A time variation of a 
fundamental constant is in contradiction with Einstein equivalence principle since 
it violates the local position invariance. All new interactions that appear in the 
extension of standard physics implies extra scalar or vector fields and thus an ex- 
pected violation of the weak equivalence principle, the only exception being metric 
theories such as the class of tensor-scalar theories of gravitation in which the dila- 
ton couples universally to all fields and in which one can have a time variation of 
gravitational constant without a violation of the weak equivalence principle. 

By constraining the variation of the fundamental constant on astrophysical 
scales, we thus test a central hypothesis on which general relativity is built. 

3.1 Constants that may vary 

The question of the constancy of the constants of physics was probably first ad- 
dressed by Dirac [27] who expressed, in his “Large Numbers hypothesis”, the 
opinion that very large (or small) dimensionless universal constants cannot be 
pure mathematical numbers and must not occur in the basic laws of physics. He 
suggested, on the basis of this numerological principle, that these large numbers 
should rather be considered as variable parameters characterizing the state of the 
universe. Dirac noticed a series of numerical coincidences such as between the 
relative magnitude of electrostatic and gravitational forces between a proton and 
an electron and^ representing the age of the universe in atomic time. 

He concluded that these coincidences can be “explained” if one assumes that the 
gravitational constant, G, varies with time and scales as the inverse of the cosmic 
time. 

^ Ho is the value of the Hubble constant today, e the charge of the electron, rrie its mass and 
c the speed of light. 
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Indeed, there was no theory backing-up this hypothesis but let us stress some 
interesting points. First, it was argued, using the ‘^anthropic principle”, that the 
coincidences found by Dirac can be derived from physical models of stars and the 
competition between the weakness of gravity with respect to nuclear fusion [28]. 
Second, let us note that considering a variable constant accounts to considering 
that it is a dynamical field. This was first pointed out by Jordan [29] who proposed 
the first implementation of Dirac’s idea into a field-theory framework. Nevertheless, 
Dirac’s idea motivated many studies concerning the variation of the constant and 
it was trying, as we also do today, to make a link between the macroscopic and 
microscopic world and it was mainly driven by the still non-understood fact that 
gravity is very weak compared with other forces of nature. 

How many constants should be tested? To be pragmatic, the constancy of all 
the parameters that are not determined by the theory at hand have to be tested. 
As an example, the minimal standard model of particle physics plus gravitation 
that describes the four known interactions depends on 20 free parameters. Either 
these parameters are not fundamental constants and will be considered as fields 
in a more general theory, one output of which must be the determination of these 
parameters, or they are fundamental constants in which case one will only be able 
to measure them. In that sense, the study of the constants offers a window on the 
limits of the theory itself. 

To finish this series of comments, let us stress that the introduction of con- 
stants in physical laws is closely related to the existence of systems of units. It 
implies that the numerical constants is deeply related to the definition of what 
we call a second, a meter etc. . . Since the definition of a system of units and 
the value of the fundamental constants (and thus the status of their constancy) 
are entangled, and since the meaisurement of any dimensionful quantity is in fact 
the measurement of a ratio to standards chosen as units, it only makes sense to 
consider the variation of dimensionless ratios. We will thus focus on the variation 
of dimensionless ratios which, for instance, characterize the relative magnitude of 
two forces, and are independent of the choice of the system of units and of the 
choice of standard rulers or clocks. 

Since we can only measure the variation of dimensionless quantities (such as 
the ratio of two wavelengths, two decay rates, two cross sections . . . ), the idea is to 
pick up a physical system which depends strongly on the value of a set of constants 
so that a small variation will have dramatic effects. The general strategy is thus to 
constrain the spacetime variation of an observable quantity as precisely as possible 
and then to relate it to a set of fundamental constants. This latter step involves 
limitations related to our theoretical understanding of the considered system. 

3.2 It is a test of the theory of gravity 

Testing for the constancy of the constants is a test of the Einstein equivalence 
principle. But let us also stress that if constants are varying one expects also in to 
have an anomalous force (a “fifth force” ) which is likely to violate the universality 
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of free fall. Note that to violate the universality of free fall, it requires that the 
extra-field mediating the new force does not couple universally to all matter fields. 

Since the mass of any test body depends on the masses of its constituants 
and of its binding energy, we expect it to depend on the value of all the coupling 
constants as well as of the mass of all fundamental particles. This has a profound 
consequence concerning the motion of any test body. 

Let a be any fundamental constant, assumed to be a scalar function and 
having a time variation of cosmological origin so that in the privileged cosmological 
rest- frame it is given by a{t). A body of mass m moving at velocity v will experience 
an anomalous acceleration 



ôa 



1 dmv dv d\nm . ^ 

; T' = — 

m dt dt da 



(13) 



N ow, in the rest-frame of the body, a has a spatial dependence a[{t' -h v.f /c^)/ 
so that, as long as v c, Va = [à/c^)v. The anomalous acceleration 
can thus be rewritten as 



Sd — — 



a Smc^\ 

J V Ina. 



m ôa 



(14) 



The violation of the universality of free fall is quantified by considering the pa- 
rameter rji 2 characterizing the difference of acceleration of two bodies, labelled 1 
and 2, and defined by 



r]i2 



o |ai -02| 
|ai + a,2\ 



(15) 



For two test bodies falling in an external gravitational field, ^ext, one gets a viola- 
tion of order 



r]i2 



d ln(mi /m 2 ) o I V In a| 

c . 

5 In a l^extl 



(16) 



This anomalous acceleration is generated by the change in the (electromagnetic, 
gravitational, ...) binding energy [30]. Besides, the a-dependence is a priori 
composition-dependent. As a consequence, any variation of the fundamental con- 
stants will entail a violation of the universality of free fall: the total mass of the 
body being space dependent, an anomalous force appears if energy is to be con- 
served. 

In conclusion, deviation from Einstein gravity, violation of the universality of 
free fall and variation of the constants are in general related and expected together 
so that testing for the constancy of the constants on astrophysical scales opens a 
window on gravity at these scales. 



3.3 The fine structure constant: status of the constraints 

Recent astrophysical observations have restarted the debate concerning the con- 
stancy of the fine structure constant, a^^. We briefiy review in this section the 
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various constraints obtained up to now in order to present the different methods. 
All the details concerning the different methods and bounds were clearly explained 
in Ref. [19]. 

• The Oklo phenomenon 

Oklo is a prehistoric natural fission reactor that operated about 2 x 10^ yr 
ago (corresponding to a redshift of ~ 0.14) during a few million years in the 
Oklo uranium mine in Gabon. Two billion years ago, uranium was naturally 
enriched (due to the difference of decay rate between and and 
represented about 3.68% of the total uranium (compared with 0.72% today). 
Besides, in Oklo the concentration of neutron absorbers which prevent the 
neutrons from being available for the chain fission was low; water played the 
role of moderator and slowed down fast neutrons so that they can interact 
with other and the reactor was large enough so that the neutrons did 
not escape faster than they were produced. 

From isotopic abundances of the yields, one can extract informations about 
the nuclear reactions at the time the reactor operated and reconstruct the 
reaction rates at that time. One of the key quantity measured is the ratio 
g2^Sm/g2^Sm of two light isotopes of samarium which are not fission prod- 
ucts. This ratio of order of 0.9 in normal samarium, is about 0.02 in Oklo 
ores. This low value is interpreted by the depletion of g2^Sm by thermal 
neutrons to which it was exposed while the reactor was active. 

Shlyakhter [31] pointed out that the capture cross section of thermal neutron 
by IfSm 

n + ||®Sm — + (17) 

is dominated by a capture resonance of a neutron of energy of about 0.1 eV. 
The existence of this resonance is a consequence of an almost cancellation 
between the electromagnetic repulsive force and the strong interaction. To 
obtain a constraint, one first needs to measure the neutron capture cross 
section of 52^ Sm at the time of the reaction and to relate it to the energy of 
the resonance. One has finally to translate the constraint on the variation of 
this energy on a constraint on the time variation of the considered constant. 

Without going into the details, the key point is the sensitivity of the value 
of the energy of the resonance, Er, to change of the fine structure constant 
that was estimated to be of order [32] 

This large amplification between the resonance energy (~ 0.1 eV) and the 
sensitivity (^^ 1 MeV) allows to set a constraint of magnitude | < 

Er/1 Mev ~ 10“^. More precisely, Damour and Dyson [32] obtained the 
constraint 



0.9 X 10 ^ < 1-2 x 10 ^ 



(19) 
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at 2a level, corresponding to the range —6.7 x 10 ^^yr ^ < 

5.0 X 10”^^yr“^ if is assumed constant. 

• Other nuclear constraints 

a-, ^-decays and spontaneous fission were also used to constrain the vari- 
ation of the fine structure constant. The main idea is to extract the 
dependence of the decay rate and to use geological samples to bound its time 
variation. 

Many nuclei were used. The sharpest constraint was obtained from the (3- 
decay of osmium to rhenium by electron emission 

75’^ Re — > 70^Os -I- -h e“, (20) 

first considered by Peebles and Dicke [33]. As long as long-lived isotopes 
are concerned, for which the decay energy is small, we can use a non- 
relativistic approximation for the decay rate 

X = A{AEf (21) 

so that the sensitivity is given by 

din A dlnA^ 

^ = =Pj] • (22) 

dlna^^ alno^EM 

Peebles and Dicke [33] noted that the very small value of its decay energy 
AE 2=: 2.5 keV makes it a very sensitive indicator of the variation of 
In that case p ~ 2.8 so that s ~ —18000. It follows that a change of about 
10“^% of will induce a change in the decay energy of order of the keV, 
that is of the order of the decay energy itself. 

The data concerning osnium were recently updated to take into account the 
improvements in the analysis of the meteorite data which now show that the 
half-life has not varied by more than 0.5% in the past 4.6 Gyr (i.e. a redshift 
of about 0.45). This implies that [34] 

|Aa,„/Q,„|<3xlO-^ (23) 



• Atomic spectra 

The previous bounds on the fine structure constant assume that other con- 
stants like the Fermi constant do not vary. The use of atomic spectra may 
offer cleaner tests since we expect them to depend mainly on combinations^ 
of (1 and ^p. Two approaches have mainly received attention: the com- 
parison of atomic clocks in laboratory experiments and the use of quasar 
absorption spectra. 

^ pis the ratio between the masses of the electron and proton and g-p the proton gyromagnetic 
factor. 
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Laboratory experiments are based on the comparison either of different 
atomic clocks or of atomic clock with ultra-stable oscillators. They are thus 
based only on the quantum mechanical theory of the atomic spectra. They 
also have the advantage to be more reliable and reproducible, thus allowing a 
better control of the systematics and a better statistics. Their evident draw- 
back is their short time scales, fixed by the fractional stability of the least 
precise standards. This time scale is of order of a month to a year so that 
the obtained constraints are restricted to the instantaneous variation today, 
but it can be compensated by the extreme sensibility. They involve the com- 
parison of either ultra-stable oscillators to different composition or of atomic 
clocks with different species. Solid resonators, electronic, fine structure and 
hyperfine structure transitions respectively give access to^ i?oo/<^EM 5 

-RooaL and£?p//JîooaL- 

Among all experiments, the sharpest constraint has been obtained by Sortais 
et al. [35] who compared a rubidium to a cesium clock over a period of 24 
months. They deduced that dln(z/j^^/z/^J/dt = (1.9 zb 3.1) x 10“^^yr“^. 
Assuming constant, they deduced 

= (4.2 ± 6.9) X 10-'® yr-'. (24) 

The observation of spectra of distant astrophysical objects encodes informa- 
tion about the atomic energy levels at the position and time of emission. As 
long as one sticks to the non- relativistic approximation, the atomic transition 
energies are proportional to the Rydberg energy and all transitions have the 
same a -dependence, so that the variation will affect all the wavelengths by 
the same factor. Such a uniform shift of the spectra can not be distinguished 
from a Doppler effect due to the motion of the source or to the gravitational 
field where it sits. The idea is to compare different absorption lines from 
different species or equivalently the redshift associated with them. 

While performing this kind of observations a number of problems and sys- 
tematic effects have to be taken into account and controlled: (i) errors in 
the determination of laboratory wavelengths to which the observations are 
compared; (ii) while comparing wavelengths from different atoms one has to 
take into account that they may be located in different regions of the cloud 
with different velocities and hence with different Doppler redshift; (iii) one 
has to ensure that there is no light blending; (iv) the differential isotopic 
saturation has to be controlled. Usually quasars absorption systems are ex- 
pected to have lower heavy element abundances; (v) hyperfine splitting can 
induce a saturation similar to isotopic abundances; (vi) the variation of the 
velocity of the Earth during the integration of a qu 2 isar spectrum can induce 
differential Doppler shift; (vii) atmospheric dispersion across the spectral di- 
rection of the spectrograph slit can stretch the spectrum; (viii) the presence 

^Roo is the Rydberg constant. 
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of a magnetic field will shift the energy levels by Zeeman effect; (ix) temper- 
ature variations during the observation will change the air refractive index in 
the spectrograph; (x) instrumental effects such as variations of the intrinsic 
instrument profile have to be controlled. Most of these effects are discussed 
in Ref. [36]. 

An efficient and extensively used method is to observe fine-structure doublets 
for which the frequency splitting between the two lines of the doublet is 

^2 7-4 p 

(25) 

It follows that i^vjv oc It can be inverted to give ^ ^ 

function of AA and Â as 




Murphy et al [37] studied the doublet lines of Si IV, C IV and Ng II and 
focused on the fine-structure doublet of Si IV toward 8 quasars with redshift 
z ~ 2 — 3 to get 

=(-0.5±1.3)xl0-^ (27) 

Recently Webb et al [38] introduced a new method referred to as the many 
multiplet method in which one correlates the shift of the absorption lines of 
a set of multiplets of different ions. One advantage is that the correlation 
between different lines allows to reduce the systematics. An improvement 
also arises from the comparison of transitions from different ground-states 
for ions with very different atomic mass; this increases the sensitivity because 
the difference between ground-states relativistic corrections can be very large 
and even of opposite sign. 

Webb et al [39] re-analyzed their initial sample [38] and included new optical 
QSO data to have 28 absorption systems with redshift 2 ; = 0.5 — 1.8 plus 
18 damped Lyman-a absorption systems towards 13 QSO plus 21 Si IV 
absorption systems toward 13 QSO . The analysis used mainly the multiplets 
of Ni II, Cr II and Zn II and Mg I, Mg II, A1 II, A1 III and Fe II were also 
included. The data were reduced to get 72 individual estimates of 
spanning a large range of redshift. From the Fe II and Mg II sample they 
obtained 

A«bm /a™ = (-0.7 ± 0.23) x (28) 

for 2 ; = 0.5 — 1.8 and from the Ni II, Cr II and Zn II they got 

Aa^„ = (-0.76 ± 0.28) x (29) 

for .2 = 1.8 — 3.5 at a Aa level. It refers only to the statistical confidence level. 
The fine-structure of Si IV gave 

Aa^„/aBM =(-0.5±1.3)xl0-5 



(30) 




Vol. 4, 2003 Tests of Gravity on Astrophysical Scales 



S361 



for 2 : = 2 — 3. These results are summarized on Fig. (3). This series of results 
is of great importance since all other constraints are just upper bounds. 
Such a non-zero detection, if confirmed, will have tremendous implications 
concerning our understanding of physics. Among the first questions that 
arise, it is interesting to test whether this variation is compatible with other 
bounds (e.g. test of the universality of free fall), to study the level of detection 
needed by the other experiments knowing the level of variation by Webb et 
al. [39] , to sort out the amplitude of the variation of the other constants and 
to ensure that no systematic effects has been forgotten. 

• Cosmological constraints 

On cosmological scales, the results of primordial nucleosynthesis and the 
observation of the cosmic microwave background were used to constrain the 
variation of the fine structure constant. 



The Cosmic Microwave Background Radiation (CMBR) is composed of the 
photons emitted at the time of the recombination of hydrogen and helium 
when the universe was about 300,000 years old. This radiation is observed 
to be a black body with a temperature T = 2.723 K with small anisotropies 
of order of the fj,K. Prior to recombination, the photons are tightly coupled 
to the electrons, after recombination they can be considered mainly as free 
particles. Changing the fine structure constant modifies the strength of the 
electromagnetic interaction and thus the only effect on CMB anisotropies 
arises from the change in the differential optical depth of photons due to 
the Thomson scattering, f — XerieCaT, which enters in the collision term of 
the Boltzmann equation describing the evolution of the photon distribution 
function and where Xq is the ionization fraction (i.e. the number density 
of free electrons with respect to their total number density rie). The first 
dependence of the optical depth on the fine structure constant arises from 
the Thomson scattering cross-section given by 






Stt 2 

3 



(31) 



and the scattering by free protons can be neglected since m^lm^ ~ 5 x 10 
The second, and more subtle dependence, comes from the ionization fraction. 

Avelino ei al. [40] claim that BOOMERanG and MAXIMA data favor a value 
of smaller by a few percents in the past and Battye et ai [42] showed 
that the fit to current CMB data are improved by allowing ^ 0 and 

pointed out that the evidence of a variation of the fine structure constant can 
be thought of as favoring a delayed recombination model. Avelino et ai [41] 
then performed a joint analysis of nucleosynthesis and CMB data and did 
not find any evidence for a variation of at one-sigma level at either 
epoch. They consider the baryon fraction and the fine structure constant as 
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independent and the marginalization over one of the two parameters lead to 

-0.09 < < 0.02 (32) 

at 68% confidence level. Landau et al. [43] concluded from the study of 
BOOMERanG, MAXIMA and CORE data in spatially flat models with adi- 
abatic primordial fluctuations that, at 2a level, 

-0.14< < 0.03. (33) 

All these works assume that only is varying but one has to assume the 
constancy of the electron mass. The variation of the gravitational constant 
can also have similar effects on the CMB [44]. In conclusion, strong con- 
straints on the variation of can be obtained from the CMB only if the 
cosmological parameters are independently known. 

The amount of ^He produced during the big bang nucleosynthesis is mainly 
determined by the neutron to proton ratio at the freeze-out of the weak 
interactions that interconvert neutrons and protons. The result of Big Bang 
nucleosynthesis (BBN) thus depends on G, the weak and strong coupling 
constants respectively through the expansion rate, the neutron to proton 
ratio, the neutron-proton mass difference and the nuclear reaction rates, 
besides the standard parameters such as e.g. the number of neutrino families. 
In more details, the abundance of “^He by mass, Yp, is well estimated by 

(«/ri,exp(-t„/T ) 

1 + (n/p)f exp(-iN/rn) 

{n/p)f = ex.p(—Q/kT{) is the neutron to proton ratio at the freeze-out time, 
Tf, and with Q = rrin — rrip. The time is the time after which the photon 
density becomes low enough for the photo-dissociation to be negligible; it 
is roughly given by oc = 1.636Gp(l + 3g\)m^/ {2 tt^) is 

the neutron lifetime, with g a — 1.26 being the axial/vector coupling of the 
nucleon. Assuming that the deuteron binding energy is proportional to the 
square of the strong interaction structure constant, Ed a this gives a 
dependence t^/r^ oc Gp being the Fermi constant. 

The light element abundances are thus sensible to the freeze-out tempera- 
ture, which depends on the Fermi constant, G, on the proton- neutron mass 
difference Q, and on the values of the binding energies Ba so that they mainly 
depend on the four coupling constants and the mass of the quarks. An in- 
crease in G or in the number of ultra-relativistic particles results in a higher 
expansion rate and thus to an earlier freeze-out, i.e. a higher Tf. A decrease 
of the Fermi constant, corresponding to a longer neutron lifetime, leads to a 
decrease of the weak interaction rates and also results in a higher Tf. This 
shows that primordial nucleosynthesis involves many constants which makes 
the interpretation of the results more difficult. 
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Bergstrom et al [45] studied the dependence of the thermonuclear rates on 
. Keeping all other constants fixed, assuming no exotic effects and taking 
a lifetime of 886.7 s for the neutron, it was deduced that 

<2x10-2. (35) 



• Universality of free fall 

As explained in Section 3.2, one expects to observe a violation of free fall. 
The most accurate constraints on 7712 are 7712 = (—1.9 ±2.5) x 10“^^ between 
beryllium and copper [46] and I 7712 I < 5.5 x 10“^^ between Earth-core-like 
and Moon-mantle-like materials [47]. The Lunar Laser Ranging (LLR) ex- 
periment gives the bound 7712 = (3.6 ± 4) x 10“^^ [48]. 

The LLR constraint, |aEarth — ^Moon| < cm.s“^, implies that on the 

size of the Earth orbit 

IVlna^j^l < 10“^^ — 10“^^cm“^ (36) 

Extending this bound to the Hubble size leads to the estimate < 

— 10“^. This indicates that if the claim by Webb et al. [39] is correct then 
it should induce a detectable violation of the equivalence principle by coming 
experiments such as MICROSCOPE^ and STEP^. They will respectively 
have an accuracy of the level 77 ~ 10 “^^ and 77 10 “^^. 

3.4 Other constants 

The variation of other constants have also been considered. This is the case of 
the gravitational constant, the electron to proton mass ratio, the weak and strong 
interaction structure constants and the proton gyromagnetic factor. A description 
of the observations used in these cases can be found in Ref. [19], as well as an up 
to date list of the observational constraints. Let us just discuss the latest results 
concerning the electron to proton mass ratio, /i. 

More recently, Ivanchik et al. [49] measured, with the VLT, the vibro- 
rotational lines of molecular hydrogen for two quasars with damped Lyman- a 
systems respectively at 2 : = 2.3377 and 3.0249 and also argued for the detection 
of a time variation of fi. Their most conservative result is (the observational data 
were compared to two experimental data sets) 

= (-5.7 ± 3.8) X 10"^ (37) 

at 1.5(J and the authors cautiously point out that additional measurements are 
necessary to ascertain this conclusion. 



■^http : //sci2 . esa. int/Microscope/ 
^http : //einstein. stanford . edu/STEP/ 
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4 What should we conclude? 

The constraints concerning the fine structure constant are summarized on Fig. 4. 
The non-zero detections by Webb et al. [39] draw the questions of their compati- 
bility with the bounds obtained from other physical systems but also, on a more 
theoretical aspect, of the understanding of such a late time variation which does 
not seem to be natural from a field theory point of view. 

Theoretically, one expects all constants to vary (e.g. in GUT, Kaluza-Klein 
and string inspired models) and the levels of their variation are correlated. Better 
analysis of the degeneracies are really needed before drawing definitive conclusions 
but such analysis are also dependent in the progresses in our understanding of the 
fundamental interactions and particularly of the QCD theory and on the generation 
of the fermion masses. 

In a given theoretical framework, one can deduce the relation between the 
variation of different constants as well as their time behavior. Concerning the 
time variation, the result by Webb [39] is not compatible with the Oklo or Re/Os 
results if the variation is linear with time (see Fig 3). The recent bound by Olive et 
al [34] at 2 ^ 0.45 emphasizes the difficulty to achieve such a late time deviation 
and shows that the stabilization of the fine structure constant, if it has varied, had 
to occur very quickly. Concerning the compatibility with other measurements, it 
can be concluded that, in a GUT framework, the result by Webb [39] is neither 
compatible with the constraints on the variation of /.i and from quasar 

spectra nor with the bound by Murphy et al [37] using the Si IV doublet. Both 
results (on 0 ;^^^ and /i) arise from the observation of quasar absorption spectra; 
it is of importance to ensure that all systematics are taken into account and are 
confirmed by independent teams, using e.g. the VLT which offers a better signal 
to noise and spectral resolution. 

The developments of high energy physics theories such as multi-dimensional 
and string theories provide new motivations to consider the time variation of the 
fundamental constants (see Ref. [19] for a review of the theoretical motivations). 
The observation of the variability of these constants constitutes one of the very 
few hope to test directly the existence of extra-dimensions. In the long run, it may 
help to discriminate between different effective potentials for the dilaton and/or 
the dynamics of the internal space. But indeed, independently of these motiva- 
tions, the understanding of the value of the fundamental constants of nature and 
the discussion of their status of constant remains a central question of physics in 
general: questioning the free parameters of a theory accounts to question the the- 
ory itself. The step from the standard model-hgeneral relativity to string theory 
allows for dynamical constants and thus starts to address the question of why the 
constants have the value they have. Unfortunately, no complete and satisfactory 
stabilization mechanism is known yet and we have to understand why, if confirmed, 
the constants are still varying. Remind that the dilaton has to become massive or 
to decouple for low-energy effective string theory to be compatible with the test 
of general relativity. 
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Fractional look-back time 

Figure 3: Data points concerning the value of the fine structure constant inferred 
from the observations of quasar spectra by Webb et al [39]. The best fit of the 
data of figure, assuming a constant fine structure constant (solid line), does not 
seem to favor today’s value of the fine structure constant (dotted line). This could 
indicate an unknown systematic effect. Besides, if the variation of is linear 
(dash line) then these observations is incompatible with the Oklo results. From 
Murphy et al [36]. 

Let us also emphasize an important issue. If the fine structure constant has 
varied in a close past then, in an effective 4-dimensional theory, the only consistent 
approach to make a Lagrangian parameter time dependent is to consider it as a 
field, (j) say. The Klein-Gordon equation for this field (0 4- -h m?(j) + , . . = 0) 
implies that 4> is damped as (/) oc if its mass is much smaller than the Hubble 
scale and that it oscillates if its mass is much larger than the Hubble scale. Thus, 
in order to be varying during the last Hubble time but not drastically, </) has to be 
very light with typical mass m ~ Hq ~ 10“^^ eV. As in the case of quintessence, 
this induces difficulties to understand how such a light mass is stable against 
radiative corrections. 

There are a lot of theoretical motivations to point toward a deviation from 
Einstein gravity on large scales (such as the cosmological constant problem or 
string phenomenology). I have presented two independent ways of testing this 
ideas, the test of the inverse square law on astrophysical scales using weak grav- 
itational lensing and the test of Einstein equivalence principle using the study of 
the variation of the constants. This may open a window on the law of gravity 
and it offers a new links between astrophysics, cosmology and high-energy physics 
complementary to primordial cosmology. 
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Figure 4: Summary of the different constraints described in the text. From Murphy 
et al [36]. 

Acknowledgments 

I thank the organizers of TH-2002 for their invitation to present these results. 
Some of the results presented here have been obtained in collaboration with FYancis 
Bernardeau. I thank him for our ongoing and fruitful collaboration. 

References 

[1] P.J.E. Peebles and B. Ratra, Rev. Mod. Phys. (to appear), 

[arXiv : astro-ph/0207347]. 

[2] T. Buchert and M. Carfora, [arXiv:gr-qc/02 10045]. 

[3] S. Weinberg, Rev. Mod. Phys. 61 , 1 (1989). 

[4] J.C. Long J.C., et al, [arXiv:hep-ph/0210004]. 

[5] R. Gregory, V.A. Rubakov, and S.M. Sibiryakov, Phys. Rev. Lett. 84, 4690 

(2000). 

[6] 1. 1. Kogan, et ai, Nucl Phys. B 584, 313 (2000). 

[7] G. Dvali, G. Gabadadze, and M. Porati, Phys. Lett. B 485, 208 (2000). 





Vol. 4, 2003 Tests of Gravity on Astrophysical Scales 



S367 



[8] B. Carter, J.-P. Uzan, R. Battye, and A. Mennim, Class. Quant. Gray. 18, 
4871 (2001); J.-P. Uzan, Int. J. Mod. Phys. A 17, 2739 (2002); J.-P. Uzan, 
Int. J. Theor. Phys. (2002) to appear. 

[9] C. DefFayet, Phys. Lett. R502, 1999 (2001); C. DefFayet et al., Phys. Rev. D 
65, 044023 (2002). 

[10] T. Damour, LI. Kogan, and A. Papazoglou, [arXiv:hep-th/0206044]. 

[11] M. Milgrom, [arXiv:astro-ph/020723l]. 

[12] N. Arkani-Hamed et ai, [arXiv:hep-th/0209227]. 

[13] T.R. Taylor, and G. Veneziano, Phys. Lett. B 213, 450 (1988); E. Witten, 
Phys. Lett. B 149, 351 (1984). 

[14] T. Damour and A.M. Polyakov, Nuc. Phys. B 423, 532 (1994); T. Damour 
and A.M. Polyakov, Gen. Rel. Gray. 26, 1171 (1994). 

[15] A. Aguirre, C.P. Burgess, A. Friedland and D. Nolte, Class. Quant. Gray. 
18, R223 (2001). 

[16] R.H. Sanders, Month. Not. R. Acad. Soc. 223, 539 (1986). 

[17] S.W. Allen, S. Ettori, and A.C. Fabian, [arXiv:astro-ph/0008517]. 

[18] J.-P. Uzan and F. Bernardeau, Phys. Rev. D 64, 083004 (2001). 

[19] J.-P. Uzan, Rev. Mod. Phys. (to appear), [arXiv:hep-ph/0205340]. 

[20] E. Fischbach and C. Tamadge, The search for non Newtonian Gravity (AIP 
Springer-Verlag, New York, 1999); Y.T. Chen and A. Cook, Gravitational 
experiments in the laboratory (Cambridge University Press, New York, 1993). 

[21] L. Van Waerbeke et al, Astron. Astrophys. 358, 30 (2000); N. Kaiser, G. Wil- 
son, and G.A. Luppino, [arXiv:astro-ph/0003338]; D. Bacon, A. ReFregier 
and A. Ellis, Month. Not. R. Astron. Soc. 318, 625 (2000); D. Wittman D. 
et ai, Nature 405, 143 (2000). 

[22] Y. Mellier, Annu. Rev. Astron. Astrophys. 37, 127 (1999). 

[23] M. Bartelmann and P. Schneider, [arXiv: astro-ph/99 12508] and reFs. 
therein. 

[24] M. Vogeley, [arXiv:astro-ph/9805160]. 

[25] V. Narayanan, A. Berlind, and D. Weinberg, Astrophys. J. 528, 1 (2000). 

[26] C.M. Will, Theory and experiment in gravitational physics, (Gambridge Uni- 
versity Press, 1993); C.M. Will, Living Rev. Relativity 4, 4 (2001). 




S368 



Jean-Philippe Uzan Ann. Henri Poincaré 



[27] P.A.M. Dirac, Nature (London) 139, 323 (1937); Proc. Roy. Soc. London A 
165, 198 (1938); Proc. Roy. Soc. London A 338, 439 (1974). 

[28] R.H. Dicke, Nature (London) 192, 440 (1961); B. Carter, in Confrontation of 
cosmological theories with observational data, I AU Symposia 63, edited by 
M. Longair (Reidel, Dordrecht) 291 (1974). 

[29] P. Jordan, Naturwiss. 25, 513 (1937); P. Jordan, Z. Physik 113, 660 (1939). 

[30] R.H. Dicke, in Relativity, Groups and Topology, Lectures delivered at Les 
Houches 1963, edited by C. DeWitt and B. De Witt, (Gordon and Breach, 
New York, 1964); R.H. Dicke, Gravitation and the universe, American Philo- 
sophical Society (Philadelphia, 1969); M.P. Haugan, Ann. Phys. (NY) 118, 
156 (1979); K. Nordtvedt, Phys. Rev. Lett 65, 953 (1990). 

[31] A.I. Shlyakhter, Nature (London) 264, 340 (1976). 

[32] T. Damour and F.J. Dyson, Nuc. Phys. H 480, 37 (1996). 

[33] P.J. Peebles and R.H. Dicke, Phys. Rev. 128 2006 (1962). 

[34] K. Olive, M. Pospelov, Y.-Z. Qian, A. Coc, M. Cassé, and E. Vangioni-Flam, 
2002, preprint hep-ph/0205269. 

[35] Y. Sortais et al., Physica Scripta T95, 50 (2001). 

[36] M.T. Murphy, J.K. Webb, V.V. Flambaum, and S.J. Curran, Proceedings of 
the XXII Physics in Collision Conference (PIC02), Stanford, USA, 11 June 
2002. 

[37] M.T. Murphy, J.K. Webb, V.V. Flambaum, C.W. Churchill, J.X. Prochaska, 
Month. Not. R. Astron. Soc. 327, 1223 (2001). 

[38] J.K. Webb, V.V. Flambaum, C.W. Churchill, M.J. Drinkwater, J.D. Barrow, 
Phys. Rev. Lett. 82, 884 (1999). 

[39] J.K. Webb, M.T. Murphy, V.V. Flambaum, V.A. Dzuba, J.D. Barrow, C.W. 
Churchill, J.X. Pochaska, and A.M. Wolfe, Phys. Rev. Lett. 87, 091301 
(2001). 

[40] P.P. Avelino, C.J. Martins, G. Rocha, and P. Viana, Phys. Rev. D62, 123508 

(2000). 

[41] P.P. Avelino, S. Esposito, G. Mangano, C.J. Martins, A. Melchiorri, G. Miele, 
O. Pisanti, G. Rocha, and T.P. Viana, Phys. Rev. D 64, 103505 (2001). 

[42] R.A. Battye, R. Crittenden, and J. Weller, Phys. Rev. D63, 0453505 (2001). 

[43] S. Landau, D.D. Harari and M. Zaldarriaga, Phys. Rev. D63, 083505 (2001). 




Vol. 4, 2003 Tests of Gravity on Astrophysical Scales 



S369 



[44] A. Riazuelo and J.-P. Uzan, Phys. Rev. D 66, 023525 (2002). 

[45] L. Bergstrom, S. Iguri, and H. Rubinstein, Phys. Rev. D 60 , 045005 (1999). 

[46] Y. Su, B.R. Heckel, E.G. Adelberger, J.H. Gundlach, M. Harris, G.L. Smith 
and H.E. Swanson, Phys. Rev. D 50 , 3614 (1994). 

[47] S. Baessler, B.R. Heckel, E.G. Adelberger, J.H. Gundlach, U. Schmidt and 
H.E. Swanson, Phys. Rev. Lett. 83 , 3585 (1999). 

[48] J. Müller, M. Schneider, K. Nordtvedt, and D. Vokrouhlicky, in Proceedings 
of the 8th Marcel Grossman Meeting on General Relativity, Jerusalem, 1997 
(World Scientific, Singapore, 1999). 

[49] A.V. Ivanchik, E. Rodriguez, P. Petitjean, and D. Varshalovich, 

[arXiv: astro-ph/0112323]. 



Jean-Philippe Uzan 

Institut d ’Astrophysique de Paris 

GReCO, ERE 2435-CNRS 

98bis boulevard Arago 

75014 Paris cedex 

Prance 

and 

Laboratoire de Physique Théorique 

CNRS-UMR 8627 

Université Paris Sud 

Bâtiment 210 

F-91405 Orsay cedex 

France 




I Annales Henri Poincaré 



Ann. Henri Poincaré 4, Suppl. 1 (2003) S371 - S384 
© Birkhâuser Verlag, Basel, 2003 
1424-0637/03/01S371-14 
DOI 10.1007/s00023-003-0928-8 



Conformai Invariance in Percolation, 

Self- Avoiding Walks, and Related Problems 

John Cardy 



Abstract. Over the years, problems like percolation and self-avoiding walks have 
provided important testing grounds for our understanding of the nature of the 
critical state. I describe some very recent ideas, as well as some older ones, which 
cast light both on these problems themselves and on the quantum field theories to 
which they correspond. These ideas come from conformal field theory. Coulomb gas 
mappings, and stochastic Loewner evolution. 

This talk is about ‘geometric’ critical phenomena. These are random spatial pro- 
cesses, where either (1) the probability distribution is determined by equilibrium 
statistical mechanics, and we ask questions about geometrical properties, or (2) 
the probability distribution is itself geometrical in nature. The simplest example 
of (1) is clustering in percolation (see Fig. 1), in which the probability distribution 
is trivial (in this case sites of a triangular lattice are independently coloured black 
or white with equal probability,) but we ask questions like whether there exists a 
path on (say) the black sites connecting opposite edges of a large rectangle. The 
paradigm example of (2) is the ensemble self-avoiding walks (SAWs) of a fixed 
(large) length, all weighted equally (see Fig. 2). In this case one might ask, for 
example, questions about the distribution of the distance between the ends. Other 
examples abound: for example the clusters formed by the spins in a critical Ising 
model, or the boundaries of the Fortuin-Ksisteleyn clusters in the Potts model. 

Note that although percolation is relevant to disordered media and SAWs 
relevant but to polymer physics, the emphasis in this talk is on understanding the 
nature of their fractal geometry. The outline is as follows: first, we shall discuss the 
various expectations for ‘geometric’ critical behaviour from conventional critical 
behaviour, largely based on well-known mappings between these types of problem. 
Since, in two dimensions, critical behaviour is described by conformal field theory, 
we may deduce all sorts of (non-rigorous) results, and we shall describe some of 
these. A related set of non-rigorous techniques is based on a mapping of cluster 
boundaries to a height models and then to a Coulomb gas, and we shall mention 
these. 

But the main point of my talk will be to bring to the attention of theoretical 
physicists relatively new ideas which, it so happens, were developed by mathemati- 
cians, based on the direct construction of continuum limit of cluster boundaries, 
and known as SLE. I will describe how these provide rigorous and new results 
for percolation, as an example. I finish with some (slightly) provocative conclu- 
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Figure 1: Critical site percolation on the triangular lattice. Each hexagon is inde- 
pendently coloured black or white with probability Is there a path on neigh- 
bouring black hexagons which connects the left and right sides of the rectangle? 



sions about the usefulness of rigorous methods and of the traditional approach to 
quantum field theory, in the study of critical phenomena in general. 

‘Geometric’ vs. ‘conventional’ critical behaviour 

Much of our intuition about geometric critical behaviour is based on two well- 
known mappings: 

1. (Fortuin-Kasteleyn [1]) Q-state Potts model 

Random Cluster Model: the Potts model is a generalisation of the Ising 
model in which ‘spins’ at the sites of a lattice can each take one of Q values. 
Initially, Q must be an integer > 2, but the partition function may also be 
written 

/^jclustersK 

— /percolation 

where the clusters are weighted as in percolation, with the parameter p being 
simply related to the temperature in the Potts model. Evidently, the limit 
Q ^ 1 reproduces percolation. Although in this limit the partition function 
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Figure 2: A typical self- avoiding walk. In this case the lattice is too fine to be 
visible. 



(with free boundary conditions) is trivially equal to 1, the correlations are 
nontrivial. 

2. (de Gennes [2]) 0(n) model Self- Avoiding Loops: This model similarly 
generalises the Ising model to n-component spins and a hamiltonian which 
is invariant under 0{n) rotations. The partition function may be expressed 
as that of a gas of non- intersection loops: 

Z = {n"“°P=l>,oop gas 

Evidently the case n = 1 corresponds to the Ising model, while the limit 
n — > 0 leaves a single self- avoiding loop. In fact the 0(n) model has two 
types of critical behaviour for n < 2: one corresponding to the dilute phase, 
and a dense critical phase. In the dense phase, the loops are in the same 
universality class as the hulls of F-K clusters with Q = n^. 

From these correspondences, we can build a dictionary which relates geometrical 
properties to more conventional thermodynamic quantities. For example 

Cluster size 4^ susceptibility oc {p ~ 

Radius correlation length ^ mass^^^^ 
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Critical Behaviour & Euclidean Field Theory 

It has been realised since the late 1960s that the scaling limit of an isotropic system 
near a continuous phase transition is a euclidean quantum field theory. If we take a 
near-critical lattice model, such that the correlation length f the lattice spacing 
a, the following limit exists: 

{(j){ri) . . . <f>{rjv))QFT = lim . . . 5(rAr)) lattice 

a^O, 4 fixed 

where (f){r) is a local quantum field, and S{r) is the corresponding lattice quantity. 
The non-trivial power is the scaling dimension of 0. This correspondence is 
rooted in an emphasis on correlation functions of local (or quasi-local) operators 
and their algebra encoded in the operator product expansion (and is therefore 
not always the best tool to investigate other quantities.) It has been proved in 
very few examples, but if assumed it has many powerful consequences: the Renor- 
malisation Group, universality, and, in particular scaling-.^, the property that the 
critical exponents describing off-critical behaviour of thermodynamic quantities 
are simply related to those describing decay of correlation functions at the critical 
point. This means that, for many purposes, we may restrict ourselves to studying 
the behaviour at the critical point. This means that the corresponding quantum 
field theory is massless: a conformal field theory. 

Conformal Field Theory 

Conformal field theory provides a very powerful tool to study critical behaviour, 
especially in two dimensions: 

• in local classical field theories, scale invariance implies that the trace of the 
stress tensor T^, vanishes, and this by itself implies conformal invariance. 

• CFT assumes this holds (up to conformal anomaly c) in the full theory 
including fluctuation effects. 

• In CFT, unlike normal QFT, there is a 1-1 correspondence between local 
operators and states of the Hilbert space, and the spectrum of these is usually 
discrete rather than continuous. 

• For d = 2, these transform according to irreducible representations of an 
infinite-dimensional Virasoro algebra. Classifying these essentially classifies 
all possible CFTs, that is, all universality classes. 

• There is, however, a problem: given some critical lattice model, to which 
CFT does it correspond? 

• this was answered in part in the work of Friedan, Qiu and Shenker [3] , who 
showed that in theories having reflection positivity (eg. Q — 2,3,4 Potts 
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Figure 3 : Schematic represensation of a crossing cluster which connects the arcs 
C1C2 and C3C4. Of course in reality this is a fractal object with a fractal boundary. 



models, or n = 1, 2 in the 0{n) model), we should look for unitary represen- 
tations, and at least when c < 1 this leads to a discrete series of possibilities. 
Moreover, the necessary decoupling of null states in these theories leads 
to linear differential equations for correlation functions (Belavin, Polyakov, 
Zamolodchikov [ 4 ].) 

But percolation, SAWs, and related models are not unitary: in fact they have 
partition function Z — 1 {c = 0 ) even though their correlations are nontrivial. In 
fact non-unitary c = 0 CFTs are very poorly understood. Nevertheless they are 
important not just for percolation and SAWs, but for all critical problems with 
quenched disorder (e.g. the quantum Hall plateau transition, a major unsolved 
problem.) 

The Crossing Formula in Percolation 

A recurring theme in this talk will be the following problem (see Fig. 3 ): 

• Given a simply connected region D of the plane, with suitably smooth 
boundary dD with 4 marked points Cj, what is the probability of a spanning 
cluster connecting C1C2 with C3C4 (in the limit lattice spacing -^0)? 

The following conjecture was made (Cardy [ 5 ]), based on ideas of CFT and the 
mapping to the Q ^ \ limit of the Potts model; the Riemann mapping theorem 
allows us to conformally map interior of D into the unit disc, with the marked 
points Cj Zj . Then the crossing probability depends only the anharmonic ratio 
V = ^12234/213224 and is 

^( 3 ) h p (I 2 4 \ 



The argument depends on: 
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Figure 4: The crossing probability is a difference of Potts model partition functions, 
in which the Potts spins are fixed into the same (a), or different (a and 6), states 
along the two arcs C 1 C 2 and C 3 C 4 . 



• assuming that the scaling continuum limit exists and is given by a CFT with 
c = 0; 

• realising that crossing probability is related to a difference of partition func- 
tions of the Q ^ 1 Potts model, with different boundary conditions (Fig. 4); 

• realising that states of the CFT induced by changes in the boundary con- 
dition (‘boundary condition changing operators’) also should correspond to 
Virasoro representations; 

• guessing the right representation, and hence deducing the appropriate dif- 
ferential equation. 

The formula has been numerically verified to high precision, but it is hard to 
see how to make arguments rigorous, or to go beyond them. 

Cluster Boundary Approach 

Instead of thinking about clusters, in the Potts language, it is sometimes easier 
to think about cluster boundaries, or hulls, using the 0(n) language. There are 
really two different approaches here: the older Coulomb gas arguments, and the 
more recent ideas of SLE. 

‘Coulomb gas’ method 

(den Nijs [6], Nienhuis [7], Duplantier & Saleur [8], Kondev [9], . . .) The elements 
of these arguments are: 



• thinking of cluster boundaries in random cluster model (or loops in 0(n) 
model) as a gas of (unoriented) closed loops. 
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Figure 5: Oriented loops and the mapping to configurations of the height model. 



• randomly orientate each loop: each configuration of oriented loop than maps 
onto one of a height model, with degrees of freedom h{r) C integers, on the 
dual lattice (see Fig. 5): 

• the factors of Q (resp, n) can be associated with local (but in general com- 
plex!) Boltzmann weights; 

• assume in the continuum limit that h{r) takes values in the real numbers, 
and that the measure converges to a gaussian exp(— J (dh)‘^d‘^)] 

• this gives a c = 1 CFT, with, however, various bells and whistles like a charge 
at oo, screening charges, etc., which make it nontrivial; 

• within this formulation critical exponents are calculable, eg ^^saw = f and 

^perc — 3 1 

• but correlation functions are ambiguous, and, more seriously, it has proven 
very hard to make this approach rigorous. 

• nevertheless new results are still emerging from this method: eg distribution 
of internal areas of loops: density n{A) of large loops with area > A 

n{A) Cj A C universal 



where [10] 



C'perc = 1/8V37T 



0.0229720 predicted 
0.022972(1) measured 
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Figure 6: The process in which a random walker lays down the randomness at it 
progresses. 



Dynamical description of cluster boundaries (SLE) 

Although we usually think of percolation in terms of first laying down a particu- 
lar random configuration of black or white sites, and then identifying the clusters 
and the boundaries of the clusters, it is statistically equivalent to construct these 
boundaries as certain random walks, where the walker lays down the random con- 
figuration as it goes. More specifically, this ‘exploration’ process involves random 
walker laying down the configuration as it moves: black sites to the left, white to 
the right (see Fig. 6). Notice that the path automatically reflects from itself, and 
also from the boundary if we choose the correct boundary conditions (black sites 
to the left of the starting point, white to the right.) 

How should one characterise the continuum limit of these paths (assuming 
this exists)? In what sense is it conformally invariant? These were the questions 
addressed by Schramm [11] and Lawler, Schramm and Werner [12], and the answer 
is: 

Stochastic Loewner Evolution(SLE) 

For definiteness, consider the half-plane with black sites along x < 0 and white 
sites along x > 0. The walker starts at the origin, as shown in Fig. 7: 

Rather than trying to write an equation for the path, consider the conformal 
mapping 2 : ^ g{z\t) which sends {region of the half-plane which has not been 
excluded by the path} ^ {upper half-plane}. Then, instead of the dynamics of 
the path, we may think about dynamics on conformal mappings. In particular. 
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Figure 7: Schematic representation of SLE. At any given time, the path, and the 
region between it and the real axis, form an excluded part of the upper half plane. 
The complement of this is conformally mapped by the function gt{z) into the whole 
upper half plane. This function obeys the stochastic Loewner equation. 

Schramm [11] conjectured that the continuum limit of the percolation exploration 
process corresponds to the Loewner equation 

dg{z-,t) ^ 2 

dt g{z\t) — a{t) 

This has several important properties: 

• if a{t) is a real continuous function, the excluded region of the half- plane 
grows with increasing t\ 

• from requirements of scaling and locality, we conclude that a{t) must be 

Brownian motion, ie à = C^{t) with C,{t)C,{t') = — (Rohde & Schramm 

[13]) 

If 0 < K < 4 path is simple 

4 < Av < 8 it touches itself 

8 < K it fills space 

• the fractal dimension of the path is = 1 -f ac/8 (for k, < 8). 

• only for k = 6 does the SLE path not ‘feel’ where the boundary of the 
domain is as long as it does not hit it, as one expects for percolation with 
uncorrelated site probabilities. This led to the conjecture: 

SLEe is the conformally invariant continumn limit 
of percolation cluster boimdaries 
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• one can actually compute with SLE: it involves arguments quite familiar to 
theoretical physicists working in a different area, namely stochastic processes. 
In this way, one obtains all previously conjectured critical exponents at Pc 
(and with the help of rigorous scaling relations (Kesten [14]), exponents away 
from Pc), including multifractal irrational but algebraic exponents (related 
to 2d quantum gravity (Duplantier [15]), and some new ones, eg the 

• backbone exponent (Lawler, Schramm, Werner [16]), the fractal dimension of 
the part of the infinite cluster which would carry electric current. This is 
given by the lowest eigenvalue of a 2d Dirichlet problem (and is probably 
not a rational or even an algebraic number, which makes its derivation by 
CFT or Coulomb gas methods a real challenge.) 

• once SLE is assumed to describe the limit of percolation hulls, one easily 
gets and the crossing formula. As illustrated in Fig. 7, if we map the region 
into the upper half plane so that the arcs C1C2 and C'^C^ map to (—00, a) 
and (0, b) respectively, then 

Pr(white crossing (—00, a) (0,5) 

= Pr[a gets excluded before h) 

• this gives the same 2F1 formula as conjectured from CFT. 

However, all this is so far dependent on Schramm’s conjecture. 

The missing link: Smirnov’s proof of the crossing formula 

Smirnov [17] proved that the crossing formula holds for the continuum limit of site 
percolation on a triangular lattice, and thereby that SLEg is the continuum limit 
of percolation cluster boundaries. Therefore all the results derived from SLEg are 
rigorous. 

• First, it was observed by Carleson that the crossing formula is simple in an 
equilateral triangle (see Fig. 8). 

• Note that the formula as proposed is the boundary value of an analytic func- 
tion P{z): so what is its interpretation for 2 not on the boundary? Smirnov 
proposed studying P{z) = Pr{z separated from C1C4 by at least one cluster 
spanning from C1C2 to C2C4). Then: 

• on the triangular lattice, P{z) satisfies linear relations, which mean that its 
continuum limit exists, and in fact just is the real part of a harmonic func- 
tion, ie, a solution of Laplace’s equation. It is the rather unusual boundary 
conditions which then determine that P{z) oc distance from C1C4, which 
leads directly to the crossing formula when 2 is taken to lie on the boundary. 
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Figure 8: Carleson’s version of the crossing formula in an equilateral triangle, and 
Smirnov’s generalisation: what is the probability that there is a cluster which 
connects AB with BC^ at the same time disconnecting the point 2 from ACl 



Other values of k 

It turns out that other values of ac in SLE correspond to different values of n in 
the 0(n) model, for example 

• Self- Avoiding Walks: if we use the principle that uniform measure on set of 
simple paths must remain uniform when restricted to a subset, and assume 
the continuum limit is SLE,^, then we are led to /c = This reproduces all the 
conjectured results for SAWs (and more) (Lawler, Schramm, Werner [18]). 

• the critical Ising model is conjectured to be described by SLE3. This leads 
to all the standard results and some new ones: eg, if we take an Ising model 
inside a simply connected region of the plane, with boundary conditions 
that the spins are up on one segment of the boundary, and down on its 
complement (so that there exists a domain wall crossing the region, see 
Fig. 9), what is the probability that the domain passes above a given point 
zl The result is conformally invariant, and an explicit formula is provided 
by SLE (Schramm [19]). Of course these results still need the analogue of 
Smirnov’s proof to make them completely rigorous, but they appear to be 
completely beyond the reach of traditional approaches to the Ising model, 
which focus on spin correlation functions. 

Final Remarks 

I shall finish with a couple of possibly provocative remarks. The first concerns 
progress in rigorous results for critical behaviour. The first such nontrivial results 
were of course obtained for the Ising model in 1944 by Onsager, followed by the 
work of Yang, and this gave rise to a whole industry of computations of correlation 
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Figure 9: The Ising model in a domain with + boundary conditions on one segment 
of the boundary, and — boundary conditions on the complement: what is the 
probability that the consequent domain wall passes above the point zl 



functions and other quantities in this and variant models. The next step was 
perhaps Lieb’s 1967 solution of 6- vertex type models, then Baxter’s solution of the 
8- vertex model in 1971. The methods used by Baxter and others (Bethe ansatz, 
commuting transfer matrices, corner transfer matrices) are powerful and elegant, 
and they produce ‘exact’ results, but they are not, in general, fully rigorous. 

Now we finally have, thanks to the work of Kesten, Smirnov, Lawler, 
Schramm, Werner, and others rigorous results for a different sort of two- 
dimensional critical behaviour, namely percolation. The methods used are com- 
pletely different, relying on identifying the continuum limit explicitly, and per- 
forming computations directly in that limit, rather than taking the limit of lattice 
results. As a result, only universal quantities are calculated. The methods focus 
on the geometrical aspects of the measure, rather than the correlation functions 
of local operators. It is quite likely that in the next few years we shall see similar 
rigorous results for a whole set of such models, at least in two dimensions. 

Of course, particularly at a meeting on Theoretical Physics, one might ques- 
tion the necessity of mathematical rigour. Are we not only interested in the an- 
swers, and in the physical picture which they convey? Of course this is true, but the 
historical fact remains that theoretical physicists in this subject have been perhaps 
too happy for many years with their heuristic and incomplete arguments, which 
had once been very fruitful but perhaps had reached a dead end. It took mathe- 
maticians to question the true content of these arguments, to find them wanting, 
and then to develop a completely independent approach, that of SLE. The result 
is that we now are beginning to have a new physical way of understanding the ori- 
gins of conformal invariance and of the emergence of non- trivial critical exponents, 
quite separate from that of the renormalisation group. The methods of SLE could 
have been developed by physicists - after all they are based on Brownian motion 
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- but the fact is that they were not. This affords a strong example for continuing 
to make theoretical physics as mathematically rigorous as possible. 

The second point concerns progress in quantum field theory, especially from 
the point of view of critical behaviour. Historically, the development of the renor- 
malisation group point of view in 1969 by Wilson and others gave answers to many 
outstanding questions at the time. It gave a non-rigorous framework in which to 
understand many important features of critical phenomena, such as universality 
and scaling, and to do approximate computations. Prom 1984 the methods of con- 
formal field theory, originally developed for string theory, were brought in, and 
they provided a plethora of exact but non-rigorous results in two dimensions. But 
even today field theory is still tied to particle physics ideas. We see that both 
in the way it is taught and in the continuing focus on the objects of interest to 
particle theory: namely correlation functions of local fields. While this has been 
successful in the past, it is ill-suited to study other objects (eg the crossing for- 
mula, which required the introduction of ‘boundary-condition changing’ operators 
to be expressed in the old language.) One might ask whether QFT has outlived 
its usefulness to generate new results, or whether (rather like string theory) it can 
reinvent itself in the 21st century, in a different and more powerful form ~ perhaps 



QFT as fractal geometry? 



Note added. Very recently Bauer and Bernard [20] have, among other things, ex- 
posed the relationship between Virasoro null vectors and SLE, thus establishing 
the connection with the CFT approach to the crossing formula. 

This work was supported in part by EPSRC Grant GR/J78327. I thank 
R. Ziff for help in producing Fig. 1, and T. Kennedy for permission to reproduce 
Fig. 2. 
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Abstract. First-passage problems are related to such questions as the statistics of 
first-crossing times (across a boundary, or set of boundaries) of a stochastic process 
x(t), e.g. a random walker. In the ’’spatially extended” generalization, the single 
variable x{i) is replaced by a field /(x, t), with specified dynamics, and one considers 
the first passage of / at a given point x. Such problems are, in general, non-trivial 
even for rather simple processes such as diffusion. This class of problems will be 
introduced through coarsening systems and reaction-diffusion problems. Analytic 
results will be presented for the asymptotics of the one-dimensional two-species 
annihilation process, A -|- — > 0, with different initial densities. The asymptotic 

decay of the density of the minority species maps onto the survival probability, P(t), 
of a diffusing particle moving in an infinite sea of diffusing traps. We show that 
P{t) ~ exp( — Ap[Dt]^/^), where p is the trap density, D the trap diffusion constant 
and A = 4/y^, independent of the particle diffusion constant. The corresponding 
results for general dimensions d <2 will also be presented. 



1 Introduction 

There is a long history of the study of first-passage problems in many different 
areas of science [1, 2]. The simplest example is that of a random walker, starting 
at position x, with an absorbing boundary at x = 0. If the walker’s diffusion 
constant is J9, the probability that it has not yet been absorbed at time t is [1] 
P{t) = erf(|x|/\/4Dt), where erf(x) is the error function. The derivative of this 
function, P\{t) = —dP/dt = exp(-x^/4iAt), gives the probability 

distribution of the first-passage time^ i.e. the time at which the particle first reaches 
the origin. The persistence exponent^ 6^ is defined by the asymptotic large-time 
behavior P{t) ~ giving ^ = 1/2 for the random walker. 

The determination of 6 for more complex processes is in general very difficult. 
The random walker discussed above is described (in continuous time and space) 
by the Langevin equation dxjdt = ry(i), where r]{t) is Gaussian white noise with 
mean zero and correlator {r]{t)ri{t')) = 2D6{t — t'). The next simplest process 
is the random acceleration problem, d^x/dt^ — rj{t). This process is known to 
have 6 = 1/4 [3], though the calculation is much more involved than the random 
walk [4]. The higher derivative processes, d'^xjdt'^ = rf{t)^ have thus far proved 
intractable for n > 2. It seems, however, that the exponent 6 is not a simple 
fraction: numerical computations give [5] ^(3) = 0.22022(3), ^(4) = 0.20958(3) 
and ^(5) = 0.20413(3). The feature of all the processes with n > 1 that makes 
the calculation of 6 difficult is that they are non- Markovian [2]. Of course, these 
processes can be written as Markovian processes in a larger number of variables. 
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e.g. cPx/dt^ = rj{t) can be expressed as dxfdt = v, dv/dt = ? 7 (t), i.e. it is a Markov 
process with two degrees of freedom. The “spatially extended” processes which we 
now discuss involve infinitely many degrees of freedom. We discuss these under 
two related headings: coarsening problems, and reaction-diffusion problems, with 
special attention being paid to the two-species annihilation process A B ^ 0. 

2 Coarsening Processes 

Consider a system which, under cooling, undergoes a phase transition from a dis- 
ordered phase to an ordered phase. The transition can correspond to either an 
order-disorder transition or to phase separation, according to whether the order 
parameter is nonconserved or conserved. The essential point is that there is a 
unique Gibbs state (and the dynamics is ergodic) in the high-temperature phase, 
while in the low temperature phase there is more than one Gibbs state and ergod- 
icity is broken. 

If such a system is cooled rapidly through the phase transition, domains of 
the two (or more) possible ordered phases form and grow (“coarsen”) with time 
[6] , the driving force for the coarsening being the elimination of interfaces (domain 
walls) between the phases so as to reduce the overall (free) energy of the system. 
One can then pose the following persistence (or first-passage) problem: what is the 
fraction of space, at a given time t, which has always been in the same phase? This 
corresponds to the fraction of space which has never been crossed by a domain wall. 
In most cases, this decays as a power-law in time, P(t) ^ where 6 is in general a 
nontrivial number. Note that the order parameter, 0(x, t), of the phase transition 
is a field, i.e. it corresponds to an infinite number of degrees of freedom. Even 
though the dynamics of the entire field can be Markovian, e.g. the system might be 
described (for a nonconserved order parameter) by the time-dependent Ginzburg- 
Landau equation dt(l> = —T{ÔF/Ô(I)), where F[(f)] is a Ginzburg-Landau free-energy 
functional, the effective dynamics of the field at a single point, </>(x, t), is non- 
Markovian [2] . Measurements of 9 have been performed for a number of coarsening 
systems, e.g. Ising models [7], twisted nematic liquid crystals [8], gas diffusion [9], 
soap froths [10], and breath figures [11] (formed by vapor condensation on a cold 
plate). The last three examples show that coarsening phenomena encompass a 
much larger class of systems than phase transitions. 

In general, the exponent ^ is a non-trivial number, even for simple diffusion 
[12] (where one considers regions where the diffusion field has not crossed its 
mean value). One notable exception is the one-dimensional (ID) Potts model, for 
which exact results have been obtained [13]. This will be discussed in the following 
section. 
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3 Reaction-Diffusion Processes 

Domain coarsening in the ID Ising model at temperature T = 0 takes place 
through the elimination of domain walls. In this case the walls are independent 
random walkers which annihilate on contact, so the coarsening can be described 
by the reaction-diffusion process A-^A-^0. The “site” (or spin) persistence prob- 
lem posed above then amounts to calculating the fraction of sites that have not 
not been crossed by an A-particle or, in a continuum description, the fraction of 
the line which has not been traversed by an A-particle. The generalization to the 
q-stdite Potts model is immediate. In this model there are q possible phases. If 
the domain walls are represented by A-particles, and the system coarsens from a 
random initial state, two domain walls either annihilate (yl + 0 ) on contact 

or coalesce {A A ^ A), with probabilities l/{q — 1) and {q — 2)/{q — 1), these 
being the probabilities that the Potts states on either side of the walls are the 
same or different respectively. The persistence (fraction of space not crossed by 
a walker) decays as P{t) ^ where the exponent is given exactly by [13] 

^{q) = -1/8 + ( 2 / 7 r 2 ){cos ~'([2 - q]/\/2q)}^. 

A related problem, which might be termed “walker persistence” (as opposed 
to “site” persistence) is the following. What is the fraction of walkers (i.e. A- 
particles) which have never met another walker? The connection with the previous 
problem is as follows. Let us tag one particle and give it a different diffusion 
constant {D') from the other particles (D). Then we calculate the probability, 
P{t), that the tagged particle has not met another particle up to time t. For D' = 0 
this problem reduces to the site persistence problem, while D' — D corresponds 
to the walker persistence problem. An early analytical attack on the general D' 
problem was made by Monthus [14], who found P{t) ~ and computed (j){q) 

to first order in D'/D, for all q, and to first order in (q — 1) for all D'jD. Two 
trivial cases are g = 2, D' = D, and q = oo, D'/D arbitrary. In the former, all 
surviving walkers are persistent (since all reactions lead to annihilation), so the 
persistence is proportional to the particle density which is known [1] to decay as 
i 0 0^2) = 1/2. In the latter, all reactions lead to coalescence, so only the 
closest walkers to the left and right of the tagged particle play a role. The problem 
then reduces to a three-particle problem that can be solved exactly [15] for any 
D'/D, with P{t) ~ for D' = D, i.e. 0(oc) = 3/2. For values of q in the 

range 2 < g < cxd, however, only simulation results are available up to now [16]. 
One point worth noting is that (f)(q) > 0{q) for all g, i.e. the tagged particle has a 
greater survival probability if it stays still (D' = 0) than if it moves like the other 
particles (D' = D). We will return to this theme below. 

A grossly simplified version of this model, which turns out to be equivalent 
to a much-studied problem in chemical kinetics, is as follows. Consider a set of 
completely non-interacting random walkers, with equal diffusion constants D, ini- 
tially randomly distributed with density p. What is the fraction, D(t), of walkers 
that have never met another walker up to time f? Clearly this will decay faster 
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than for the Potts model, because no walkers ever disappear. In fact it decays as 

P(t) = exp(-AiVÎ) (1) 

in ID, as first shown rigorously by Bramson and Lebowitz [17] in another context. 
For analytical study it is convenient to tag one particle, give it a different diffusion 
constant, D', and treat the other particles as traps for the tagged particle. The 
idea is then to calculate the survival probability, P(t), of the tagged particle in 
the presence of the moving traps. 

Now consider a seemingly very different problem, the two-species annihilation 
process A + B 0, where the A and B particles have diffusion constants Da and 
Dbi and are initially randomly distributed with densities Pa(0), Pb( 0)- What is 
the asymptotic time dependence of the particle densities? This problem was first 
posed almost 20 years ago in the seminal paper of Toussaint and Wilczek (TW) 
[18], using the process as a model of monopole- antimonopole annihilation in the 
early universe (though applications to chemical kinetics and condensed matter 
physics are more numerous [19]). TW showed that if pa(0) = /^b( 0) the densities 
decay asymptotically as in space dimensions d < 4. If the initial densities are 
different, however, the density of the minority species (A, say) decays much more 
rapidly. The connection with the trapping problem is as follows. At late times, 
PA{t) PB{t) and the A particles can be regarded as independently diffusing in a 
background of the majority B particles, which act as traps for the A particles by 
virtue of the annihilation reaction. An equivalent problem, therefore, is to consider 
a single A particle moving among B particles (which do not interact with each 
other) and ask for the probability, P(t), that the A particle survives to time t. In 
the context of the original A-\- B ^ 0 process (or A B B [20] , which has the 
same asymptotics [17]) the “particle” A and “traps” B are taken to have the same 
diffusion constants, but for generality we will take them to be different. Da = D' 
and Db = D. 

Until recently, no analytical result was available for the constant Ai in Eq. 
(1). Furthermore, attempts to determine Ai by numerical simulations (or even to 
confirm the predicted stretched-exponential decay with exponent 1/2) are severely 
hampered by large, slowly decaying transients [21, 22, 23, 24]. In ref. [23], a sophis- 
ticated numerical approach enabled data to be obtained down to P{t) ~ 10“^^, 
but still the asymptotic time dependence could not be unambiguously established. 
In the following section we outline a derivation of an exact result for Ai and present 
the general result for dimensions in the range 1 <d <2. The principal results are 
presented in ref. [25], with full details in ref. [26]. 

4 Diflfiising Particle with Mobile Traps 

Consider a particle with diffusion constant D' moving in a sea of mobile traps 
with diffusion constant D, The particle starts at the origin, while the traps are 
initially placed randomly in space with density p. We derive an exact result for the 
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constant Ai in the ID result (1), and present the generalization to space dimen- 
sions d <2. The results are obtained by deriving upper and lower bounds on the 
survival probability of the particle and showing that they coincide asymptotically 
for t ^ oo. The upper bound is intuitive, but supported by numerical studies, 
while the lower bound is rigorous. 

1. Upper Bound. An intuitive upper bound, Pu{t), on P(t), for any D', is provided 
by the problem with D' == 0, in which the particle stays at its initial position, 
which we call x — Q. Although we have as yet been unable to make this bound 
rigorous [27] , it is intuitively clear that when, as here, the traps are (statistically) 
symmetrically placed with respect to the particle, the particle will on average 
survive longer if it stays still than if it diffuses. This assertion has been checked, 
using the algorithm outlined in ref. [23], for all (lattice) walks up to time t = 2S 
[26]. For D' = 0, P{t) is just the probability that none of the moving traps has 
reached the origin up to time t. This problem is exactly soluble [28]. Since similar 
techniques will be needed to derive the lower bound, we outline the solution here. 

The N traps move independently according to the Langevin equations xi = 
r}i[t) , i = 1,...,^", where r]i(t) is Gaussian white noise with mean zero and 
correlator {'ni{t)r]j{t')) — 2Dôijô{t — t'). The quantity P{t) is just the product 
of the individual trap probabilities. For a given trap starting at the required 
probability is [1] Pi{xi,t) = erf(|xi|/v^4D^). So the upper bound is 



Pu{t) = 



N 

Herf (|xi| 

i—1 




( 2 ) 



where (...) means an average over the initial positions of the traps. Since the Xi 
are also independent, the latter average also factors. Using N — pL^ where L is the 
length of the system, and each Xi is uniformly distributed in (— L/2,L/2), gives 



Pu{t) = 



çL/2 

IjL / dx erfc(|x|/ V 4Dt) 

J-LI2 



ex] 



v[-^p{DtlnŸY , (3) 



where the final result follows on taking the limit L ^ oo. 

2. Lower Bound. Consider the same system as before but with a pair of absorbing 
boundaries at x = ±1/2. We consider the subset of initial conditions in which 
all the traps lie outside the interval (— //2,//2) (and the particle is at x = 0), 
and trajectories in which neither the particle nor any of the traps has crossed a 
boundary up to time t. We calculate the probability, F/,(t), of such an occurrence 
over the ensemble of all initial conditions and trajectories. These restricted initial 
conditions and trajectories are a subset of all the possible initial conditions and 
trajectories in which the particle never meets a trap. It follows that P{t) > Pl(^), 
i.e. Pl(^) is a lower bound. 

The probability that there are no traps in the interval {—1/2, //2) at t = 0 is 
exp(— p/). Given that there are no traps in this interval at t = 0, the probability 
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that no traps enter the interval up to time t is given by the same result as in the 
derivation of the lower bound, namely exp[— Fiaally, the probability 
that the particle, starting at x = 0, has not left the interval (—1/2, 1/2) up to time 
t is given, for times t > by [1] (4 /tt) exp(— tt^D'^//^). Assembling these 

contributions gives 

P{t) > (4/tt) exp[— 4p(Dt/7r)^/^ — {pi D't/l‘^)] . (4) 

Since this inequality holds for all Z, the best lower bound is obtained by 
maximizing with respect to 1. The optimum value is I = {27r‘^D'tl pY^^ , and the 
best lower bound is 

Piit) = — exp[-4p(Dt/7r)^/^ — 3(7r^p^D't/4)^/^] . (5) 

7T 

Since the second term in the exponent is negligible compared to the first as t ^ oo, 
the two bounds converge to yield the asymptotic form 

P{t) exp[—Ap{Dt/TcY^‘^] . (6) 

for P{t). More precisely, we can take the logarithm of P{t) and divide out the 
leading y/i dependence to get 

A. < < A . {D'/py^^ 

v/?- {pWty/^-^ \2J (p2D^)l/6 ’ 

giving —[lnP{t)]/{p‘^Dty^‘^ — 4/y^. It is striking that the asymptotic 

form (6) depends only on the density p and diffusion constant D of the traps, and 
is independent of the diffusion constant D' of the particle. 

As an aside we note that, while the left-hand inequality in (7) holds for all 
t, since Eq. (3) does, the right-hand inequality is strictly a large t result. This is 
because the factor (4/7 t) exp(— tt^DT//^) in Eq. (4) comes from the lowest mode in 
the Fourier decomposition of the survival probability of the particle in the interval 
{—1/2, 1/2). This mode dominates for D't » which requires p'^D't 1. A lower 
bound on P{t) valid for all t can easily be written down by including all Fourier 
modes, but the large-t form (5) is sufficient for present purposes. A comparison 
of the predicted bounds with the numerical data of ref. [23] show that the data 
lie between the bounds [25, 26] (except at early times where lattice effects are 
important [24]). The approach to asymptotic behavior is slow, but consistent with 
the slow convergence of the bounds. A clearer understanding of the subdominant 
terms is necessary before a detailed comparison with numerical or experimental 
data can be made. 

The model can be generalized to include n diffusing particles starting from 
the same point [23]. The bounds can easily be generalized to this case [26]. The 
probability that all n particles survive at time t satisfies the same inequalities (7) 
except that the final term on the right-hand side acquires an additional factor 
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So the bounds converge fovt—^oo and the same asymptotic form, Eq. (6), 
is obtained for all n. 

This approach is also readily generalized to dimensions d > 1 [26]. For d < 
2 we find P{t) ~ exp(— where Xd = {2 p / it d) {A tt D)^ ^ sm{7rd/ 2) while, 
for d = 2, P{t) ^ ex.p[—\ 2 t/ hit] with A 2 = AiipD. The latter agrees with the 
functional form obtained in [17], but with a precise value for the constant A 2 . 
For d > 2, simple exponential decay is obtained, in agreement with [17], but the 
bounds no longer converge and the decay constant cannot be determined [26]. 

The optimal lengthscale I of the spatial region used to construct the lower 
bound behaves as I ~ ^(2-^)/(4-d) for 1 < d < 2, and as / ~ Ini for d = 2. The 
fact that this lengthscale is large for i ^ oc suggests a degree of universality in 
the results, e.g. it should not matter for the aisymptotics whether the problem is 
defined on a continuum or a lattice. Similarly the asymptotic expression for the 
upper bound is insensitive to small-scale details of the model: one obtains the same 
asymptotic behavior on the lattice and the continuum. For d > 2 this is no longer 
true: the optimal I tends to a constant determined by the range of interaction of 
the particle and traps (a nonzero interaction range is necessary for all d > 2, or 
the interaction rate is zero), and the two bounds no longer converge. We therefore 
expect the asymptotics to be non-universal for d > 2. The role of d = 2 as a kind of 
critical diiheneion for this problem is directly related to the recurrence of random 
walks imd < 2. 

We conclude this section by quoting some results for the ID trapping problem 
with a finite number of traps. In this case the particle’s survival probability decays 
asymptotically as a power-law, P{t) ~ t~^ , where the exponent ^ is a function of 
the numbers Nl and Nr of traps respectively to the left and right of the particle. 
The only analytical results are for N = Nl + Nr < 2 [15, 29], but one can obtain 6 
perturbatively for general Nr and Nr as a power series in the expansion parameter 
o;^ = D' /{D + (TV — 1)D') [25, 30]. To second order the result is 

e = ^ + -{N-A‘^)aN + \l{N-2){N-A^){TT-Sil-\n2))+2N{N-l)]a% (8) 

Z 7T 

where A = Nr— Nr. One noteworthy feature of this result is that for the symmetric 
case A = 0 (or, more generally, when A< yV), 6 increases with D' for small D' . 
This lends additional support to our assertion that setting D' = 0 should give an 
upper bound on the survival probability. 

5 Conclusion 

A number of examples of persistence, or first- passage, phenomena in spatially 
extended systems have been presented. In coarsening phenomena, the persistence 
usually decays as a power- law of the time, t, whereas in the trap model discussed 
in the latter part of this work it decays as a stretched exponential for d < 2 (and a 
simple exponential for d > 2 [17]). It is striking that in the trap model the leading 
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asymptotic decay of the particle’s survival probability is, for d < 2, independent 
of the diffusion constant, D' . By contrast, in the coarsening dynamics of the ID 
Potts model, for example, it depends in a non-trivial way on the ratio, D'/D, of 
the particle and trap diffusion constants [14]. In particular, the exponent 0{q), 
which characterizes the survival probability of a static particle (D' = 0) is smaller 
than the exponent (j){q) that describes the case D' = D [16]. In an attempt to gain 
some insight into why D' is important in some models and not in others we can 
employ the “toy model” of ref. [31]. 

In this model one considers a static particle (D' = 0) and moving traps. The 
density of traps can decrease with time, e.g. through annihilation or coalescence 
events, but in the toy model one treats the traps as disappearing randomly in such 
a way £is to mimic the time-dependence of the density, p(t), of the original model. 
In d = 1, for example, the survival probability of the particle is given by [31] 

Q(i) = exp 

For p{t) = p, a constant, this approach is, of course, exact and one recovers the 
result (3). If p{t) ~ for large t, however, (9) predicts a stretched exponential 
decay Q{t) ~ exp(— const for 0 < a < 1/2 and a power-law decay for 

a = 1/2, the latter result being in qualitative agreement (but with the wrong 
powers!) with the known power-law decay of persistence in, for example, ID Potts 
models. 

Consider, now, the stretched exponential regime and let the particle now 
move, with diffusion constant D'. Within the toy model one can exploit the same 
type of bounding arguments as employed in section 4. Eq. (9), corresponding to 
D' = 0, gives the intuitive upper bound, while the derivation of a rigorous lower 
bound follows the same pattern as Eq. (4), i.e. the result of the upper bound, 
Eq. (9) is multiplied by the factor exp[— (p/ + Tr^D't/^^)], which has to be maxi- 
mized with respect to the “box size” 1. As before, this second factor has the form 
exp[— const (D't)^/^], depending explicitly on D'. The convergence of the bounds 
for large t requires that this latter factor be subdominant with respect to the 
nominally dominant factor, Eq. (9). If p{t) ~ with a < 1/2, for large t this 
dominant factor decays, as we have seen, as Q{t) ~ exp(— constt^/^“^), so the 
relevant condition for the convergence of the bounds is a < 1/6. Within the toy 
model, therefore, the leading asymptotic form for the survival probability is def- 
initely independent of D' for a < 1/6, but not necessarily for a > 1/6. Although 
a better understanding of the D'-dependence is clearly desirable, the results from 
this very simple toy model are rather suggestive. 
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Passive Sliders on Growing Surface 
and (Anti-)Advection in Burger’s Flows 

Barbara Drossel and Mehran Kardar 



1 Introduction 

Passive scalar advection: Dust particles or dye or a hot fluid element entrained by 
a flow, but having no effect on the flow. 

Famous problem in turbulence theory. The advected scalar shows a complex 
scaling behavior that stands in no simple relation to the scaling behavior of the 
flow. 

Passive sliders; Particles sliding on stochastically growing surface, e.g., do- 
main walls, particles driven by gravity, . . . 

Noisy Burger’s Equation 



Compressible 
Advection of 

^ - aV(H + Cp 

(h and (pi white noise, delta- correlated 

Equivalent: KPZ equation for interface growth 



— -1- \{v • V)F = y = -Vh 

; vortex- free, 
particles of density o: 



Particles riding on the surface 



Scaling behavior 



X = —aVh -h Cx 



([h(x, t) — h{x t')Ÿ) ~ |x — X g 


t-t' 1 


_\x — x'\^\ 





2 + X — 2 

in d = 1: X = 1/2 (roughness exponent), z = Sl2 (dynamical critical exponent). 
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t-i' 



Our goal: find \p 

1. Renormalization group 

2. Computer simulations in Id 
Renormalization group in 1-loop 

Flow equations (with a = a/ \ and k = k/u): 



{p{x, t)p(x', t')} - {pf \x ~ x'f^'’ - 2g 



da 

dk 

~dl 

dDp 

~df 



3/2 + 



2{a — 0^)1 



k I 
D, 



^ (1 + A:)2 

. , g^(3-A:) 1 

Zp — 2\p — 1 + 2 



k{l-^k)\ 



Fixed points: 

a/ A > 0: attractive fixed point a = k = 1\ Here the equations are Galilei 
invariant (set x' = x — Xet, t' = t, h' = h ex). \p ~ 3/4. 

a/X < 0: flow goes to a = k = 0, with a‘^/k = 1/6, Xp = 2/3 for = z = 3/2. 
For Zp g] 3/2,2], nonzero fixed points exist. Particles cannot follow surface 
fluctuations. 

Computer simulations 





Choose a random surface site and add a particle if no overhangs are gener- 
ated. ^ KPZ growth. 

Choose the color of the new particle with probability (l+i^)/2 in accordance 
with the majority of the neighbors moving domain wall. 
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Particle düffusion 
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Nonuniversal exponent for a/\ < 0. 

Particle positions 

• a/X > 0 



Mean number of particles that are in the same bin as a given particle 
as measured divided by L 



10^ 



10 



3 



10^ 




10 " 10 ^ 10 ^ 10 ^ 
bin size 



10 ’ 10 ^ 
b insize 



L=4096 

L=2048 

L=1024 

L=512 

L=256 

L=128 



Almost all particles at the same site, p is not a smoothly varying quantity. 

Xp = 1/2. 
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• a/\ < 0 



Mean number of particles that are in the same bin as a given particie 
as measured divided by L^.3 




Not all particles can follow the height fluctuations. \p = 0.85. (For K < 1 
different exponents ^ no universality.) 

Particle velocities 

Fix the height difference between the two ends of the system and measure the 
mean velocity 
• a/A > 0, fixed L 



mean velocity for fixed L and different slope 
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• a/A < 0, fixed L 
velocity satisfies scaling law 

v = L~yWhC{VhVL) with j/ ~ 0.14 




X 



asymptotic behavior: 

C{x) ~ for large x. 

For sufficiently small V/i (< \j2\fL\. 

V = a(L)V(/i) 

with 

a(L) - L~^ 

and y - 0.14. (RG: y = 0). 

Relation to particle diffusion exponent Zp\ 

After time t: slope on length I ~ has not yet changed. Particle has moved 
a distance 

^ = 3/2(1 - y) 

For a/X> 0: y = 0 and Zp = 3/2. 

For a/A < 0: ^ 0.14 and Zp ~ 1.74. 
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Conclusions 

• Even in one dimension nontrivial scaling behavior 

• Fundamental difference between a/ A > 0 (Galilei-invariant case) and 
a/A < 0. 

• There seems to be nonuniversal scaling for the case a/A < 0. 

Literature: B. Drossel and M. Kardar, Phys. Rev. B. 66, 195914 (2002). 
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Conformai Spiral Multifractals 

Bertrand Duplantier 



Abstract. The exact joint multifractal distribution for the scaling and spiraling 
of electrostatic potential lines near any conformally invariant scaling curve is de- 
rived in two dimensions. Its spectrum /(a, A) gives the Hausdorff dimension of 
the points where the potential scales with distance r as ~ while the curve 
spirals logarithmically with a rotation angle (p = Alnr. It obeys the scaling law 
/(a, A) = (1 -H A^)/ (a) — hX^ with â = a/(l + A^) and h = (25 — c)!l2^ and where 
/(a) = f{a^ A = 0) is the pure harmonic measure spectrum, and c the conformal 
central charge. The results apply to 0{N) and Potts models, as well as to SLE^- 



1 Introduction 

The geometric description of the random fractals arising in Nature is a fascinating 
subject. Outstanding among these fractals is the class of random clusters or curves 
arising in equilibrium critical phenomena, which are associated with fundamental 
ideas of scale invariance. In particular, in two dimensions (2D), statistical systems 
at their critical point are expected to produce conformally invariant (Cl) fractal 
structures [1], with a Gibbs equilibrium weight invariant under all planar confor- 
mal maps. This leads to a universal random geometry. Prominent examples are 
the continuum scaling limits of random walks (RW), i.e.. Brownian motion, self- 
avoiding walks (SAW), and critical percolation, Ising or Potts clusters. A wealth 
of exact methods has been devised for their study: Coulomb gas [2], conformal 
field theory (CFT) [3], and quantum gravity methods [4, 5, 6, 7, 8]. Recently, in 
parallel to the “cascade structure” found by conformal invariance of Brownian 
motion [9], a natural “multifractal structure” has been found from 2D quantum 
gravity [10]. It was developed for self-avoiding walks [11] and percolation clusters 
[12], (see also [13, 14, 15]), and finally for random curves with arbitrary central 
charge c, the parameter labeling the universality class of the underlying CFT [16]. 
Rigorous probabilistic methods have also been developed, with the landmark intro- 
duction of the Stochastic Lowner Evolution (SLE) process, which directly mimics 
the wandering of critical cluster boundaries in the scaling limit [17]. 

A refined way of accessing the random geometry of conformally invariant 
scaling curves is provided by classical potential theory of electrostatic or diffusion 
fields near these random fractal boundaries, whose self-similarity is refiected in 
a multifractal (MF) spectrum describing the singularities of the potential, also 
called harmonic measure. The first cases addressed were those of RW’s or SAW’s 
in dimension d = A—e [18, 19]. In two dimensions, the first exact examples obtained 
from the quantum gravity approach appeared for the universality class of random 
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or self- avoiding walks [11] and percolation clusters [12], which all possess in 2D the 
same harmonic MF spectrum (see also [9, 13, 14, 15]). The general solution for the 
potential distribution near any Cl fractal in 2D, obtained in [16], depends only on 
the central charge (see also [20, 21]). This solution can be generalized to higher 
multifractal correlations, like the joint distribution of potential on both sides of a 
simple scaling path [22, 23]. 

On another hand, the study of the SLE process is presently a very active 
field of research [24, 25, 26, 27, 28, 29, 30, 31] (see also the contribution [32] in 
this volume) . It is intimately related to the quantum gravity approach [33] . 

An important question concerns the geometry of the equipotential lines near 
a random (Cl) fractal curve. They are expected to rotate wildly, or wind, in a 
spiraling motion, that closely follows the boundary itself. The key geometrical 
object is here the logarithmic spiral, a structure which is conformally invariant. 
The MF description should generalize to a mixed multifractal spectrum, accounting 
for both scaling and winding of the equipotentials [34]. 

The exact solution to this mixed MF spectrum for any random Cl curve has 
been recently obtained in a collaboration with Ilia Binder in [35]. In particular, it 
has been shown to be related by a scaling law to the usual harmonic MF spectrum. 
A rigorous derivation from the SLE formalism is possible [36]. 

In these Proceedings, we describe the multifractal properties of the scaling 
and spiraling of random curves, as seen by the harmonic measure. The universal 
description is given in terms of the central charge c of the underlying conformal 
field theory. The results apply to the critical 0{N) loop model, or to the EP’s 
of critical Fortuin-Kasteleyn (FK) clusters in the Q-Potts model, all described in 
terms of Coulomb gas with some coupling constant g [2] . SLE^ paths also describe 
cluster frontiers or hulls. One has the correspondence k, = A/g, with a central 
charge 

c = (3 - 2g){3 - 2g') = ^(6 - «)(6 - «'), (1) 

symmetric under the duality gg' = 1 or k,k,' — 16 [16, 23, 33]. 

In the following sections, we present the universal description of multifractal 
functions for arbitrary conformally- invariant curves. For several simple paths we 
also define higher multifractal spectra, which include the spiraling rate. A more 
complete description can be found in [33]. 

2 Conformally Invariant Frontiers and Quantum Gravity 

2.1 Harmonic Measure and Potential near a Fractal Frontier 

2.1.1 Local Behavior of the Potential 

Consider a single (conformally- invariant) critical random cluster, generically called 
C. Let H {z) be the potential at an exterior point 2 : G C, with Dirichlet boundary 
conditions H(w ^ dC) = 0 on the outer (simply connected) boundary dC oî C, 
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(or frontier T dC) , and 7i{w) = 1 on a circle “at co”, i.e., of a large radius 
scaling like the average size R of C. As is well-known [37], ^{z) is identical to 
the harmonic measure of the circle “at oo” seen from i.e, the probability that a 
random walker (more precisely, a Brownian motion) launched from z, escapes to 
oo without having hit C. 

The multifractal formalism [38, 39, 40, 41] characterizes subsets dCa of 
boundary sites by a Holder exponent ct, and a Hausdorff dimension / (ct) = 
dim (dCa), such that their potential locally scales as 

n{z-^we dCa) ^ {\z - w\/Rf , (2) 

in the scaling limit a<^r = \z — w\<^R (with a the underlying lattice constant 
if one starts from a lattice description before taking the scaling limit a 0). 

2.1.2 Equivalent Electrostatic Wedge Angle 

In 2D the complex potential (p{z) (such that the electrostatic potential Hiz) = 
^(f{z) and the field’s modulus |E( 2 ;)| = \ip'{z)\) for a wedge of angle 9, centered at 
w, is 

ip{z) = {z-wf^^. (3) 

By eq. (2) a Holder exponent a thus defines a local equivalent “electrostatic” angle 
9 = 7t/ a, and the MF dimension f{9) of the boundary subset with such 9 is 

f{9) = /(a = 7T/0). (4) 



2.1.3 Moments 

It is convenient to define the harmonic measure H{w^r) = H{dC H B{w,r)) in a 
ball B{w,r) of radius r centered ax w e dC, as the probability that a Brownian 
path started at infinity first hits the frontier T — dC inside the ball B{w,r). It is 
the integral of the Laplacian of potential H in the ball B{w, r), i.e,, the boundary 
charge in that ball. It thus scales as with the same exponent as in (2). 

Of special interest are the moments of H, averaged over all realizations of C, 
and defined as 




where points w G dC/r are the centers of a covering of the frontier dC by balls of 
radius r, and form a discrete subset dC/r C dC. The moment order n can be a 
real number. In the scaling limit, one expects these moments to scale as 

, (6) 

where the multifractal scaling exponents r (n) vary in a non-linear way with n 
[38, 39, 40, 41]. They obey the symmetric Legendre transform r (n) -h / {a) = an, 
with n — f' (o) ,a = r' (n). By normalization: r(l) = 0. Because of the ensemble 
average (5), values of / (a) can become negative for some domains of a [18, 42]. 
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2.2 Calculation of Exponents from Quantum Gravity 

2.2.1 Brownian Representation of Moments 

A collection of n independent RW’s, or Brownian paths B in the scaling limit, 
started at the same point a distance r away from the cluster’s hull’s frontier 
dC^ and diffusing without hitting dC^ give a geometric representation of the 
moment, in eq.(5) for n integer. Convexity yields the analytic continuation to 
arbitrary n’s. Let us introduce the notation Af\B for two random sets required to 
traverse, without mutual intersection, the annulus T> (r, R) from the inner boundary 
circle of radius r to the outer one at distance R, and A V B for two independent, 
thus possibly intersecting, sets [11]. With this notation, one can define a grand 
canonical partition function which describes the star configuration of the Brown- 
ian paths and cluster: dC An := dC A {yB)^. At the critical point, it is expected 
to scale for r/iZ ^ 0 as 



ZR{dCAn)^{r/RA”^+-, (7) 

where the scaling exponent 

X (n) := X {dC A n) (8) 

depends on n and is associated with the conformal operator creating the star 
vertex dC A n. The dots after exponent x{n) express the fact that there may be an 
additional contribution to the exponent, independent of n, corresponding to the 
entropy associated with the extremities of the random frontier [11]. 

By normalization, this contribution actually does not appear in the multifrac- 
tal moments. Since H is a, probability measure, the sum (5) is indeed normalized 
as 

= 1, (9) 

or in terms of star partition functions: 

Z, = Zk {dC A n) IZr {dC A 1) . (10) 

The scaling behavior (7) thus gives 

( 11 ) 

The last exponent actually obeys the identity x(l) = x{dC A 1) = 2, which will 
be obtained directly, and can also be seen as a consequence of Gauss’s theorem in 
two dimensions [18]. 

Owing to eqs. (6) (11) we simply get 



T (n) = x{n) — X (1) = X (n) — 2. 



(12) 
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2.2.2 Quantum Gravity 

To calculate these exponents, we use the fundamental mapping between the confor- 
mal field theory, describing a critical statistical system in the plane C or half-plane 
H, and the same CFT on a random planar surface, i.e., in presence of quantum 
gravity (QG) [5, 6]. Two universal functions U and V, which now depend on the 
central charge c of the CFT, describe the KPZ map between conformal dimensions 
A in bulk or boundary QG and those in the standard plane or half-plane: 

A — 'y 1 A^ — 

C/(A):=A-^, V{A) = -—^, (13) 

1 — 7 41 — 7 

with 

F(A):=C/Q(A + 7)). (14) 

The parameter 7 is the string susceptibility exponent of the random 2D surface (of 
genus zero) , bearing the CFT of central charge c [5] ; 7 is the solution of 

c = 1 — 67^(1 — 7)“\ 7 < 0. (15) 

The function U maps quantum gravity conformal weights, whether in the bulk 
(A) or on a boundary (À), into their counterparts in C, rr = 2 f/(A), or in M, 
X = U{A). The function V has been tailored to map quantum gravity boundary 
dimensions A to the corresponding conformal dimensions x = 2V{A) in the full 
plane C. 

The positive inverse function of U, is 

(æ) = i (V4(l - 7 ) 2 ; + 7 ^ + 7 ) , (16) 

and transforms conformal weights of a conformal operator in C or HI into the 
conformal weights of the same operator in quantum gravity, in the bulk or on the 
boundary. 

2.2.3 Boimdary Additivity Rules 

Consider two arbitrary random sets A, with boundary scaling exponents x (A) , 
X {B) in the half-plane H with Dirichlet boundary conditions. When these two sets 
are mutually- avoiding, the scaling exponent x{A A B) in C, as in ( 8 ), oi x {A A B) 
in H have the universal structure [ 11 , 12 , 16] 

x{AaB) = 2V [U~^ {x (A)) U~^ {x {B))] , (17) 

x{AaB) = U [U~^ {x {A)) ^ U~^ {x {B))] . (18) 

These fundamental relations generalize the “cascade relations” found in [9] for 
Brownian paths. They are established in the general case in [33]. U~^ (x) is, on the 
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Figure 1: Illustration of the additivity rule (20): each of the two non- intersecting 
strands of a simple random path lives in its own sector of the random disk near 
the Dirichlet boundary. 



random disk with Dirichlet boundary conditions, the boundary scaling dimension 
corresponding to x in the half-plane IHI, and in eqs. (17) (18) 

U~^{x{AaB)) = U~^ {i{A)) + U-^ {£{B)) (19) 

is a linear boundary exponent corresponding to the fusion of two “boundary oper- 
ators” on the random disk, under the Dirichlet mutual avoidance condition AaB. 
This quantum boundary conformal dimension is mapped back by V to the scaling 
dimension in C, or by 1/ to the boundary scaling dimension in IHI [16] (see [33]). 

2.2.4 Exponent Construction 

For determining the harmonic exponents x{n) (8), we use (17) for A = dC and 
B = 

• We first need the boundary (conformal) scaling dimension (b.s.d.) X 2 x(dC) 
associated with the presence of the random frontier near the Dirichlet boundary 
H. Since this frontier is simple, it can be seen as made of two non-intersecting 
semi- infinite strands (Fig. 1). Its b.s.d. in quantum gravity thus obeys (19) 

t/-i(x2)=2C/-i(xi), (20) 

where x\ is the boundary scaling dimension of a semi-infinite frontier path origi- 
nating at the boundary H. 

• The packet of n independent Brownian paths has x ((VB)^) = n, since x {B) = 1. 

• From (19) the QG boundary dimension of the whole set is (see Fig. 2): 

A [x {dC A n)] = 2U~^ (xi) + U~^ (n) . 



(21) 
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Figure 2: The quantum gravity construction (20) (21) of exponents (22). 

Its associated QG bulk conformal dimension is therefore A = ^(A + y). Prom eqs. 
(14) or (17) we finally find 

x{n) = 2U{A) = 2V{A) 

= 2V [2U~^ (xi) + U~^ (n)] . (22) 

The whole construction is illustrated in Fig. 2. 

• The value of the QG b.s.d. of a simple semi-infinite random path is 

= ^( 1 - 7 )- 



(23) 
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It is derived in [33] from the exponents of the 0{N) model, or of the SLE. It can 
be directly derived from Makarov’s theorem: 

rjnr 

ot(n = 1) = r'(n = 1) = -3— == 1) = 1, (24) 

an 

which, applied to (22), leads to the same result. We thus finally get 

x{n)=2V{l-'f + U-^ (n)) = 2f/ Q + (n)) . (25) 

This result satisfies the identity: x(l) = 21/(1) = 2, which is related to Gauss’s 
theorem, as mentioned above. 



2.2.5 Multifractal Exponents 

• The multifractal exponents r(n) (12) are obtained from (13-16) as [16] 

r (n) - x{n) — 2 

= - 1) + - l)n + 7^ - (2 - 7)1- (26) 

Similar exponents, but associated with moments taken at the tip, later appeared 
in the context of the SLE process (see II in [43], and [44]; see also [21] for Laplacian 
random walks, and [15] for SAW’s.) More multifractal exponents can be found for 
the SLE in [33]. 

It is convenient to express the results in terms of the central charge c as: 

• Multifractal Exponents 



1 . 25 - c / A 


r (n) = - 


24 IV 




1-c \ 


n G 


[” = 24 



24n + 1 — c 
25 -c 




(27) 



• Multifractal Spectrum 



a 



dr 1 1 I 25^^^ 

dn “ 2 2V 24n + 1 -c’ 



(28) 



/H = 



€ 





(29) 



a 




Vol. 4, 2003 Conformal Spiral Multifractals 



S409 



2.3 Geometrical Analysis of Multifractal Spectra 

2.3.1 Makarov’s Theorem 

The exponents r(n) satisfy, for any c, r'(n = 1) = 1, or equivalently f(a = 1) = 1, 
i.e., Makarov’s theorem [45], valid for any simply connected boundary curve. 



2.3.2 Symmetries 



It is interesting to note that the general multifractal function (29) can also be 
written as 



f{a)-a = 



25 -c 
24 




(2a - 1 + 



2a -1 



(30) 



The multifractal functions f {a)— a — f{0) — ^ thus possess the invariance property 
upon substitution of primed variables given by 

27t = 0 6' = — I — (31) 
a Œ 

this corresponds to the complementary domain of the wedge 0. This basic sym- 
metry, first observed [46] for the c = 0 result of [11], is valid for any conformally 
invariant boundary. 



2.3.3 Equivalent Wedge Distribution 



The geometrical multifractal distribution of wedges 0 along the boundary takes 
the form: 



/W = /(î) 



7T 25 — c (tt — 0)“^ 

9 12~ 9{27t-9)' 



(32) 



Remarkably enough, the second term also describes the contribution by a wedge 
to the density of electromagnetic modes in a cavity [47]. The simple shift in (32), 
25 ^ 25 — c, from the c = 0 case to general values of c, can then be related [48, 23] 
to results of conformal invariance in a wedge [49] . 



2.3.4 Hausdorff Dimension of the External Perimeter 

The maximum of f{a) corresponds to n = 0, and gives the Hausdorff dimension 
Dep of the support of the measure, i.e., the accessible or external perimeter^ a 
simple curve without double points which may differ from the full hull [50, 13], as: 

Z?EP = sup„/(a) = /(a(n = 0)) (33) 

= \~Y^VÏ^c{sf^c-Vl^c). (34) 

This corresponds to a typical singularity exponent à = o(0) = (3 — 2 Dep)”\ and 
to a typical wedge angle 9 — — 7t(3 — 2Def)> 
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Figure 3: Universal harmonie multifractal spectra /(a). The curves are indexed by 
the central charge c, and correspond to: 2D spanning trees {c= —2); self- avoiding 
or random walks, and percolation (c = 0); Ising clusters or Q = 2 Potts clusters 
{c = N = 2 loops, or Q = 4 Potts clusters (c = 1). The maximal dimensions are 
those of the accessible frontiers. The left branches of the various f{a) curves are 
largely indistinguishable, while their right branches split for large a, corresponding 
to negative values of n. 



2.3.5 Universal Multijfractal Data 

In Figure 3 we display the multifractal functions /, eq. (29), corresponding to 
various values of — 2 < c < 1, or, equivalently, to a number of components N G 
[0, 2], and Q e [0,4] in the 0{N) or Potts models. The /(a) functions distinguish 
the various universality classes, especially for negative n, i.e. large a. 

2.3.6 Needles 

The singularity at a; = or ^ = 27 t, in the multifractal functions /, or /, corre- 
sponds to boundary points with a needle local geometry, and Beurling’s theorem 
[51] indeed insures that the Holder exponents a are bounded below by This 
corresponds to large values of n. 

2.3.7 Fjords 

The right branch of f (ot) has a linear asymptote 

lim / (a) /a = n* — — (1 — c)/24. 

a— >oo 



(35) 
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The a ^ oo behavior corresponds to moments of lowest order n — » n*. This 
describes almost inaccessible sites: Define {H) as the number of boundary sites 
having a given probability H to be hit by a RW starting at infinity; the MF 
formalism yields, for i/ ^ 0, a power law behavior 

(36) 

with an exponent 1 + n* == < 1. 



2.3.8 Ising Clusters 

A critical Ising cluster (c = ^) possesses a multifractal spectrum with respect to 
the harmonic measure: 



r(n) = ^{n-1)-^ J^{V^Sn-\-l-7) , 



fie.) = 



96 



3- 



1 



2a- 1 



a 

48’ 



a G ( 2 ’+^ 



(37) 

(38) 



with the dimension of the accessible perimeter Dep = sup^/(a,c = ^) = 



2.3.9 Q = 4t Potts Clusters, and “ultimate Norway” 

The limit multifractal spectrum is obtained for c = 1 , which is an upper or lower 
bound for all c’s, depending on the position of a with respect to 1: 

/(a,c<l) < /(a,c = 1), 1 < a, 

/(a = l,c) = 1, Vc, 

/(a,c<l) > /(a,c = 1), a < 1. 

This MF spectrum provides an exact example of a left-sided MF spectrum, with an 
asymptote f {a ^ oo,c = 1) = | (Fig. 3). It corresponds to singular boundaries 
where /(^— ^0,c=l) = | = Dep, i.e., where the external perimeter is everywhere 
dominated by ^^fjords’\ with typical angle ^ = 0. It is tempting to call it the 
“ultimate Norway”. 

The frontier of a Q = 4 Potts Fortuin-Kasteleyn cluster, or the SLE ^^4 
provide such an example for this left-handed multifractal spectrum (c = 1). The 
MF data are: 

r(n) = + 

with accessible sites forming a set of Hausdorff dimension 

3 

Dep = sup„/(a,c= 1) = 2 ' 



(39) 

(40) 



(41) 
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The external perimeter which bears the electrostatic charge is a non-intersecting 
simple path. We therefore arrive at the striking conclusion that in the plane, a 
conformally-invariant scaling curve which is simple has a Hausdorff dimension at 
most equal to Dep = 3/2 [16]. The corresponding Q = A Potts frontier, while still 
possessing a set of double points of dimension 0, actually develops a logarithmically 
growing number of double points [52]. 

2.3.10 Dimensions of Potts Hulls and Frontiers 

As an application of the above results, let us give the various dimensions appearing 
along Potts cluster boundaries. These dimensions have been recently beautifully 
checked numerically [53]. 



Q 


0 


1 


2 


3 


4 


C 


-2 


0 


1/2 


4/5 


1 


Dep 


5/4 


4/3 


11/8 


17/12 


3/2 


Dh 


2 


7/4 


5/3 


8/5 


3/2 


Dsc 


5/4 


3/4 


13/24 


7/20 


0 



Table 1: Dimensions for the critical Q-state Potts model; Q = 0, 1,2 correspond 
to spanning trees, percolation and Ising clusters, respectively. 



3 Higher Universal Multifractal Spectra 
3.1 Double-Sided Spectrum: Definition 

We consider here the specific case where the fractal set C is a (conformally- 
invariant) simple scaling curve, that is, it does not contain double points. The 
frontier dC is thus identical with the set itself: dC — C. Each point of the curve can 
then be reached from infinity, and we address the question of the simultaneous 
behavior of the potential on both sides of the curve. Notice, however, that one 
can also address the case of non-simple random paths, by concentrating on the 
double-sided potential near cut-points, as for instance in Brownian paths [33]. 
The potential H scales as 

[z G dCot.cc') ~ I2: — (42) 

when approaching w on one side of the scaling curve, while scaling as 

H- [z w~ e dCoc,oi>) ~ I2: - , (43) 

on the other side. The multifractal formalism now characterizes subsets Ca^oc' of 
boundary sites w with two such Holder exponents, a, o', by their Hausdorff di- 
mension /2(a,o;') := dim(Ca ct')- The standard one-sided multifractal spectrum 
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f{a) is then recovered as the supremum: 

/(a) = supc '/2 {a, a') . 



(44) 



3.2 Explicit Higher Spectra 

3.2.1 Multiple- Sided Spectra 

One can consider a star configuration Sm of a number m of similar simple scal- 
ing paths, all originating at the same vertex w, and the joint harmonic measure 
moments in each branch. These higher multifractal spectra require some further 
interpretation as associated with the scaling behavior of densities of points, as they 
yield negative dimensions for high enough m [23]. 

Their evaluation for an m-arm star gives the explicit formula in terms of the 
central charge c [23]: 



fm ({o^2=l,...,m}) 



25 - c 1 2 

~ S(1 - 



1 

2 




-1 



1 - c 
24 



m 






(45) 



1=1 

where the central charge c and the parameter 7 are related by eq. (15). The domain 
of definition of the poly-multifractal function / is independent of c and given by 




i=l 



(46) 



3.2.2 One and Two-Sided Cases 



The m = 1 case 



corresponds to the potential in the vicinity of the tip 
scaling path, and naturally differs from the usual f{a) spectrum, which describes 
the potential on one side of the scaling path. 

The two-sided case mentioned in section 3.1 is obtained for m = 2 as 



1 -c 
24 ^ 



(47) 



of a conformally-invariant 



/2 {a, a') 



25 -c 1 \ \ fl 1 '\1 ‘ 

12 2(1 -7) [ ~ 2 Va a ') 



1 — c 
24 



(a + a') . 



(48) 



This doubly multifractal spectrum possesses the desired properties, like (44). 
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4 Spiraling of Conformally Invariant Cmves 

Another important question arises concerning the geometry of the equipotential 
lines near a random (Cl) fractal curve. These lines are expected to rotate wildly, 
or wind, in a spiraling motion that closely follows the boundary itself. The key 
geometrical object is here the logarithmic spiral^ which is conformally invariant 
(Fig. 4). The MF description should generalize to a mixed multifractal spectrum. 




Figure 4: A double logarithmic spiral mimicking the local geometry of the two 
strands of the conformally- invariant frontier. 

accounting for both scaling and winding of the equipotentials [34]. 

In this section, we describe the exact solution to this mixed MF spectrum for 
any random Cl curve [35]. In particular, it is shown to be related by a scaling law 
to the usual harmonic MF spectrum. We use the same conformal tools as before, 
fusing quantum gravity and Coulomb gas methods, which allow the description 
of Brownian paths interacting and winding with Cl curves, thereby providing a 
probabilistic description of the potential map near any Cl random curve. 

4.1 Harmonic Measure and Rotations 

As above, consider a single critical random cluster C in the scaling limit. Let ^{z) 
be the potential at an exterior point 2 ; G C, with Dirichlet boundary conditions 
7i {w dC) = 0 on the outer boundary dC of C, and ^{w) = 1 on a circle “at 00 ” , 
i.e., of a large radius scaling like the average size R of C. 

Let us now consider the degree with which the external boundary curve (which 
is simply connected and conformally invariant) winds in the complex plane about 
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the boundary point w e dC. Let us generically call (p[z) — arg {z — w). In the scaling 
limit, the multifractal formalism, here generalized to take into account rotations 
[34], characterizes subsets dCa,x of boundary sites by a Holder exponent a, and a 
rotation rate A, such that their potential lines respectively scale and logarithmically 
spiral as 



n{z w e dCa,x) ^ 

(p (z ^ w ^ dCa,x) ~ Alnr, (49) 

in the limit r = \z — w\ ^ 0. The Hausdorff dimension dim(^Ca^A) = f 
defines the mixed MF spectrum, which is Cl since under a conformal map both a 
and A are locally invariant. 

As above, we consider the harmonic measure H (u;,r), which is the integral 
of the Laplacian of 7Y in a disk B{w^r) of radius r centered at ic G dC^ i.e., the 
boundary charge in that disk. It scales as r^ with the same exponent as in (49), 
and is also the probability that a Brownian path started at large distance R first 
hits the boundary at a point inside B{w, r). Let p(w, r) be the associated winding 
angle of the path down to distance r from w. The mixed moments of H and 
averaged over all realizations of C, are defined as 

^n,p=l y] /f"(w,r)exp(pv5(w,r))\ « (50) 

\wedC/r / 

where the sum runs over the centers of a covering of the boundary by disks of 
radius r, and where n and p are real numbers. As before, the moment of 
H (w^r) is the probability that n independent Brownian paths diffuse along the 
boundary and all first hit it at points inside the disk B{w, r). The angle p{w^r) is 
then their common winding angle down to distance r (Fig. 5). 

The scaling limit in (50) involves multifractal scaling exponents r (n, p) which 
vary in a non-linear way with n and p. They give the multifractal spectrum / (a, A) 
via a symmetric double Legendre transform: 

a 

/(a, A) 

n 

Because of the ensemble average (50), / (a. A) can become negative for some o, A. 



dr 

= an -h Ap — r (n, p) , 



(51) 
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Figure 5: Two-sided boundary curve dC and Brownian n- packet winding together 
from the disk of radius r up to distances of order as measured by the winding 
angle (p{w,r) = aig{dC A n) as in (50) and in (58). 



4.2 Exact Mixed Multifractal Spectrum 



The 2D conformally invariant random statistical system is labelled by its central 
charge c, c < 1 [1]. The main result is the following exact scaling law [35]: 



/(«,A) 

b 




(52) 



where / {a) = / (a, A = 0) is the usual harmonic MF spectrum in the absence of 
prescribed winding, first obtained in [16], and described in section 2, eq. (29). It 
can be recast as: 

/(û) = « + (53) 

25 -C 



We thus arrive at the very simple formula for the mixed spectrum: 



f{a, A) 



a-\-h — 



ba^ 

2a - 1 - A2 ■ 



( 54 ) 



Notice that by conformal symmetry 



suPa/(o,A) = f{a,\ = 0), 
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i.e., the most likely situation in the absence of prescribed rotation is the same as 
A = 0, i.e. winding- free. The domain of definition of the usual f{a) (54) is 1/2 < a 
[16, 51], thus for A-spiraling points eq. (52) gives 

-(1 + A^) < a, (55) 

in agreement with a theorem by Beurling [51, 34]. 

We have seen in section 2.3 the geometrical meaning to the exponent a\ 
For an angle with opening 0, a = n/O, the quantity n/a can be regarded as a 
local generalized angle with respect to the harmonic measure. The geometrical 
MF spectrum of the boundary subset with such opening angle 0 and spiraling rate 
A reads from (54) 



/>,A) :=/(a = ^,A) 



7T 

0 



+ b- 



‘I 




As in (55), the domain of definition in the 6 variable is 
0<6><^(A), 6>(A) = 27r/(l + A^). 



The maximum is reached when the two frontier strands about point w locally 
collapse into a single A-spiral, whose inner opening angle is 0{\) [51]. 

In the absence of prescribed winding (A = 0), the maximum Dep ■= 
-C^Ep(O) == sup^/(o;, A = 0) gives the dimension of the external perimeter of the 
fractal cluster. Its dimension (34) reads in this notation 

1 1 , 95 — r 

DEP = -{l + b)--Vb(^), 

For spirals, the maximum value T>ep(A) = sup^/(a,A) still corresponds in 
the Legendre transform (51) to n = 0, and gives the dimension of the subset of 
the external perimeter made of logarithmic spirals of type A. Owing to (52) we 
immediately get 



Z?ep(A) = (1 + A2)Z?ep - 6A2 . (56) 

This corresponds to typical scaled values 

d(A)-(l + A2)d, é(A) -é/(l + A"). 

Since 6 > 2 and Dep < 3/2, the EP dimension decreases with spiraling rate, in a 
simple parabolic way. 

Fig. 6 displays typical multifractal functions /(o, A;c). The example cho- 
sen, c = 0, corresponds to the cases of a SAW, or of a percolation EP, the scal- 
ing limits of which both coincide with the Brownian frontier [11, 12, 14]. The 
original singularity at a = ^ in the rotation free MF functions /(o,0), which 
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Figure 6 : Universal multifractal spectrum /(a, A) for c = 0 (Brownian frontier, 
percolation EP and SAW), and for three different values of the spiraling rate A. 
The maximum /(3,0) = 4/3 is the Hausdorff dimension of the frontier. 



describes boundary points with a needle local geometry, is shifted for A 7 ^ 0 to- 
wards the minimal value (55). The right branch of / (a. A) has a linear asymptote 
lima ^+00 / (û^> A) /a = — (1 — c)/24. Thus the A-curves all become parallel for 
a +CXD, i.e., ^ ^ 0 “^, corresponding to deep fjords where winding is easiest. 

Limit multifractal spectra are obtained for c = 1, which exhibit exact exam- 
ples of left-sided MF spectra, with a horizontal asymptote f {a ^ + 00 , A; c = 1) = 
I — ^A^ (Fig. 7). This corresponds to the frontier of a Q == 4 Potts cluster (i.e., the 
SLEkz= 4 ), a universal random scaling curve, with the maximum value Dep =3/2, 
and a vanishing typical opening angle ^ = 0, i.e., the “ultimate Norway” where 
the EP is dominated by ''fjords’^ everywhere [16, 23]. Fig. 8 displays the dimension 
7 ^ep(A) as a function of the rotation rate A, for various values of c < 1 , corre- 
sponding to different statistical systems. Again, the c = 1 case shows the least 
decay with A, as expected from the predominance of fjords there. 

4.3 Conformal Invariance and Quantum Gravity 

We now give the main lines of the derivation of exponents r (n,p), hence /(a, A) 
[35]. As before, n independent Brownian paths B, starting a small distance r away 
from a point w on the frontier dC, and diffusing without hitting dC, give a geometric 
representation of the moment, of the harmonic measure in eq.(50) for 
integer n (Fig. 5). Convexity yields the analytic continuation to arbitrary n’s. 
Let us introduce an abstract (conformal) field operator ^dc/\n characterizing the 
presence of a vertex where n such Brownian paths and the cluster’s frontier diffuse 
away from each other in the mutually- avoiding configuration dC An [ 11 , 12 ]; to this 
operator is associated a scaling dimension x{n). To measure rotations using the 
moments (50) we have to consider expectation values with insertion of the mixed 
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Figure 7: Left-sided multifractal spectra /(a, A) for the limit case c = 1, the 
“ultimate Norway” (frontier of a <5 = 4 Potts cluster or SLE^= 4 ). 



operator 

— x(n,p), (57) 

where arg(^CAn) is the winding angle common to the frontier and to the Brownian 
paths (see Fig. (5)), and where x{n,p) is the scaling dimension of the operator 
It is directly related to r{n,p) by [11] 

X (n, p) = T (n, p) + 2. (58) 

For rz = 0, one recovers the previous scaling dimension 

x(n,p = (ï) — x{n), 

r{n,p = 0) = r{n) = x{n) — 2. 

For the purely harmonic exponents x(n), describing the mutually- avoiding set 
dC A n, we have seen in eqs. (25) and (20) that 

x{n) = 2V [2U~^ (xi) + U~^ (n)] . (59) 

In (59), we recall that the arguments Xi and n are respectively the boundary 
scaling dimensions (b.s.d.) (20) of the simple path Si representing a semi-infinite 
random frontier (such that A 5i), and of the packet of n Brownian paths, 

both diffusing into the upper half-plane H. The function U~^ transforms these 
half-plane b.s.d’s into the corresponding b.s.d. ’s in quantum gravity, the linear 
combination of which gives, still in QG, the b.s.d. of the mutually- avoiding set 
dC An. = (A<Si)^ An. The function V finally maps the latter b.s.d. into the scaling 
dimension in C. The path b.s.d. x\ (20) obeys U~^ (xi) = (1 — 7)/2. 
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Figure 8: Dimensions Dep(A) of the external frontiers as a function of rotation 
rate. The curves are indexed by the central charge c, and correspond respectively 
to: loop-erased RW (c = — 2 ;SLE 2 ); Brownian or percolation external frontiers, 
and self- avoiding walk (c = 0 ;SLEs/ 3 ); Ising clusters (c = ^;SLE 3 ); Q = 4 Potts 
clusters (c = 1; SLE 4 ). 



It is now useful to consider k semi-infinite random paths Si , joined at a single 

k 

vertex in a mutually- avoiding 5^ar configuration <S/c =Si A Si A — ' S\= (A<Si)^. (In 
this notation the frontier near any of its points is a two-star dC = S 2 -) The scaling 
dimension of Sk can be obtained from the same b.s.d. additivity rule in quantum 
gravity, as in (17) or (59) [16] 

= 2V[kU-^{xi)]. (60) 

The scaling dimensions (59) and (60) coincide when 

x(n) = x(Sk(n)) (61) 

Thus we state the scaling star- equivalence 

dC An Sk{n), (63) 

of two mutually- avoiding simple paths dC = S 2 = S\ A S\, further avoiding n 
Brownian motions, to k{n) simple paths in a mutually- avoiding star configuration 
Sk[n) (Fig. 9). This equivalence plays an essential role in the computation of the 
complete rotation spectrum (58). 





Vol. 4, 2003 



Conformal Spiral Multifractals 



S421 




U (J,) 



Figure 9: Equivalence (62) between two simple paths in a mutually- avoiding con- 
figuration «S 2 = <Si A «Si , further avoided by a packet of n independent Brownian 
motions, and k{n) simple paths in a mutually- avoiding star configuration <S/e(^). 



4.4 Rotation Scaling Exponents 

The Gaussian distribution of the winding angle about the extremity of a scaling 
path, like <Si, was derived in [54], using exact Coulomb gas methods. The argument 
can be generalized to the winding angle of a star Sk about its center [55], where 
one finds that the angular variance is reduced by a factor l//c^ (see also [56]). 
The scaling dimension associated with the rotation scaling operator 
is found by analytic continuation of the Fourier transforms evaluated there [35]: 

2 

x{Sk\p) = x{Sk) - 

1 — 7 /c^ 

i.e., is given by a quadratic shift in the star scaling exponent. To calculate the 
scaling dimension (58), it is sufficient to use the star-equivalence (62) above to 
conclude that 

2 

x{n,p) = x{Sk^n);p) = x{n) - 

which is the key to our problem. Using eqs. (62), and (59) gives the useful identity: 
i(l - ^)k^{n) = x(n) -2 + 6, 

with b = Recalling (58), we arrive at the multifractal result: 



r(n,p) = r(n) 



1 

4 r(n) -h 6’ 



(64) 



where r(n) = x(n) — 2 corresponds to the purely harmonic spectrum with no 
prescribed rotation. 




S422 



Bertrand Duplantier Ann. Henri Poincaré 



4.5 Legendre Transform 

The structure of the full r- function (64) leads by a formal Legendre transform (51) 
directly to the identity 

/(a, A) = (1 + A2)/(â) - 6A2, 

where f{a) = an — r(n), with â = dr{n)/dn, is the purely harmonic MF function. 
It depends on the natural reduced variable â à la Beurling (â G [|, +oo)) 

a dx 1 1 / b 

ITV ^ 2^2V2n + 6-2’ 

whose expression emerges explicitly from (59). Whence eq.(52), QED. 

4.6 Higher Mixed Rotation Spectra 

It is interesting to consider also mixed higher multifractal spectra including log- 
arithmic spiraling rates [57]. For a conformally- invariant scaling curve which is 
simple, i.e., without double points, like the external frontier dC^ here taken alone, 
define the universal function f 2 {a,a';X) which gives the Hausdorff dimension of 
the points where the potential varies jointly with distance r as on one side of 
the curve, and as on the other, given a local spiraling rate A of the curve. It 
satisfies the generalization of scaling relation (52) in terms of the scaled variables 
à = a/{l + A^), â' = a' /{I -h A^) 

/ 2 (a,a';A) = (1 -h A^)/ 2 (o;, o'; A = 0) - 6A^ . (65) 

= a + (66) 

where 

/2(a,a').«a,c';A = 0) * ‘ ^ 

+ (67) 

is the standard double-sided multifractal distribution (48) for the potential. The 
mixed double-sided multifractal function is thus 

~^^(a + a'). (68) 

This double multifractality can be generalized to higher ones [23], by considering 
the distribution of potential between the arms of a rotating star Sm- It can be 
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expressed via the scaled variables 



Oil 



as 



1 + A2’ 



/m({ai};A) = (1 + A^)/m ({ai}; A) - 6A^ 

in terms of the purely harmonic multifractal spectrum (45): 
fm{{ai}) = /m({aj};A = 0) 



8 ( 1 - 
6-2 



1 2 ^ 1 

-m 1 - > T — 

- 7) è 2a* 



-1 






This gives the following poly-multifractal result [57]: 



/m({«i};A) = 6- 



8(1-7) 

6-2 



1 ™ 1 
Y-L 

1 + A2 2oi 



• 






(69) 

(70) 



(71) 



(72) 



In conclusion, we have described general conformally- invariant curves in the 
plane in terms of the universal parameters c (central charge) or 7 (string suscep- 
tibility). The multifractal results described in the sections above thus also apply 
to the SLE^ process after substituting k for 7 or c (Eq. (1). Two dual values of 
K correspond to the same values of 7 or c. The reason is that we have considered 
geometrical properties of the boundaries which actually were self- dual An exemple 
is the harmonic multifractal spectrum of the SLE «;/>4 frontier, which is identical 
to that of the smoother (simple) SLE(i 6/^/)^^<4 path. So we actually need only 
the set of simple SLE traces with k < A. The mixed harmonic spectrum f(a, A), 
which can also be derived directly for the SLE [36], obeys the same duality prop- 
erty. When dealing with higher multifractality, we assumed the random curves to 
be simple, thus again k < 4or non simple random paths, quantum gravity rules 
have to be modified and lead to other multifractal spectra, as explained in detail 
in [33]. 

The gracious hospitality of the Mittag-Leffler Institute, during the pro- 
gramme Conformal Mappings directed by Peter Jones, and where the work on 
the spiral multifractals was initiated in collaboration with Ilia Binder, is gratefully 
acknowledged. 




S424 



Bertrand Duplantier Ann. Henri Poincaré 



References 

[1] A. A. Belavin, A. M. Polyakov, and A. B. Zamolodchikov, Nucl. Phys. B241, 
333 (1984). 

[2] See, e.g., B. Nienhuis, in Phase Transitions and Critical Phenomena^ edited 
by C. Domb and J. L. Lebowitz, (Academic, London, 1987), Vol. 11. 

[3] See, e.g., J. C. Cardy, in Phase Transitions and Critical Phenomena^ edited by 
C. Domb and J. L. Lebowitz, (Academic, London, 1987), Vol. 11 ; Lectures 
on Conformal Invariance and Percolation^ Chuo University, Tokyo (2001), 
math-ph /0 1030 1 8 . 

[4] V.A. Kazakov, Phys. Lett. A119, 140 (1986). 

[5l V.G. Knizhnik, A.M. Polyakov, and A.B. Zamolodchikov, Mod. Phys. Lett. 
A 3, 819 (1988). 

[6] F. David, Mod. Phys. Lett. A3, 1651 (1988); J. Distler and H. Kawai, Nucl. 
Phys. B321, 509 (1988). 

[7] B. Duplantier and I.K. Rostov, Nucl. Phys. B340, 491 (1990). 

[8] I. K. Rostov, Mod. Phys. Lett. A 4, 217 (1989); M. Gaudin and I. K. Rostov, 
Phys. Lett. B 220, 200 (1989). 

[9] G.F. Lawler and W. Werner, Ann. Probab. 27, 1601 (1999). 

[10] B. Duplantier, Phys. Rev. Lett. 81, 5489 (1998). 

[11] B. Duplantier, Phys. Rev. Lett. 82, 880 (1999). 

[12] B. Duplantier, Phys. Rev. Lett. 82, 3940 (1999). 

[13] M. Aizenman, B. Duplantier and A. Aharony, Phys. Rev. Lett 83, 1359 (1999). 

[14] G.F. Lawler and W. Werner, J. European Math. Soc. 2, 291 (2000). 

[15] J. L. Cardy, J. Phys. A 32, L177 (1999). 

[16] B. Duplantier, Phys. Rev. Lett. 84, 1363 (2000). 

[17] O. Schramm, Israel Jour. Math., 118, 221 (2000). 

[18] M.E. Cates and T.A. Witten, Phys. Rev. Lett. 56, 2497 (1986); Phys. Rev. 
A35, 1809 (1987). 

[19] C. von Ferber and Y. Holovatch, Europhys. Lett. 39, 31 (1997); Phys. Rev. E 
56, 6370 (1997). 

[20] G.F. Lawler, O. Schramm and W. Werner, Acta Math. 187, 275 (2001). 




Vol. 4, 2003 Conformal Spiral Multifractals 



S425 



[21] M. B. Hastings, Phys. Rev. Lett. 88, 055506 (2002). 

[22] B. Duplantier, in Fractals: Theory and Applications in Engineering^ M. 
Dekking et al eds., pp. 185-206 (Springer-Verlag, 1999). 

[23] B. Duplantier, J. Stat. Phys. 110, 691 (2003); arXiv:cond-mat/0207743. 

[24] G. F. Lawler, O. Schramm, and W. Werner, Acta Math. 188 (2002) (to ap- 
pear), arXiv:math. PR/0005295, Math. Res. Lett. 8, 401 (2001). 

[25] S. Rohde and O. Schramm, arXiv:math.PR/0106036. 

[26] G.F. Lawler, O. Schramm, and W. Werner, Electronic J. of Probability 7, 
2002, paper no. 2, arXiv:math.PR/0108211. 

[27] S. Smirnov, and W. Werner, Math. Res. Lett. 8, 729 (2001), 
arXiv:math.PR/0109120. 

[28] G.F. Lawler, O. Schramm, and W. Werner, arXiv:math. PR/0204277. 

[29] G.F. Lawler, O. Schramm, and W. Werner, arXivimath. PR/0209343. 

[30] M. Bauer, and D. Bernard, Phys. Lett. B 543, 135 (2002) [arXivimath- 
ph/0206028]; arXiv:hep-th/0210015; arXiv:hep-th/0301064. 

[31] R. Friedrich and W. Werner, arXiv:math.PR/0209382; 
ar Xiv : mat h-ph /030 1 0 1 8 . 

[32] J. C. Cardy, Conformal Invariance in Percolation, Self-Avoiding Walks, and 
Related Problems, this volume. 

[33] B. Duplantier, arXivimath- phys/0303034. 

[34] I. A. Binder, Harmonic Measure and Rotation of Simply Connected Planar 
Domains, PhD Thesis, Caltech (1998). 

[35] B. Duplantier and I. A. Binder, Phys. Rev. Lett. 89, 264101 (2002); 
arXiv:cond-mat/0208045. 

[36] I. A. Binder and B. Duplantier, in preparation. 

[37] S. Kakutani, Proc. Imper. Acad. Sci. (Tokyo) 20, 706 (1942). 

[38] B.B Mandelbrot, J. Fluid. Mech. 62 , 331 (1974). 

[39] H.G.E. Hentschel and I. Procaccia, Physica (Amsterdam) 8D, 835 (1983). 

[40] U. Frisch and G. Parisi, in Proceedings of the International School of Physics 
“Enrico Fermi”, course LXXXVIII, edited by M. Ghil (North-Holland, New 
York, 1985) p. 84. 




S426 



Bertrand Duplantier Ann. Henri Poincaré 



[41] T.C. Halsey, M.H. Jensen, L.R Kadanoff, I. Procaccia, and B.L Shraiman, 
Phys. Rev. A33, 1141 (1986). 

[42] M. Cates and J.M. Deutsch, Phys. Rev. A35, 4907 (1987); B. Duplantier and 
A. Ludwig, Phys. Rev. Lett. 66, 247 (1991); C. von Berber, Nucl. Phys. B490, 
511 (1997). 

[43] G. F. Lawler, O. Schramm, and W. Werner, Acta Math. 187, (I) 237, (II) 275 
(2001), arXivimath. PR/991 1084, arXiv:math.PR/0003156; Ann. Inst. Henri 
Poincaré PR 38, 109 (2002), arXiv:math.PR/0005294. 

[44] G. F. Lawler, ESI preprint 2001, An introduction to the stochastic Lôwner 
evolution, http: / /www. mat h. duke.edu/ ~jose/esi.html. 

[45] N. G. Makarov, Proc. London Math. Soc. 51, 369 (1985). 

[46] R. C. Bail, B. Duplantier, and T. C. Halsey, unpublished (1999). 

[47] R. Balian and B. Duplantier, Ann. Physics 112, 165 (1978), p.l83. 

[48] B. Duplantier and J. L. Cardy, private discussion (Aspen, 1999). 

[49] J. L. Cardy, Nucl. Phys. B 300, 377 (1988). 

[50] T. Grossman and A. Aharony, J. Phys. A 20, L1193 (1987). 

[51] A. Beurling, The collected works of Ame Beurling. Vol. 1, Contemporary 
Mathematicians, Birkhàuser Boston Inc., Boston, MA, 1989, Complex anal- 
ysis, Edited by L. Carleson, P. Malliavin, J. Neuberger and J. Wermer. 

[52] A. Aharony and J. Asikainen, arXiv:cond-mat/0206367. 

[53] J. Asikainen, A. Aharony, B. B. Mandelbrot, E. M. Rauch, J. -P. Hovi, 
ar Xi V : cond- mat /0212216. 

[54] B. Duplantier and H. Saleur, Phys. Rev. Lett. 60, 2343 (1988). 

[55] B. Duplantier, to be published. 

[56] B. Wieland and D. B. Wilson, to be published. 

[57] B. Duplantier and I. A. Binder, in preparation. 



Bertrand Duplantier 
Service de Physique Théorique 
Unité de recherche associée au CNRS 
CEA, Saclay 

F-91191 Gif-sur- Yvette Cedex 
France 




Ann. Henri Poincaré 4, Suppl. 1 (2003) S427 - S436 
© Birkhâuser Verlag, Basel, 2003 

1424-0637/03/01S427-10 ^ 

DOI i0.i007/s00023-003-0932-z I Annales Henri Poincaré 



Condensation Transitions in Nonequilibrium Systems 



M.R. Evans 



Abstract. Systems driven out of equilibrium can often exhibit behaviour not seen in 
systems in thermal equilibrium- for example phase transitions in one- dimensional 
systems. In this talk I will review several ‘condensation’ transitions that occur when 
a conserved quantity is driven through the system. Although the condensation is 
spatial, i.e. a finite fraction of the conserved quantity condenses into a small spa- 
tial region, useful comparison can be made with usual Bose-Einstein condensation. 
Amongst some one-dimensional examples I will discuss the ‘Bus Route Model’ where 
the condensation corresponds to the clustering together of buses moving along a 
bus- route. 



1 Introduction 

Broadly speaking, one can consider two types of nonequilibrium systems: those 
relaxing towards thermal equilibrium and those held far from thermal equilibrium 
e.g.hy the system being driven by some external field. In the latter case the steady 
state of the system will not be described by usual Gibbs-Boltzmann statistical 
weights rather it will be a nonequilibrium steady state. A natural way to construct 
a nonequilibrium steady state is to drive the system by forcing a current of some 
conserved quantity, for example energy or mass, through the system. Such systems 
are known as driven diffusive systems (DDS) [2]. 

In recent years the possibility of phase transitions and phase separation in 
one- dimensional nonequilibrium systems has been explored and some examples are 
by now well studied. To appreciate the significance one should recall the general 
dictum that in one-dimensional equilibrium systems phase ordering and phase 
transitions do not occur (except in the limit of zero- temperature, or with long 
range interaction) [3]. 

Let us briefly review work on one- dimensional phase transitions in driven 
systems. A very simple one-dimensional driven diffusive system is the asymmetric 
simple exclusion process (ASEP). Here particles hop in a preferred direction on 
a one-dimensional lattice with hard-core exclusion (at most one particle can be 
at any given site). Indicating the presence of a particle by a 1 and an empty site 
(hole) by 0 the dynamics comprises the following exchanges at nearest neighbour 
sites 



1 0 
0 1 



0 1 with rate 1 

1 0 with rate q 



( 1 ) 
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The open system was studied by Krug [4] and boundary induced phase tran- 
sitions shown to be possible. Specifically one considers a lattice of N sites where 
at the left boundary site (site 1) a particle is introduced with rate a if that site 
is empty, and at the right boundary site (site N) any particle present is removed 
with rate (3. Thus the dynamical processes at the boundaries are 

at site 10—^1 with rate a 

at site 1 0 with rate (3 . (2) 

These boundary conditions force a steady state current of particles J through the 
system. Phase transitions occur when limAr_^oo J exhibits non- analyt ici ties. The 
steady state of this system was solved exactly for the totally asymmetric case [5, 6] 
and more recently for the general q case [7, 8]. When q < I the phase diagram 
comprises three phases: a high-density phase where the current is controlled by a 
low exit rate (3 - one can think of this is queue of cars at a traffic light that doesn’t 
let many cars through; a low-density phase where the current controlled by a low 
injection rate a - think of this as a traffic light that does not let many cars onto an 
open road; a maximal-current phase where both a^f3 are high {a^(3 > {1 — q)/2) 
and the current is J = (1 — g)/4. Note that since increasing a and (3 doesn’t increase 
the current, the current is saturated. In the maximal current phase generic long- 
range correlations exist, an example being the decay of particle density from the 
left boundary to the bulk value 1/2 which is a power law ~ where x is 

distance from the left boundary. 

On the line a = /3 < {1 — q)/2 which separates the high and low density 
phases one finds coexistence between a region of low density in the left part of 
lattice and a region of high density on the right separated by a ‘shock’ where the 
density changes sharply over a microscopic distance. 

Perodic systems (i.e. a ring of sites) can also exhibit phase separation when 
inhomogeneities or defects are introduced. A very simple example is to introduce 
into the asymmetric exclusion process a ‘slow bond’ through which particles hop 
with a reduced rate. Then in the steady state one can obtain phase separation 
between a region of high density behind the slow bond and a region of low density 
in front of the slow bond. Moving defects {i.e. particles with dynamics different 
from that of the others) have also been considered and exact solutions obtained 
[11, 12, 13]. One can think of a slow agricultural vehicle on a country road with a 
large queue of cars behind it and open road in front of it. 

A further question is whether systems related to the hopping particle models 
described so far, but without inhomogeneities, can exhibit phase ordering. A very 
simple model was introduced in [14] comprising three species of conserved particles, 
amongst which all possible exchanges are allowed. However a key feature is that 
the dynamics has a cyclic symmetry i.e. A particles move preferentially to the left 
of B particles which move preferentially to the left of C particles which in turn 
move preferentially to the left of A particles. The model exhibits strong phcise 
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separation into pure domains oi A B C. Similar strong phase separation occurs in 
other related models [15]. 

A final class of transitions in one-dimensional hopping particle models is that 
involving spatial condensation, whereby a finite fraction of the particles condenses 
onto the same site. Examples include the appearance of a large aggregate in models 
of aggregation and fragmentation [16] and the emergence of a single flock in dynam- 
ical models of flocking [17]. We will analyze a simple example of a condensation 
transition which occurs in the zero-range process which we now define. 

2 The zero-range process 

The zero-range process was introduced by Spitzer [9] and recent applications and 
developments have been reviewed in [10]. We consider a one-dimensional lattice 
of M sites with sites labeled fx = 1 . . . M and periodic boundary conditions (more 
generally one can consider the zero-range process on a lattice of arbitrary dimen- 
sion). Each site can hold an integer number of indistinguishable particles. The 
configuration of the system is specified by the occupation numbers of each site 
fx. The total number of particles is denoted by L and is conserved under the dy- 
namics. The dynamics of the system is given by the rates at which a particle leaves 
a site fx (one can think of it as the topmost particle - see Figure la) and moves to 
the left nearest neighbour site fx—1. The hopping rates u{n) are a function of n the 
number of particles at the site of departure. Some particular cases are: if u{n) = n 
then the dynamics of each particle is independent of the others; if u{n) = const 
forn > 0 then the rate at which a particle leaves a site is unaffected by the number 
of particles at the site (as long as it is greater than zero). 

The important attribute of the zero-range process is that it has a steady 
state described by a product measure. By this it is meant that the steady state 
probability P({n^}) of finding the system in configuration {ni, ri 2 . . . tim} is given 
by a product of factors /(n^) 

1 ^ 

( 3 ) 

^ ' fJi=l 

Here the normalization Z{M, L) is introduced so that the sum of the probabilities 
for all configurations, with the correct number of particles L, is one. 

In the basic model described above, /(n) is given by 

^ 1 

fip) — TT “ 7 — ^ n>l 

uim) 

= 1 for n = 0 (4) 

To prove (3,4) one simply considers the stationarity condition on the proba- 
bility of a configuration (probability current out of the configuration due to hops 
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is equal to probability current into the configuration due to hops): 



0(n^,)u(n^)P{ni . . . . . . nx,) = 

^(n^)^(rz^+i+l)P(ni . . . n^-1, n^^+i+l . . - n^) . (5) 

The Heaviside function 0{n,j) highlights that it is the sites with n > 1 that allow 
exit from the configuration (Ihs of (5)) but also allow entry to the configuration 
(rhs of (5)). Equating the terms on both sides of (5) and cancelling common 
factors assuming (3), results in 



w(n^)/(n^-i)/(%) = u{n^+i + !)/(% - l)/(%+i + 1) 
This equality can be recast as 






/(n^+i + 1) 



'^(^/x)t 7 = u{n^^i + 1) — - p ~- constant 

JxP’iJi tj 

Setting the constant equal to unity implies 

/(«M - 1) 



f{np) = 



u{n^) 



(6) 

(7) 

(8) 



and iterating (8) leads to (4). 

a) “(3) 

o 
o o 



U(l) 

L O 

site 12 3 4 



b) 



u(l) 



u(3) 



u(l) 



1 

particle 



3 4 



Figure 1: Equivalence of zero range process and asymmetric exclusion process. 

We can easily generalize to consider an inhomogeneous system by which we 
mean the hopping rates are site dependent: the hopping rate out of site /i when it 
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contains particles is It is easy to check that the steady state is simply 

modified to 

where are given by 



/mW = 



IT - 

iii 

1 for 



for n > 1 



n = 0 



( 10 ) 



The proof is identical to that for the homogeneous case, with the replacement of 
u{rif,) by w^(n^) 

There exists an exact mapping from a zero-range process to an asymmetric 
exclusion process. This is illustrated in Figure 1. The idea is to consider the par- 
ticles of the zero-range process as the holes (empty sites) of the exclusion process. 
Then the sites of the zero-range process become the moving particles of the ex- 
clusion process. Note that in the exclusion process we have M particles hopping 
on a lattice of M -f L sites. A hopping rate in the zero range process u{m) which 
is dependent on m corresponds to a hopping rate in the exclusion process which 
depends on the gap to the particle in front. So the particles can feel each other’s 
presence and one can have a long-range interaction. 



3 Condensation Transitions 

We now proceed to analyze the steady states of form (9) and the condensation 
transition that may occur. The important quantity to consider is the normalization 
Z{M,L) as it plays the role of the partition sum. The normalization is defined 
through the condition 



M 

Z(M,L)= (11) 

ni,n2---riM M /i,= l 

where the Ô function enforces the constraint of L particles. The normalization may 
be considered as the analogue of a canonical partition function of a thermodynamic 
system. 

We define the ‘speed’ v as the average hopping rate out of a site 



V 



Z{M,L) ^ 

ni ,n2...nM /x 

Z{M,L-l) 

Z{M,L) 



M 

L)u{ni) 

^=1 



( 12 ) 
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where we have used (9,10). Note that (12) tells us that the speed is independent 
of site and thus may be considered a conserved quantity in the steady state of 
the system. In the totally asymmetric system considered in Section 2 the speed is 
equal to the current of particles flowing between neighbouring sites. The speed is 
a ratio of partition functions of different system sizes (12) and corresponds to a 
fugacity. 

We now use the integral representation of the delta function to write the 
partition function as 






n . 



n=i 



(13) 



where 

CO 

m—0 



For large M, L (13) is dominated by the saddle point of the integral 
of 2 : at the saddle point is the fugacity. The equation for the saddle 
to 



L 

M 




and the value 
point reduces 

(15) 



which, defining 0 = L/M^ can be written as 



II 


(16) 


In the thermodynamic limit, 




M — » 00 with L = (j)M , 


(17) 



where the density (j) is held fixed, the question is whether a valid saddle point value 
of z can be found from (16). We expect that for low (f) the saddle point is valid but, 
as we shall discuss, there exists a maximum value of 2 : and if at this maximum 
value the rhs of (16) is finite, then for large (16) cannot be satisfied. We now 
consider how condensation may occur in the inhomogeneous and the homogeneous 
case. 



3.1 Inhomogeneous case 



To give an idea of how a condensation transition may occur we consider the case 
for m > 0 i.e. the hopping rate does not depend on the number of 
particles at a site. is given by 




/mW = 



(18) 
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and the probability of occupancies {ni,ri 2 , . . . .um] is 

= . (19) 

The mapping to an ideal Bose gas is evident: the L particles of the zero-range 
process are viewed as Bosons which may reside in M states with energies 
determined by the site hopping rates: ex.p{—PE^) = l/u^. Thus the ground state 
corresponds to the site with the lowest hopping rate. The normalization Z{M, L) 
is equivalent to the canonical partition function of the Bose gas. We can sum the 
geometric series (14) to obtain and F'^ then taking the large M limit allows 
the sum over fj, to be written as an integral 

z 

(j) = / duV{u) (20) 

U — Z 

where V{u) is the probability distribution of site hopping rates with ïimin the lowest 
possible site hopping rate. Interpreting V{u) as a density of states, equation (20) 
corresponds to the condition that in the grand canonical ensemble of an ideal Bose 
gas the number of Bosons per state is (j). The theory of Bose condensation tells us 
that when certain conditions on the density of low energy states pertain we can 
have a condensation transition. Then (16) can no longer be satisfied and we have 
a condensation of particles into the ground state, which is here the site with the 
slowest hopping rate. 

A very simple example is to have just one ‘slow site’ i.e. u\ = p while the 
other M — 1 sites have hopping rates = 1 when /u > 1. Using the mapping 
to an exclusion process, this corresponds one slow particle i.e. agricultural vehicle 
example described earlier. One can show [11] that for a high density of particles in 
the zero range process (low density of particles in the corresponding asymmetric 
exclusion process) we have a condensate since site 1 contains a finite fraction of 
the particles. In the low density phase the particles are evenly spread between all 
sites. 

3.2 Homogeneous case 

We now consider the homogeneous zero-range process where the hopping rates 
u{n) are site independent. Then (14) is independent of pi and reads 

oo n 

= E n 

n— 0 m— 1 

The fugacity 2 must be chosen so that F converges or else we could not have 
performed (14). Therefore 2 : is restricted io z < I3 where we define j3 to be the 
radius of convergence of F[z). From (21) we see that f3 is the limiting value of the 



u{m)\ 



( 21 ) 
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u{m) i.e. the limiting value of the hopping rate out of a site for a large number 
of particles at a site. We interpret (16) as giving a relation between the density 
of holes (number of holes per site) and the fugacity z. The saddle point condition 
(16) becomes 



zF'{z) 



F{z) 



( 22 ) 



Given that the rhs of (22) is a monotonically increasing function of 2 : we deduce 
that density of particle increases with fugacity. However if at 2 : = /5, the maximum 
allowed value of z, the rhs of (22) is still finite then one can no longer solve for the 
density and one must have a condensation transition. Physically, the condensation 
would correspond to a spontaneous symmetry breaking where one of the sites is 
spontaneously selected to hold a finite fraction of the particles. 

Thus, for condensation to occur {i.e. when (f) is large enough for (22) not to 
have a solution for the allowed values of 2 :) we require 



lim 



F'{z) 
“0 F{z) 



< 00 . 



(23) 



We now assume that u{n) decreases uniformly to /3 in the large n limit as 



u(n) = p(l + C(n)) (24) 

where ({n) is a monotonically decreasing function. Analysis of the series 

00 r n 

FW) = + 

n =0 t m=l 

00 ^ n 

n exp < — ln[l + C(m 

n=0 t m—1 

reveals that the condition for condensation is simply that F'{j3) is finite and this 
occurs if u{n) decays to (3 more slowly than /3{1 + 2/n). (This is easiest to see by 
expanding In [1 + Q and approximating the sum over m by an integral in (25).) 

It is interesting to translate this result into the language of the exclusion 
process. In this context we can have condensation if the hop rate of a particle 
into a gap of size n decays as /^(l + 2/n) therefore there is an effective long range 
interaction. 



F'iP) = 




3.3 Bus route model 

As an example of this let us consider the ‘bus route model’ [18]. The model is 
defined on a Id lattice. Each site (bus-stop) is either empty, contains a bus (a con- 
served particle) or contains a passenger (non-conserved quantity). The dynamical 
processes are that passengers arrives at an empty site with rate A; a bus moves 
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forward to the next stop with rate 1 if that stop is empty; if the next stop contains 
passengers the bus moves forward with rate j3 and removes the passengers. 

The model thus defined has not been solved but simulations reveal two 
regimes. At high bus density the gaps between buses are evenly distributed. How- 
ever at low bus density there is a condensed regime where the lead bus has a large 
gap to the next bus in front of it with bus-stops full of passengers in between. The 
other buses have small gaps between them. Thus the buses form a jam of buses 
and after a long delay all arrive at a bus-stop at once. 

The bus route model can be related to the zero-range process by a mean-field 
approximation in which we integrate out the non-conserved quantity (passengers) . 
The idea is that a bus-stop, next to bus 1 say, will last have been visited by a bus 
(bus 2) a mean time ago oi n/v where n is the distance between bus 2 and bus 1 
and V is the steady state speed. Therefore the mean-field probability that the site 
next to bus 1 is not occupied by a passenger is exp(— Ar^/^’). From this probability 
an effective hopping rate for a bus into a gap of size n is obtained by averaging 
the two possible hop rates 1,0: 

u{n) = 0 {1 — 0) exp{—Xn/v) . (26) 

We can now see that this mean-field approximation to the bus-route model is 
equivalent to a homogeneous zero-range process discussed earlier. 

Since u{n) decays exponentially the condition for a strict phase transition 
in the thermodynamic limit is not met. However on any finite system for A suf- 
ficiently small, an apparent condensation will be seen. In the bus route problem 
this corresponds to the universally irritating situation of all the buses on the route 
arriving at once. 



4 Conclusion 

We have shown how the zero range process exhibits two kinds of condensation 
transition. One is due to having an inhomogeneous system i.e. we get condensation 
of particles onto the site with the slowest hopping rate. Although the condensation 
is spatial the mechanism is equivalent to Bose condensation in an Ideal Bose Gas. 

The other type of condensation occurs on a homogeneous systems and in- 
volves the spontaneous selection of a site onto which a finite fraction of the particles 
condense. Recently this condensation mechanism has been used to understand the 
existence or non-existence of phase separation in a general class of one dimensional 
driven systems [19, 20]. 
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How to Simulate Fluid Criticality: 

The simplest ionic model has Ising behavior 
but the proof is not so obvious! 



Michael E. Fisher* 



Extended Abstract 

The essence of simulating criticality in asymmetric fluid models is to discover ef- 
fective, unbiased flnite-size scaling methods that (i) recognize that both the critical 
temperature, Tc, and the critical density, pc^ are unknown and (ii) that can resolve 
‘nearby’ critical universality classes. To this end precise, focused simulations for a 
range of ‘box’ sizes, (in d dimensions), are imperative. 

Indeed, since a true noncl 2 LSsical (i.e., non-van-der-Waalsian) critical point 
cannot be realized in a finite system, some form of systematic extrapolation on 
the linear size, L, of a system is crucial to drawing reliable conclusions not only 
as regards the location of the critical point but - of greater interest theoretically 
- as to the nature of the criticality. A long standing problem [2] in this regard 
concerns the critical character of the simplest ionic model, the so-called Restricted 
Primitive Model (RPM), namely, a 1:1 hard sphere electrolyte (half the spheres 
of diameter a carrying charge -\-q, half —q ) [2, 4, 5]. As the title above indicates, 
recent work [5] has rather convincingly solved this problem: but new methods of 
analyzing simulation data had to be developed. 

In order to navigate the temperature-density, (T, p ) , plane various special loci 
were devised that, in the thermodynamic limit, L — > oc, all spring from the criti- 
cal point (Tc,Pc)- Specifically, the /c-loci are defined by the points of inflection of 
p^~^ vs. pressure isotherms [3, 5-8]. Studying the L-dependence of these loci and 
using appropriate unbiased finite-size extrapolation techniques led to estimates of 
unprecedented precision and, it is believed, comparable accuracy for the critical 
parameters of the hard-core (attractive) square- well fluid (HCSqW) [3]. In addi- 
tion, the data (which did not require a massive computational effort) confirmed 
the expected Ising {n—l) behavior, with critical exponents 7 = 1.23g, = O.63O3, 

and j3 — 0.32e (for the susceptibility, correlation length, and coexistence curve, 
respectively). Furthermore, and importantly, Ising character was clearly resolved 

^Active coworkers: Y.C. Kim, E. Luijten, G. Orkoulas and A.Z. Panagiotopoulos; 
support from the National Science Foundation under Grant No. CHE 99-81772. 
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from the ‘nearby’ XY (n = 2) or self- avoiding walk (SAW, n = 0) universality 
classes with (7,z^,/3) - (1.316,0.670,0.347), and (1.159,0.588,0.302) [3]. 

However, the critical behavior of the RPM turns out to be highly asymmetric 
- much more so than the HCSqW fluids. Indeed, the observed asymmetry seems 
likely to entail significant mixing of the pressure, p, into the linear scaling fields for 
the chemical potential //, and the temperature, T, as is needed to describe a Yang- 
Yang anomaly [1] in which the second derivative {(PpL/dT‘^) on the critical isochore, 
p = Pc, diverges like the specific heat when T ^ Tc~ [in contrast to traditional 
expectations that {(Pp/dT^) remains finite]. As a result, the previous approach 
using the /c-loci proved inadequate. Instead, extending to noncritical densities an 
idea originally exploited by K. Binder, the primary analysis for locating Tc precisely 
[5] was based on an examination of the dimensionless moment ratios 

Ql{T;{p)) = with m = p-{p), (1) 

computed within a finite (L < oo) grand canonical ensemble. At fixed T the ratio 
Ql{{p)) exhibits a (unique) maximum at a density pq{T;L). As L ^ oo the 
resulting “Q-loci” tend to a limiting locus which, in turn, approaches the critical 
point when T Tc~\- [5, 6, 8]. More to the point, however, if one evaluates Ql 
on the corresponding Q-locus, i.e., Ql{T\ Pq{T; L)) one discovers [5, 6, 8] that the 
plots vs. T display tightly spaced successive intersections as L increases (say, in 
steps of size a) that rapidly converge to Tc and to a limiting value Qc- Knowing a 
precise estimate for Tc one can obtain unbiased and reasonably precise estimates 
of the critical density. This is known to be very low for the RPM: indeed, the 
simulations yield p* = pcO^ ~ 0.079(3) (see [5]) where the figure in parentheses 
represents the confidence limits. 

On the other hand, extensive previous work [9] has shown that the critical 
value, Qc, for grand canonical simulations in a cubic Lx Lx - ‘ - x L box is universal 
with values Qc = 0, 0.6236(2), and 0.8045(1) for short-range, (d = 3)-dimensional 
n = 0, 1, and 2 (i.e., SAW, Ising, and XY) systems. For long-range n = 1 systems 
with attractive potentials decaying as (for d = 3), one finds that Qc(^) 

and also 7(a) increase almost linearly from their classical (van der Waals) values, 
namely, Qc = 0.4569 • • • and 7 = 1, in the interval | < cr < (7/i/)„=i < 1.96e 
with Qc{cr=1.9) < 0.600 and 7(a=1.9) < 1.20s [9]. Consequently, the value of Qc 
is a rather clear and robust indicator of universality class! 

To complete the story, we note that the simulation data for the RPM [5] 
yield Qc = 0.624(2). This is remarkably close to the Ising value quoted above; it 
is also far from the classical, XY and SAW values and, as an extra bonus, appears to 
exclude also long-range effective power-law interactions decaying more slowly than 
1/^4. 9 j'j Ç ^ with a < 1.9). Granted the reliable values for Tc and pc, studying the 
effective susceptibility exponents, 7eff(T;L), leads to the estimate 7 = 1.24(3): 
this upholds Ising behavior while SAW and XY values are clearly less plausible 
[5]. Estimating the correlation exponent calls for further care in order to identify 
various estimators, Tj{L) - extrema of various derivatives and moments - above 
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Tc, which can, therefore, be reliably and precisely evaluated by the simulations 
and then suitably extrapolated to estimate u [5, 6, 8]. The value u — 0.63(3) is 
obtained [5] which, once again, points clearly to Ising criticality. 

Finally, it should be noted that, in order to provide some easing of the com- 
putational burden the precise simulations so far reported [5] were performed using 
a (C = 5)-level of fine discretization [5]. The estimated critical density and critical 
temperature, Tc(C) = 0.05069(2), must clearly depend on ; however, with rea- 
sonable confidence, one may believe that the universality class of critical behavior 
does not change with (> 3. Subsequent calculations for larger values of C (ii^ 
progress with Y.C. Kim) confirm this surmise, yielding essentially identical esti- 
mates for Qc with no dependence on ( . Furthermore, one finds that convergence 
to the continuum limit as C increases is quite rapid. 

In another direction, by studying the behavior of Ql{T, (p)) systematically 
below Tc , greatly improved methods have also been developed for estimating the 
coexistence curve even very close to Tc ; thence, independent estimates of the 
exponent (3 can be obtained and pressure mixing in the scaling fields for the RPM 
is unequivocally revealed (indicating a significant Yang- Yang ratio [1] with = 
0.26) [15]. In parallel work [11, 12], the nature of the ionic screening near gas-liquid 
criticality can be estimated by studying the Lebowitz length, ^l(^?p) [10-12]. In 
the RPM it appears that ^l(^^Pc) displays a critical behavior matching that of 
the entropy [12] as may be anticipated on general grounds. As yet, the Ising (or 
other?) type of criticality in asymmetric ionic models (see e.g., [13, 14]) remains 
an open question as does the issue of the possible divergence of some of the ionic 
screening lengths [10]. 

References 

[1] M.E. Fisher and G. Orkoulais, The Yang-Yang anomaly in fluid criticality: 
Experiment and scaling theory, Phys. Rev. Lett. 85, 696-699 (2000). 

[2] See, e.g., H. Weingartner and W. Schroer, Criticality in ionic fluids, Adv. 
Chem. Phys. 116 , 1 (2001). 

[3] G. Orkoulas, M.E. Fisher and A.Z. Panagiotopoulos, Precise simulation of 
criticality in asymmetric fluids, Phys. Rev. E 63 , 51507:1-17 (2001). 

[4] E. Luijten, M.E. Fisher and A.Z. Panagiotopoulos, The heat capacity of the 
restricted primitive model electrolyte, J. Chem. Phys. 114 , 5468-5471 (2001). 

[5] E. Luijten, M.E. Fisher and A.Z. Panagiotopoulos, Universality class of crit- 
icality in the restricted primitive model, Phys. Rev. Lett. 88, 185701:1-4 
( 2002 ). 

[6] Y.C, Kim, Fluid Criticality: Experiment, Scaling and Simulations , Ph.D. 
Thesis (University of Maryland, 2002). 




S440 



Michael E. Fisher Ann. Henri Poincaré 



[7] Y.C. Kim, M.E. Fisher and G. Orkoulas, Asymmetric fluid criticality I. Scal- 
ing with pressure mixing (arXiv;cond-mat/0212145, 6 Dec 2002) Phys. Rev. 
E (2003) [in press]. 

[8] Y.C. Kim and M.E. Fisher, Asymmetric fluid criticality II. Finite-size scaling 
and applications [to be submitted for publication]. 

[9] See references cited in [5]. 

[10] S. Bekiranov and M.E. Fisher, Fluctuations in electrolytes: the Lebowitz and 
other correlation lengths, Phys. Rev. Lett, 81 , 5836-39 (1998); Diverging cor- 
relation lengths in electrolytes: exact results at low densities, Phys. Rev. E 
59 , 492-511 (1999). 

[11] E. Luijten, M.E. Fisher, and A.Z. Panagiotopoulos, Criticality and charge 
fluctuations in the restricted primitive model, Bull. Amer. Phys. Soc. 46(1) 
71 (2001) All 4. 

[12] Y.C. Kim, M.E. Fisher and E. Luijten, to be published. 

[13] J.M. Romero-Enrique, G. Orkoulas, A.Z. Panagiotopoulos, and M.E. Fisher, 
Coexistence and criticality in size- asymmetric electrolytes, Phys. Rev. Lett. 
85 , 4558-61 (2000). 

[14] A.Z. Panagiotopoulos and M.E. Fisher, Phase transitions in 2:1 and 3:1 hard- 
core model electrolytes, Phys. Rev. Lett. 88, 045701:1-4 (2002). 

[15] Y.C. Kim, M.E. Fisher and E. Luijten, Precise simulation of near-critical fluid 
coexistence (arXiv: cond-mat/0304032, 1 Apr 2003) [submitted for publica- 
tion] . 



Michael E. Fisher 

Institute for Physical Science and Technology 
University of Maryland 
College Park, MD 20742 
USA 




I Annales Henri Poincaré 



Ann. Henri Poincaré 4, Suppl. 1 (2003) S441 - S444 
© Birkhâuser Verlag, Basel, 2003 
1424-0637/03/01S441-4 
DOI 10.1007/s00023-003-0934-x 



Infînîte Volume Limit and Spontaneous Replica 
Symmetry Breaking in Mean Field Spin Gl^lss Models 

Francesco Guerra and Fabio Lucio Toninelli 



Mean field spin glasses [1] were introduced around 30 years ago as an ap- 
proximation of realistic models for disordered magnetic alloys. In what follows, we 
make particular reference to the well known Sherrington-Kirkpatrick (SK) model 
[2], but most of the results we present can be proven in greater generality. The 
Hamiltonian of the SK model in a magnetic field /i, for a given configuration of 
the N Ising spins = ±1, z = 1, . . . , TV, is defined as 



h] J) 



1 



A” 



N 

Z=1 



( 1 ) 



where the couplings Jij are quenched i.i.d. Gaussian random variables with zero 
mean. The model can be generalized in many ways, for instance allowing for more 
general disorder distributions, or introducing interactions among p-ples of spins, 
with p > 2 (p-spin model [3]). The main object of interest is the infinite volume 
quenched free energy density, defined as 

F{p,h) = lim FNil3,h) = lim _^£ln Zjv(/3, /i; J). (2) 

N—yoo N^OO JM fj 

Here, is the disorder dependent partition function, at inverse temperature /5, 
and E denotes average with respect to the quenched disorder J. Due to their mean 
field character, these models are exactly solvable, at least in the framework of Parisi 
theory of replica symmetry breaking [1]. This predicts a solution for the quenched 
free energy, which we denote by h), expressed as the supremum of a 

suitable trial functional over the space of functional order parameters [1]. Prom 
the rigorous point of view, the situation is much more delicate, and even the 
problem of proving that the limit in (2) exists remained open for a very long time. 

Our methods make use of very simple interpolation techniques. Broadly 
speaking, many interesting results can be expressed in the form of comparisons 
(inequalities) between the free energies of two different systems, as we clarify be- 
low in some concrete cases. In order to obtain this, the idea is to introduce an 
auxiliary partition function, depending on an interpolating parameter 0 < t < 1, 
which reduces to the partition function of the two systems to be compared when 
t = 0 or 1. Then, if the derivative of the t-dependent free energy has a definite 
sign, the inequality simply follows by integration. Let us give some examples. 
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In [4] , [5] we solved the problem of the existence of the infinite volume limit 
(2), for a wide class of mean field spin glass models. The strategy is based on a 
suitable interpolation between the N spin system and a system made of two non- 
interacting subsystems of size Ni and N 2 , respectively, with Ni -\- N 2 = N. Then 
the definite sign of the derivative allows, as explained above, to prove subadditivity 
for the free energy. 



NFMiP^h) < NiF^Mh) + N2FMAP.h), (3) 

from which the existence of the infinite volume for Fn follows from standard 
methods. 

A much deeper result concerns the relation between the quenched free energy 
Fn and the solution proposed by Parisi, which is shown to be a rigorous 

lower bound for Fjsj for all N, j3^ h [6]: 

(4) 

While the precise form of the interpolating partition function is quite involved in 
this case (see [6] for details), the underlying physical idea is very simple: the model 
being of mean field type, it must be possible to substitute the random two-body 
interaction Jij with an effective random external field acting on every spin cr^. 
When the two-body potential is completely removed (corresponding to i = 0 in 
the interpolation), the model is exactly solvable, and its free energy is essentially 
given by the Parisi solution. Moreover, the difference between the l.h.s. and the 
r.h.s. of Eq. (4) can be expressed as the sum of positive fluctuations of the overlaps, 
in suitably defined states [6]. As a simple byproduct of (4) we showed that, below 
the Almeida-Thouless (AT) critical line [1], replica symmetry is broken, in the 
sense that the overlap between two configurations fluctuates even in the infinite 
volume limit [10]. 

Of course, one would like to prove also the bound opposite to (4), ie., 

+ (5) 

where o(l) vanishes for iV ^ 00, thereby fully justifying Parisi’s solution. Even if 
this task has not been accomplished in the general case yet, we were able to prove 
Eq. (5) in a region of high temperature or high magnetic field, where the system 
is in the so called replica-symmetric phase. (For previous results in this direction, 
see for instance [11], [12]). In this region, we proved [8] that 

Fjv(/3,/i) + (6) 

The proof requires to take two copies (replicas) of the system and to couple them 
via an interaction term depending on their mutual overlap (“quadratic replica 
coupling”). Moreover, in the same region we were able [9] to give limit theorems 
for the fluctuations. In particular, we proved that the fluctuations of the disor- 
der dependent free energy density — 1/(A^) \nZN{l^, h; J) around the limit value 
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F(/3, h) behave, on the scale 1 /a//V, as a centered Gaussian variable in the infinite 
volume limit. A central limit theorem holds also for the rescaled fiuctuations of the 
overlap between two configurations, around the value predicted by Parisi theory. 
Unfortunately, we were not able to control the whole expected replica- symmetric 
region, whose boundary is marked by the AT line, but this seems to be a common 
problem of all the rigorous approaches proposed in the literature so far. 

It is important to stress that our methods proved to be quite robust, in the 
sense that they are not specific of the SK model we are considering here. For 
instance, we showed in [13] how the quadratic replica coupling method can be 
extended to study the Viana-Bray diluted spin glass model [7]. This is a gener- 
alization of the SK model, where each site interacts only with a random finite 
number of other sites, even in the thermodynamic limit. We obtained a complete 
control of the system, including the thermodynamic limit for the free energy and 
limit theorems for the fiuctuations, in the whole replica symmetric region, which 
in this case corresponds to high temperature or high dilution. 
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Dimensional Reduction and Crossover 
to Mean-Field Behavior for Branched Polymers 

John Z. Imbrie 



Abstract. This article will review recent results on dimensional reduction for 
branched polymers, and discuss implications for critical phenomena. Parisi and 
Sourlas argued in [PS81] that branched polymers fall into the universality class of 
the Yang-Lee edge in two fewer dimensions. Brydges and I have proven in [BIOl] 
that the generating function for self-avoiding branched polymers in D -|- 2 contin- 
uum dimensions is proportional to the pressure of the hard-core continuum gas 
at negative activity in D dimensions (which is in the Yang-Lee or i(p^ class). I 
will describe how this equivalence arises from an underlying supersymmetry of the 
branched polymer model. 

I will also use dimensional reduction to analyze the crossover of two-dimensional 
branched polymers to their mean-field limit, and to show that the scaling is given 
by an Airy function (the same as in [CarOl]). 

Let us begin with a definition of branched polymers, first on the lattice, 
then in continuous space. On the lattice, branched polymers should look like the 
following picture. 




Define cm to be the number of N -vertex branched polymers mod translations. 
Let T be a tree graph on {!,..., N} and yi, ... ,yN be an embedding of the graph 
in the lattice. In other words, yi denotes the position of the vertex of the tree 
graph. If i and j are connected by a line in the tree graph, then yi and yj must be 
adjacent in the lattice. In addition, no two vertices may be mapped to the same 
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point; this would lead to a loop. The generating function Zbp(z) = z^cjv can 
be written as 



N=1 T y2,- -jyN ijÇ:T ij^T 

where 

( 2 ) 

and 

Uij = 1 — Sy.^y^ (3) 

enforce the adjacency and loop-free conditions, respectively. For example, cs = 
^•3-4*3 = 6inZ^as there are 3 tree graphs on {1,2,3}, and for each of them 
there are 4 choices for y 2 and 3 for 7/3. See [BIOl, Pro86] for details. 

Background 

The concept of dimensional reduction was introduced by Parisi and Sourlas in 1979 
in order to understand the behavior of the Ising model in a random magnetic field 
[PS79]. Although the proof of long-range order in three dimensions [Imb84, Imb85] 
showed that dimensional reduction did not work in that problem, dimensional 
reduction appeared to work for branched polymers [PS81]. Our proof answers any 
lingering questions about dimensional reduction for branched polymers and gives 
a clear picture of how it works. The question of what goes wrong for random fields 
continues to be debated in the literature [BD98, PS02, Fel02]. 

Dimensional reduction for branched polymers was employed recently by 
Cardy to determine the crossover scaling function for area-weighted self-avoiding 
loops in two dimensions [CarOl]. Below, I will use our dimensional reduction iden- 
tity to derive a similar result for the crossover from branched polymers to their 
noninteracting or mean-field limit. 

Branched Polymers in 

Lattice models of branched polymers have been written down that are supersym- 
metric after dropping irrelevant terms [Sha83, Sha85]. Here we work in continuous 
space so that the supersymmetry responsible for dimensional reduction can be 
preserved. Let us adjust the definitions given above for lattice branched polymers 
to the continuous case. In a branched polymer consists of 

1. A tree graph T on 1,. . . ,N and 

2. An embedding y = into such that 

• if ij e T then \yij\ = 1 and 

• if ij ^ T then \yij\ > 1. 
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The generating function for branched polymers (mod translations) is 
oo jv /* 

^BP(^) = J2^ J2 dy 2 -- dyN]l [2t/tl n Uij, (4) 

N^I ' T ijeT ij^T 

where 

Uij = U{\yi-y^\^} (5) 

and 

2Ul^ = 2U'{\yi - -yj\^ 1). (6) 

As before, U![j enforces adjacency, and Uij is a self- avoidance condition. One can 
think of these branched polymers as collections of hard spheres with kissing con- 
ditions determined by the tree graph T. 

The Hard-Core Gas in 

We shall relate branched polymers in to the hard-core gais in IR^. The hard- 

core gas has hard spheres centered at Xi G for i = 1, . . . , TV. Let 

Xij =Xi- Xj, tij = \xij\^, ( 7 ) 

and let 

U(tij) =9{tij -\) (8) 

enforce the hard-core constraint. Then 

oo jy „ 

-^Hc(2)=Xl^/ d^l-'-dXN n (9) 

N =0 ^ l<i<j<N 

Main Results 

Theorem 1. For all z such that the right-hand side eonverges absolutely, 

lim — log Z hc(^) = -27tZbp . (10) 

A/'RD |A| V 27T/ 



Examples 

Exact computation of the left-hand side is possible in D = 0, 1. For D = 0, 
Zhc{z) = 1 -\- z and so in dimension D + 2 = 2, 

Zbp{z) = -L iog(l - 2ttz). (11) 

Z7T 

The pressure of the one- dimensional gas of hard rods is also computable, see for 
example [BIOS]. Hence in dimension D -h 2 = 3, 

N=1 



( 12 ) 
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Here T{z) = — Lambert W(— 2 : ) is the tree function [CGHJK]. These equations 
allow us to write a simple expression for cn in two and three dimensions. 

Green’s Functions 

We also have a dimensional reduction formula for correlations. First, some defini- 
tions. Let 

N N 

P{i) = '^S{x - Xi), p(y) = '^S{y -Vi), (13) 

. i=l z=l 

where x, Xi G and ÿ, yi G E^+^. Then the density-density correlation functions 
of the two systems can be written as 

00 N n 

GBp{ÿi,ÿ2-,z) = dyi---dyNp{ÿi)p{ÿ2)Y[{‘2Ulj]'[[u,j, 

Ghc{xi,X2;z) = lim (p(Æi)p(Æ2))hca- (14) 

Here (•)hc,a is the expectation in the measure for which ^hc(^) is the normalizing 
constant. 

Theorem 2. 

Ghc(0,x;2) = -27t J (fiwGBP > (1^) 

where y = {x,w) G E^+^. 

Analogous results can be obtained for the n-point functions. 

Consequences for Critical Exponents 

One can define a critical exponent ohc by looking at the singularity of the pressure 
of the hard-core gas: 



Likewise, a susceptibility exponent 7 hc can be defined from the behavior of Zbp: 

Keep in mind that Zc is negative and — |^ is positive - Theorems 1 and 2 relate 
the critical behavior of branched polymers to the critical behavior of the hard-core 
gas at negative activity. Theorem 1 equates the two singularities, so 



«HC = 7bp- 



(18) 
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Examining the 2-point functions of each model near their critical points, we 
can define exponents rjuc, î?bp from the asymptotic forms 

(?Hc(0,a;;2) ~ Khc{x/^hc), (19) 

GBp(0,æ;2) ~ (20) 

as |x| ^ CO with x := x/^ fixed. Here d = D -f- 2, ^hc, ^bp are the correlation 
lengths of the respective systems, and Kuc^ ^bp are scaling functions. Theorem 
2 implies that ^hc = ^bp, so in particular the exponents = pbp, where 

(21) 

^BP{Z) - ( 22 ) 

3.S z \ Zc- Theorem 2 also implies that 

rjBC — Vbp (23) 

ÎOT D > 1. Furthermore, if one defines 6 from 

oo 

^bp(^) = ^ cnz^ (24) 

N^l 

with 

CN ~ z;^N~^. (25) 

Then 

0 = 3 — 7bp- (26) 

Further results can be obtained for the scaling functions. We can express 
i^BP in terms of Kuc and its radial derivative: 

Kbp{x) = [xKiicix) - (P - 2 + 7?hc)^hc(*)] • (27) 

In JD = 1 we can compute [BIOS] 



Kncm = -~e-^ (28) 

so that in + 2 = 3 dimensions, 

Kbp{x) = -F e-^ (29) 

TT^X 

which agrees with the prediction of Miller [Mil91j. Let us tabulate the known or 
for the critical exponents of the hard-sphere gas in various dimensions. 
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D 


a 




^HC 


a = 1 — a 


0 


2 






-1 


1 


3 

2 


1 

2 


-1 


1 

2 


9 


7 


5 


4 


1 




6 


12 


5 


6 


MFT D >6 


1 

2 


1 

4 


0 


1 

2 



In D = 0,1, the values are rigorous. In = 2, the value a = ^ is obtained 
from Baxter’s solution to the hard hexagon model [Bax82, Dha83, BL87]. The 
last column gives values for cr, the Yang-Lee edge exponent. Assuming that the 
hard-sphere gas at negative activity falls into the Yang-Lee edge universality class, 
we have a = I — a. Actually, from hyperscaling {Duuc = 2 — q;hc) and Fisher’s 
relation a = we know that there is only one independent exponent in 

each dimension. The value t^hc = is independently confirmed from conformal 
field theory [Car85]. 

The branched polymer critical exponents in D + 2 dimensions can be equated 
to the corresponding values for the hard sphere gas in D dimensions; see the first 
three columns of the following table. 



d = DF2 


7 = 


Z/BP = l^HC 


^BP = me 


6» = 3-7 


2 


2 






1 


3 


3 

2 


1 

2 


-1 


3 

2 


4 


7 


5 


4 


11 














6 


12 


5 


6 


MFT D>S 


1 

2 


1 

4 


0 


5 

2 



The values in two and three dimensions are now rigorous. Note that in high di- 
mensions {d > 8) it has been proven that 7 bp = pbp = ^bp = 0 (at least for 
spread-out lattice models) [HS90, HS92, HvS03]. While our results do not apply 
to lattice models, they give a strong indication that the corresponding hard-core 
exponents have the same (mean-field) values for D > 6. 

Forest-Root Formula 

The underlying mechanism behind all these results is an interpolation formula 
(the Forest-Root formula). Let /(t) depend on t = {Uj),{ti) for 1 < z < j < N. 
Assume that / ^ 0 at oo. Let tij — \wi — = \wi\‘^ with Wi e C. Then 




The sum is over forests F and roots R (collections of bonds ij^ and vertices z, 
respectively) such that each tree of F has exactly one root R. See Fig 1. 
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forest 



Figure 1: Example of a forest 



For a simple example, consider the case N = 1,F = — {!}. Then the 

formula reduces to the fundamental theorem of calculus in one variable: 

/(o)= //V)— = - T/w- (31) 

Jc -7T Jo 



Supersymmetry 



To understand the Forest-Root formula in general, we need to exploit supersym- 
metry. Replace in /(t) each variable U with 



= WiWi -h 



dwi A dwi 
27TZ 



(32) 



and tij with 



dwij A dwij 
2m 



(33) 



(Recall that w^j = wt — Wj). Then /(r) is defined by its Taylor series. I claim that 
a “localization” formula holds: 



[ fiz) = /(O). (34) 

This formula becomes the Forest-Root formula when expanded out. The absence 
of loops comes from the fact that {dw A dw)^ = 0 if G has a loop. We prove 
the formula by deforming the problem to the independent case, using ideas from 
[Wit92]. 

An alternate argument can be given, exploiting the linearity of (34) to reduce 
to the case where / is an exponential [BW88]. Let 



poo 

/(a) = / dt/(t)e"* (35) 

Jo 

be the Laplace transform of /. Here a = (a^), (a^j) is dual to t, that is a • t = 
'Eijttijtij -f- Yjiaiti. Let us assume that / has exponential decay in each ti variable. 
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SO that we can take the a^’s to have positive real part. Cases where / decreases 
more slowly at infinity can be treated by taking limits. We can write / in terms 
of its Laplace transform, 



/(t) = 




(36) 



with the integration contours running parallel to the imaginary axis. We see that 
(34) holds if it is true for functions of the form 
Let us write 



where 



, . , / dw . dw \ 


(37) 


Aij = —dij^ ^^3-) 


(38) 


An — di ^ ^ dij . 


(39) 







The matrix A is symmetric and has positive real part. Then by the properties of 
the differential forms dwi, dwi, 




j 

j 

Jcf 






/ dw ^ dw \ 
\ \/27ri ’ \/27ri / 



N 



- {w,Aw) 



det 



—dwi A dwi 
2m 



(40) 

(41) 



With Wi — Xi iyi^ we have that dwi A dwi = —2idxi A dyi^ so this is 



n-^àetA j = 1 , 



(42) 



and we obtain (34) for /(t) = e ^ 



Decoupling in Two Extra Dimensions 

Use the Forest-Root formula to decouple the spheres in the hard-core gas by mov- 
ing them apart in two extra dimensions. At fixed N we have an integral over 
X Ç C of 

m= n U{\xi^\^), (43) 

where ti = \wi\^ = ^. Extend this to tt; 7 ^ 0 by writing 

/(t) = H + %) ^ (large t cutoff). 

l<i<j<N 



( 44 ) 
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Apply the Forest- Root formula, 




Then each when applied to Uij^ becomes ^ surface measure for the combined 
integrals over yij = (xij^Wij), This means that spheres are stuck together accord- 
ing to the forest F . Furthermore, the trees of the forest decouple in the limit as 
the large t cutoff is removed. This is because they are spread apart in the w di- 
rections, so that they rarely interact. As a result, Zhc can be written in the form 
of an exponential of tree graphs. Thus log Zhc is evaluated as a sum/integral over 
branched polymers, and Theorem 1 is proven. See [BIOl] for details. 



Relation with the Yang-Lee Edge 

Repulsive gases near criticality at negative activity are described by an field 
theory. More precisely, they should fall into the same universality class - see [Car82, 
Dha83, LF95, PF99]. To see this, replace the hard-core potential with a repulsive, 
smooth, rapidly decaying two-body potential 



u{xij) = v{\xij\‘^) with u{k) > 0. 
By the sine- Gordon transformation, the repulsive gas 



^ f 

/ dxi-- dxjv exp 



-(3 ^ w(a 



can be written as a field theory. With 5 = ^ we have 



4^) = / exp ly dx 



M 



(46) 



(47) 



(48) 



If 5 is negative enough, this action looks critical. In fact at the critical value 
of z, the lowest order term in the action A similar situation occurs for Ising 

or model in a sufficiently large imaginary field [Fis78]. 



Dimensional Reduction: Soft Branched Polymers 

The arguments leading to Theorems 1 and 2 work for any U (t) for which U — 1 



and U' have sufficient decay. Take 




U{t) = exp{-0v{t)) 


(49) 


to get the identity 




lim -^logZ^;(2;) = -27tZbp,v (~:^) , 
|A| V 27T/ 


(50) 
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where Zbp^v is given by the same formula (4) as before but with this [/. In this 
model of “soft” branched polymers, the repulsion between monomers is given by a 
factor exp{— f3v{\xij\‘^)) , and the attraction linking neighboring monomers in the 
tree graph is given by a factor —2/3vWxij\‘^)). Together with the sine-Gordon rep- 
resentation for Zbp,vj we obtain a direct representation for branched polymers in 
D-\-2 dimensions as a field theory in D dimensions. As this is presumably in 

the Yang-Lee class, we have confirmed the prediction of Parisi and Sourlas [PS81]. 

Crossover to Mean-Field Branched Polymers 

We wish to understand what happens as the repulsion between monomers is turned 
off. In the limit, we obtain a mean-field or noninteracting model (on the lattice 
such models were analyzed in [Fro86, BCvS]). Let v = v{0) be small and 2 : be large 
so that a redefined i zve^^‘^ is fixed. In these variables (50) becomes 

log f exp -- = -27 tZbp,„ . (51) 

J-oo L J ^27T/ 

I have reversed the sign of 2 ^, so the branched polymer singularity is now at posi- 
tive z. 

It should be clear that the 0 limit simplifies both sides of this identity. 
On the 0-dimensional Yukawa gas side, 'C ^ 0 means the integral reduces to finding 
the critical point of ze^"^ + On the two-dimensional branched polymer side, 
V 0 eliminates the self- avoidance. Every tree has one fewer edges than vertices. 
Assuming v' remains proportional to -c, as v ^ 0 and zv ^ z^ this means that 
-^Zbp,v goes to a finite limit, which is a generating function of non-self-avoiding 
(or mean-field) branched polymers containing the origin. 

We can observe the crossover by considering Zpp.v as a function of i and v 
near = 0 and ^ = Zc- With S{(p) = ze^'^ ^ we have 

S'i^p) = p -\- ize^"^ = 0 when iip — x with xe~^ — z (52) 



z=xe 




The solution is x = x{z) = T{z) as in the graph. We also solve 

S"(ip) = 1 — ze^"^ = 0 when 5 = e~^ = (53) 

so Xc = 1, Zc = e~^ has 5" = S'" = 0. 
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Let US expand S{(p) about ^ = —i with 5 close to Zc = e L With (p := (f -\-i 
we obtain 

S{p-i) = - ip - \ eze"^ 

= \p^ — ip — ^ - e{zc — z)e''’^ + . (54) 

Let t = This becomes 

— 1 — ip+ ^p“^) — — itp-\-0{p^)-{-tO{p'^) — t-{-^, (55) 

and so from (51) 



-2ttZbp,v 



- ( + it(fi + + tO{if^) + t — - 

V \ o Z 



dp 

J V27TU 

(56) 



Let us apply ^ to get the sum of branched polymers containing the origin. With 
Gnp,v(y) = ^ZBP,v{y), we have 






exp 



1 / ip' 






+ itp + 0{p'^) + tO{p‘^) + t 



dp. (57) 



The expression in the exponent is useful for p small. Consider the crossover 
regime, \t\ < I wish to show that the contribution to the integral from 

\p\ > is smaller than any power of v. Make a kink in the contour of inte- 

gration for (^, so that it lies at angles ^ and This turns ip'^ into —\p\^- From 
(55), one can check that on this contour, 

^eS{p) > const min{|(^|^, \p^} ~ ^0{p) — t T (58) 



so that p is effectively of order This means that ^O(p^) and ^0{p‘^) are 

0(u^/^) and that these terms do not contribute to Zbp,v to leading order in 
Likewise when computing ^ ~ term 0{p‘^) = 

(9(^2/3) -g subleading order. The term — ^ ^ is nonsingular. Hence the 

singular behavior of Gbp,v is given to leading order by 



G 



(sing) 

BP,v 




A 

dz 



■■1 



log / exp 



ip 



+ itp 



dp. 



(59) 



We have arrived at a point very similar to the one Cardy reached in his 
analysis of the crossover from area-weighted self-avoiding loops to self-avoiding 
loops [CarOl]. He related that problem to branched polymers and, as we have 
done here, used dimensional reduction to compute the crossover scaling function 
as an Airy integral. If we rescale (p in (59), we obtain as in [CarOl] that 






(60) 
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where 

F{s) = bo-^ln Ai{bis). (61) 

as 

Here bo and bi are constants, and Ai(^) = ^ is the Airy function. 

The expression (60) interpolates between two different critical behaviors for 
branched polymers. We have already seen that 7 bp = 1 in two dimensions; this 
corresponds to the pole in (60) that occurs when v is fixed and nonzero, and t 
decreases through 0 to the first zero of Ai{bitv~^). This is the limit of convergence 
of Zbp; past this point (51) no longer makes sense. 

If, instead, we send v to zero and then let t \ 0, this means (f is replaced by 
its critical value (fc where S'{(fc — i) = 0, c.f. (52). Then, 



lim 




lim log [ 

dz J 



exp 



1 / 

— (ze 

V ^ 



Ilf 






dip 



(62) 



It is clear from the graph above that ipc ~ Thus the leading singularity 

of (62) is and 7 = This agrees with the value determined for the 

mean field models in [Pro86, BCvS]. This also agrees with the conjectured value 
of the entropie exponent of self-avoiding walks. The square-root behavior can be 
seen in (60) also, as F{s) ~ as s oo. 

The identity of the crossover scaling functions and critical exponents for self- 
avoiding loops and branched polymers suggests a close connection between the 
two systems. It is natural to think of loops arising as the frontier of branched 
polymers. In this picture, the self- avoidance interaction of the branched polymer 
gives rise to an effective area weighting for the loop, and the mean-field limit of 
branched polymers corresponds to loops without area weighting. This fits in with 
recent results [LSWOl] on the Hausdorff dimension of the frontier of Brownian 
motion (suggesting a link with self- avoiding walks) . 
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Polymer Transport in Random Potentials 
and the Genetic Code: The Waltz of Life 

M. Aldana- Gonzalez, G. Cocho, H. Larralde and G. Martmez-Mekler 



Abstract. We address the question of why the translation from nucleic acids to pro- 
tein forming amino acids is carried out by triplets known as codons. We approach 
this problem from a dynamical point of view by considering the translocation 
properties of primitive molecular machines operating under prebiotic conditions 
[1, 2, 3, 4]. Our model captures some basic ribosome-mRNA interaction features. 
We consider short chains of charged particles interacting with polymers via elec- 
trostatic forces, constrained to move in quasi one dimensional geometries, subject 
to external forcing. Our numerical and analytic studies of statistical properties of 
random chain/polymer potentials suggest that, under very general conditions, a 
dynamics is attained in which the chain moves along the polymer in steps of three 
monomers, traversing swiftly two monomers and lingering on the third one as in 
a waltz. This behavior is enhanced when we consider present day protein coding 
sequences. We also comment on noncoding sequences. We argue that this property 
could be one of the underlying causes for the three base codon structure of the 
genetic code. 



1 Introduction 

The genetic code relates nucleic acids from the genetic material to protein con- 
stituent amino acids. Key elements of this process are the number m of nitrogenous 
basis and the number aa of codified amino acids. Words from one of the alphabets 
correspond to elements of the other. In the genetic code these nitrogenous basis 
words, known as codons, have a fixed size^ n. One of the main questions regard- 
ing the genetic code is to achieve an understanding of why m = 4, aa = 20 and 
n = 3. If we take aa = 20 (which is the present number of protein constituent 
amino acids) and vary m and n we find several arrangements that allow for the 
20 amino acids. For example, in the table below we highlight with bold letters 
the simplest choices with an even number of basis (this avoids undecidable situ- 
ations in a replication process where one base may correspond to more than one 
“complementary” base). The first column in the table is the number of elements 
in the codon, the second is the size of the nucleic acid alphabet, the third shows 
the number of nitrogenous base combinations (amino acid words) and the third 
indicates whether or not there is complementarity. 

^ Though it has been pointed out [5] that more efficient codes can be obtained by allowing the 
length of codons to vary. 
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n 


m 


aa 


complementarity 


1 


20 


20 


yes 


2 


4 


16 


yes 


2 


5 


25 


no 


2 


6 


36 


yes 


3 


2 


8 


yes 


3 


3 


27 


no 


3 


4 


64 


yes 




2 


16 


yes 


4 


3 


81 


no 



Rows 1,4 and 7 are possible choices. It would be desirable to have some 
“first principle” argument for choosing a value of n independently from m, or 
viceversa. This is the main motivation for this work. Here we review a proposition 
put forth by us [1, 2, 3, 4] where n is associated to the value of 3 based on the 
dynamical properties of primitive synthesis molecular machines which would be 
operating under pre or protobiotic conditions. We will only highlight the main 
lines of argument, detailed descriptions can be found in the above references. The 
general setup relies on the following considerations: 

• Protein synthesis takes place at the ribosome which is a macromolecular 
cell structure where a polymer, the messenger ribonucleic acid (mRNA), 
is translocated through the ribosome along a tube-like geometry. The path 
along which the mRNA slides is mainly conformed by ribosomal ribonucleic 
acid (rRNA). The ribosome acts as a complex molecular machine that moves 
along the mRNA. 

• A primitive synthesis and replication molecular machinery may have con- 
sisted of polymers (mRNA antecedents) moving in quasi 1-dimensional chan- 
nels (ribosomal tube antecedents). 

• The relative rRNA/mRNA motion corresponds to the relative movement be- 
tween free polymers and polymers fixed to the channel. Both polymers would 
be constituted by various electrically charged monomers. Motion would be 
induced by external electromagnetic fields.^ 

• The transport properties of such primitive molecular machines would condi- 
tion chemical reactions and coding processes. 

Concerning the last two points we consider the generic situation of poly- 
mers with randomly charged monomers. We first look at their mutual interaction 
potentials and then focus on the statistical properties of the distance between 

^In [1, 2, 3] we argue in favor of the plausibility of an outer space origin of life scenario [6] 
where interstellar dust or meteorite surfaces would provide the quasi 1 dimensional templates 
and UV radiation would charge the monomers 
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neighboring potential minima. We then show how these static properties influence 
the statistical characterization of dynamical quantities via an equation of motion. 

Our central result is that under the above conditions, a dynamics is attained 
in which the polymer moves along the channel in effective “steps” whose mean 
length is three monomers. We suggest that this dynamical feature may be an 
important underlying cause of the three base codon structure of the genetic code. 



2 The model 



We propose a model consisting of a chain of M monomers, with charges {pi}, 
interacting with a very long polymer composed of N monomers with charges {qj}, 
as shown in Fig. 1. The charges we consider are not necessarily Coulomb charges, 
both {pi} and {çj} could be dipolar moments, induced polarizabilities, or similar 
quantities resulting from electrostatic interactions between chain monomers and 
polymer monomers with potentials of the form 1/r^, where a characterizes the 
“charge” type (a = 1 corresponds to an ion-ion interaction, a — 2 represents an 
ion-dipole interaction, and so forth). We do assume that all the charges are of the 
same nature. We consider the charges {pi} and {qj} to be discrete independent 
random variables, acquiring one of m different values with the same probability. 
All monomers have the same length L and their charge is taken to be uniformly 
distributed along their length. The interaction potential V^j{x) between the ith- 
monomer in the chain and the jth-monomer in the polymer is given by 



rx+i nj 

Vij{x) = Kpiqj / — 

jj-l [(x' — ; 



dx'dx" 



na/2 



( 1 ) 



where AT is a constant whose value depends on the unit system used to measure 
the physical quantities. We have taken L = 1 in the above expression. The overall 
interaction potential V^{x) between the whole chain and the entire polymer is 
given by the superposition of the individual potentials Vij{x): 



M N 



Z=1 j=l 



( 2 ) 



3 Statistical Properties of Random Potentials: 

We start with the simplest M = 1 case, in which the chain consists of just one 
monomer. We will refer to this as the particle-polymer case. Fig. 2 shows three 
graphs of the potential V^{x) for a = 0.5 and different values of the parame- 
ter a. Charges {qj} are uniformly distributed amongst m = 6 different values 
{±1,±2,±3}. Notice that the distribution of maxima and minima along the po- 
tential does not change significantly by varying the value of the parameter a, in 
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Figure 1: Chain composed of M monomers interacting with a polymer, N monomers in length. 
Charges {pi}, {qj} are independent random variables. The chain is at a fixed perpendicular 
distance a from the polymer, but can move along the polymer, x is the relative polymer/chain 
horizontal position. 



the sense that all the maxima and minima remain essentially at the same posi- 
tions. As a takes on larger values, the potential approaches a step-like function. 
A similar behavior is encountered if instead of varying the interaction range we 
decrease the distance from the particle to the polymer and keep a constant. 

We are mainly concerned in the statistical behavior of the distance d between 
consecutive minima, since by looking at the average value d, the most probable 
value d* and the probability distribution function of d, we will obtain important 
dynamical information of the system. If an external force is acting on the particle 
(or the chain), forcing it to move in one direction along the polymer, the particle 
will spend more time in the energy minima than in the maxima. Such a movement 
may be interpreted by considering the particle as “jumping” from one minimum 
to the next (see Fig. 3). 

Using the step- like limit, we have derived [1] that for a long polymer (in the 
large N limit) d is given by: 



- _ 

^ 2m -1 



( 3 ) 



where we recall that m is the number of different monomer charges or monomer 
types if we use the charge as a monomer label. The mean distance d is hence always 
between 3 and 4, approaching 3 asymptotically as m oo. 

For the probability distribution function Pm(d), of two consecutive minima 
being separated by a distance d when there are m different types of monomers we 
have closed expressions [4] valid for the step-like limit: 



6 

2m — 1 



d-2 



E 



2fe + 1 



Nm{d - k) 



Pm{d) = 



( 4 ) 
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Figure 2: Particle-polymer interaction potential for cr = 0.2 and different values of a. (a) 

a = 1; (b) a = 3; (c) a = 5. Charges along the polymer, selected at random, are the same in the 
three graphs shown. Note that cis a increases, the potential becomes a step-like function, but the 
positions of the maxima and minima do not change. There are 33 potential minima distributed 
along 100 monomers, giving a mean distance between consecutive minima d= 100/33 = 3.03. 
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Figure 3: In a one dimensional potential, the dynamics is determined by the distribution of 
maixima and minima. 



if d is integer: c? = 2, 3, 4, . . ., and 

^ - k) (5) 

k=0 

if d is half-integer: d = d = 7j2, d = 9/2, . . ., where d' = Int[d]. In the above 
expressions, d is measured as the distance between the midpoints of the adjacent 
minima and Nm{d) is a polynomial whose degree and coefficients depend on m. 
For m = 2, m = 3 and m = 4 the polynomials are given by N 2 {d) = l,Ns(d) = 
1 -h 2d -h 2d^ and N 4 {d) = -2 -h 1/8 [30d -h 73d‘^ -h 22d^ -f 3d^] . 

For the case of m = oo we have also derived a closed expression [4], which 
has a much simpler form: 



Poo{d) = 3j^^{d^+d-2) ( 6 ) 

The preceding distributions, which are plotted in Fig. 4, show that the most 
probable distance d* between consecutive potential minima is d* = 2, except for 
the case m = 2 in which d* = 3. Hence, according to the transport mechanism 
suggested in Fig. 3, whenever there are more than two different types of monomers, 
the particle will move along the polymer in “jumps” whose mean length is close 
to three, but whose most probable length is actually two. The difference between 
the mean distance d and the most probable distance d* is due to the presence of 
“tails” in the probability function P^(d). Namely, to the fact that Prn(d) has non 
zero values even for large d. However, these “tails” can be considerably shrunk [4] 
when the chain is made up of more than one particle (M >1). 

We now consider the case M > 1. This situation is closer to the present day 
ribosome based dynamics since at each given time, the ribosome interacts with the 
mRNA at several points, giving rise to a multiple interaction. Electron microscopy 
has revealed that the mRNA thread passes across the ribosome through a tunnel 
about lOnm long and 2nm in diameter [8]. Since nucleotide length is about 0.5nm, 
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Figure 4: Probability function Pm.(d) of two consecutive minima being separated by a distance 
d, for different values m of monomer types. 

around 20 nucleotides may be in a position to interact simultaneously with the 
ribosome. Taking the above into account, we will work with a small chain, assuming 
M = 10 as a reasonable length. 

Numerical simulations of Pm(d) show that it has a strongly fluctuating behav- 
ior. Its shape depends crucially on the particular realization of monomer charges 
in the chain. For example. Fig. 5a shows Pm(d) for a polymer 500 monomers in 
length and a random realization of charges in a 10-monomer chain. Charge values 
were {±1, ±2}. Fig. 5b shows a similar graph for a different realization of charges 
along the chain. Polymer charges were kept the same in both cases. Note that the 
first graph presents a very sharp maximum at d* = 2, whereas the second one 
does so at d* = 3. When we consider 4500 charge realizations for a 10-particle 
chain interacting with a 500 monomer polymer (typical length of sequences coding 
functional proteins [8]) we obtain the following table for the occurrence of either 
d* = 2 or d* - 3: 



d* 


Fraction of Occurrences 


æ = 3 


0.53 


d* = 2 


< 0.33 




Table I 
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Figure 5: Probability functions P 4 (<i) corresponding to two different realizations of monomer 
charges for a 10-monomer chain interacting with a 500 monomer polymer. Mean distance is 
d ~ 2.36 for (a) and d ~ 2.98 for (b). 



The above result shows that when a multiple interaction between the poly- 
mer and the chain is considered, the probability of having a random interaction 
potential whose consecutive minima are more often separated by three monomers, 
is dominant. 



4 Statistical Properties of Genetic Sequence ‘‘Potentials” 

So far we have studied generic behaviors in the sense that they refer to almost 
all possible electrostatic interactions and hold for randomly associated monomer 
charges. If we now assign the charges {qj} along the polymer in correspondence 
with the genetic sequence of an organism via a lookup table, we may test the 
model’s response to the genetic information accumulated along evolution. Since 
genetic sequences are made out of four different bases (A, U, C and G) we consider 
four different possible values for the charges {qj}, i.e. m = 4. An arbitrary look 
up table is the following: 



(7) 
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Figure 6: Probability function P 4 (d) computed by assigning the charges in the polymer in 
correspondence with genetic sequences of the Drosophila malanogaster genome: (a) protein- 
coding, d = 3.15, (b) intergenic, d = 3.43. 



Figure Fig. 6a shows the probability distribution P 4 (d) computed numerically 
by using a Drosophila melanogaster protein-coding sequence, 45500 bases in length 
(several genes were concatenated to construct this sequence) considering the M — 
1, particle-polymer situation. The mean distance between consecutive potential 
minima for this sequence is d ~ 3.15 while the most probable distance is d* =3. 
Therefore, even for particle-polymer interactions, in the “real sequence case” not 
only is the mean distance d very close to 3, but also the most probable distance 
d* turns out to be 3. A comparison of the m = A case of Fig. 4 with Fig. 6a, shows 
that for protein-coding genetic sequences, the potential minima along the polymer 
are more often separated by three monomers than in the random case, the value 
of P 4 (d) increases when d = 3 and decreases at d = 2. 

The above behavior does not occur when non-coding sequences of the genome 
are used for the monomer charge assignment along the polymer. Fig. 6b shows 
P 4 (d) for an intergenic sequence of the Drosophila’s genome. The length of the 
sequence is again 45500 bases. In this case P 4 (d) looks much more like the one 
obtained for the random case. 

It is important to remark that the behavior of P 4 (d) exhibited in Fig. 6a for 
real protein-coding sequences does not depend on the particular correspondence 
between bases and charge values being used, as long as they are of similar order of 
magnitude and they can allow for an order relation.^ Furthermore, the probability 



^The four bases A, T, C and G have charges of the same order of magnitude [7]. 
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function P 4 (d) is invariant under baise permutations [4], this also holds for non- 
coding sequences. 

In reference [4] we looked at several organisms of a variety of types and found 
that the maximum at d* = 3 of P 4 (d) is a general characteristic associated with 
the protein-coding sequences of living organisms. The fact that the above charac- 
teristic is absent in non-coding genetic sequences may be interpreted in evolution- 
ary terms. Genetic sequences directly involved in the protein- translation processes 
were selected (among other things) as to bring the distance between consecutive 
potential minima closer to 3, both in mean and frequency of occurrence. This pro- 
cess may have been enhanced by multiple chain-polymer interactions which, as 
pointed out in connection with Table I, favor a value of d* = 3. 



5 Dynamics 



Ribosome translocation is an intricate process dependent on many physical and 
chemical factors [8]. A detailed modeling of this process is practically unman- 
ageable. However, as with other complex phenomena, it is fruitful to focus on 
specific matters relevant at a given level of description. For example, the recent 
experimental evidence suggesting a ratchet-like behavior [9] is indicative of the 
appropriateness of an analysis within the perspective of the current developments 
on the study of ratchet dynamics [10]. In this work, we have chosen to look into 
the behavior of oversimplified molecular models which might capture some essen- 
tial dynamical features of the system and shed some light on how this mechanism 
could have arisen. 

Our modeling of the dynamics of the system is governed by the application 
of an external force Fex to the chain in the horizontal direction, i.e. parallel to 
the polymer. By this means the chain will be forced to move along the polymer. 
In principle, the force Fex may be time dependent, but we will restrict ourselves 
to a constant term. This force might come from a chemical pump or from any 
other electromagnetic force which may have been present in prebiotic conditions. It 
should be strong enough as to drive the chain along the polymer (which is assumed 
to be fixed, N ^ oo limit), without getting trapped in some of the minima of the 
polymer-chain interaction potential V (x ) , and weak enough in order to “detect” 
the characteristics of the potential. Therefore, we assume that \Fex\ is slightly 
larger than the maximum value of \dV {x) / dx\. 

Our analysis relies on Newton’s equation of motion in a high friction regime, 
where inertial effects can be neglected. This regime actually exists in biological 
molecular ratchets similar to the one we are considering [10]. Under such condi- 
tions, the equation of motion acquires the form 



7 



dx dV^ {x) 



dt 



dx 



FFe: 



( 8 ) 



"^This property is a consequence of the order in which the bases or codons appear along the 
sequence and is not related to the relative weight (fraction) of their occurrence [4]. 
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Figure 7: (a) Particle velocity along the polymer as a function of the position x. (b) Local 

transit time t as a function of the position x. 



where 7 is the friction coefficient. In what follows, we will set 7 = 1, which is equiv- 
alent to rescaling the time units. The above, though a deterministic equation, gives 
rise to a random dynamics inherited from the randomness of the interaction poten- 
tial V^{x). For the particle polymer case we show in Fig. 7a a typical realization 
of the velocity of the particle v{x) as a function of its position x along the polymer 
determined from equation (8), with a = 2 and a = 0.5, and monomer charge val- 
ues of {±1, ±2} (case m = A). Fig. 7b shows the local-transit times of the particle 
along a short segment of the polymer (40 monomers in length) computed by count- 
ing how many time steps the particle spent in every spatial interval Ax = 0.25. It 
is clear from this figure that the particle spends more time in certain regions than 
in others, the former being more or less regularly spaced along the polymer. 

Spatial regularities in the dynamics of the system appear when we consider 
the particle velocity Fourier power spectrum which is shown in Fig. 8 for 

different realizations of monomer charges in the polymer. The parameter values in 
Fig. 8a and Fig. 8b are {a = 0.5, 0 = 1} and {a = 0.5, a = 4} respectively. Charge 
values are {±1, ±2}. Notice the dominant frequency k* in the power spectrum of 
the velocity (the highest peak), whose corresponding spatial periodicity is A* = 
27 t//c* - 3.3. 

If we now consider the power spectrum of the velocity of the particle through- 
out the polymer, using as in the previous section protein-coding sequences, we find 
a behavior as shown in Fig. 9 for the tuberculosis bacillus. This graph was gener- 
ated using coding sequences of 10000 monomers. The dominant sharp peaks are 
much higher now than for the random sequences and closer to 3. In reference [4] 
we show that this behavior is recurrent for different types of organisms. 
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Figure 8: Power spectrum of the velocity of the particle along the polymer for two random 
monomer charge realizations, = 27r/fc*. 



Tuberculosis Bacillus 




k 



Figure 9: Power spectrum of the velocity of the particle using protein-coding sequences of 
tuberculosis bacillus to assign the polymer charges. The sharp peak corresponds to a spatial 
A* = 3. The peak around fc = 4 is the second harmonic. 
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Figure 10: Power spectrum of the velocity of the particle using non-coding sequences of S. 
cerevisiade (yeast) and C. elegans (worm). In this case the spatial periodicity is weaker (or 
absent) than in Fig. 16. It seems that the non-coding sequences of real organisms have a random 
like structure. 



In Fig. 10 we show the velocity power spectra for intergenic non-coding se- 
quences which turn out to be similar to the ones obtained in the random case. In 
this sense, intergenic regions again appear to have a random structure. 

The most interesting dynamics occurs when an extended chain is interacting 
with the polymer. Under this situation, a multiple interaction prevails. At every 
moment there are several contact points between the chain and the polymer. As 
we have already pointed out, the multiple interaction between the chain and the 
polymer gives rise to a widely fluctuating probability function P^(d). The same 
occurs with the power spectrum of the velocity of the chain along the polymer. 

In Fig. 11 we show the power spectra of the velocity of the chain along the 
polymer for two different random realizations of monomer charges in the chain. The 
charges in the polymer were the same in both cases. These graphs were constructed 
with a polymer 500 monomers in length and a 10-monomer chain. The parameter 
values used were a = 0.5 and a — 1 with charge values {±1, ±2}. The figure shows 
considerable structure with dominant frequencies, even though the charges in both 
the polymer and the chain were assigned at random. In Fig. 11a this frequency 
corresponds to a a spatial periodicity A* ~ 2, whereas in Fig. 11b to A* ~ 3. The 
difference in these values may be related to the fact that for case (a) P 4 (d) has a 
maximum at d* =2, while for case (b) this occurs at d* =3. Further numerical 
exploration shows that whenever the probability function has a sharp maximum 
at a distance d*, the power spectrum of the velocity also presents a sharp peak 
corresponding to a spatial periodicity A* = d*. This observation, together with 
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Figure 11: Power spectrum of the velocity of a 10-monomer chain along a polymer 500 monomers 
in length for two realizations of monomer charges in the chain. 



the dominance of probability density functions P 4 (d) with d* = 3 for multiple 
interactions, implies that when we assign charges at random to the monomers 
in the polymer and in the chain, the most general outcome will be a dynamics 
registering three monomer lengths. 

6 Summary and discussion 

The results presented throughout this work suggest a possible scenario for the 
origin of the three base codon structure of the genetic code. In this scenario, the 
relative movement between a long and a short polymer in a quasi one dimensional 
geometry (one of them fixed to it), with a random interaction potential, subject to 
external forcing, exhibits a regular dynamics with a “preference” for a movement 
in steps of three bases. By “steps” we mean a slowing down of the relative velocity 
nearly every three monomers (see Fig. 7b). 

The preceding property is quite robust inasmuch as it hardly depends on the 
particular kind of electrostatic interaction between the polymer and the chain. 
The spatial distribution of interaction potential minima does not depend on the 
particular values of the monomer charges. Furthermore, for the polymer-particle 
case, the probability function Pm(o^), which gives the probability of two consecutive 
minima being separated by a distance d, only depends on m, namely, the number 
of different types of monomers. As m increases, the mean distance d between 
consecutive potential minima approaches three. Nevertheless, the most probable 
distance is d* = 2 (for m > 2). 
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Considerable changes take place when the charges along the polymer are as- 
signed in correspondence with protein-coding genetic sequences of real organisms. 
In this case, not only is the mean distance d between neighboring potential min- 
ima nearly three, but also the most probable one, d*, happens to be three. This is 
a remarkable property of protein-coding sequences, perhaps acquired throughout 
evolution. The fact that this “refinement” is absent in non-coding sequences of real 
organisms, strongly suggests that it is a consequence of the dynamical processes 
involved in the protein synthesis mechanisms. 

This interpretation is supported by the results obtained when the dynamics 
of the particle moving along the polymer is considered. In the random sequence 
case, there are dominant frequencies in the power spectrum of the particle velocity 
related with the spatial regularities of the interaction potential. Moreover, in the 
protein-coding sequence case, the power spectrum of the velocity shows a very well 
defined peak corresponding almost exactly to a spatial distance A* =3. Again, 
this behavior does not occur for non-coding sequences of real organisms, which are 
not involved in the translation processes. 

A richer dynamics emerges when the chain is composed of several mono- 
mers. In this, more realistic, multiple-interaction case, the probability function 
^m{d) presents very wide fiuctuations, depending on the particular assignment of 
monomer charges in the chain. Nevertheless, the most encountered configurations 
are those for which the probability function has its highest value at d* = 3. For 
these configurations, the power spectrum of the chain velocity along the polymer 
is indicative of a well defined spatial periodicity at A* 3. 

Our results suggest an origin of life scenario in which primordial molecular 
machines of chains moving along polymers in quasi one-dimensional geometries, 
that eventually led to the protein synthesis processes, were biased towards a dy- 
namics favoring the motion in “steps” or “jumps” of three monomers. The higher 
likelihood of these primitive “ribosomes” may have led to the present ribosomal 
dynamics where mRNA moves along rRNA in a channel conformed by the ribo- 
some. Thus, dynamics may have acted as one of the evolutionary filters favoring 
the three base codon composition of the genetic code. 
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Optimization and Physics: On the Satisfiability 
of Random Boolean Formulae 

Marc Mézard 



Abstract. Given a set of Boolean variables, and some constraints between them, is it 
possible to find a configuration of the variables which satisfies all constraints? This 
problem, which is at the heart of combinatorial optimization and computational 
complexity theory, is used as a guide to show the convergence between these fields 
and the statistical physics of disordered systems. New results on satisfiability, both 
on the theoretical and practical side, can be obtained thanks to the use of physics 
concepts and methods. 

Combinatorial optimization aims at finding, in a finite set of possible configura- 
tions, the one which minimizes a certain cost function. The famous example of 
the traveling salesman problem (TSP) can serve as an illustration: A salesman 
must make a tour through N cities, visiting each city only once, and coming 
back at its starting point. The cost function is a symmetric matrix Cij, where 
Cij is the cost for the travel between cities i and j. A permutation tt of the N 
cities gives a tour 7t(1) ^ 7t(2) ^ 7t(3) Taking into account the equiv- 

alence between various starting points and the direction of the tour, one sees 
that the number of distinct tour is {N — l)!/2. For each tour tt, the total cost is 
C = C 7 r(Tv) 7 r(i) + ^n(r) 7 r(r+i) , wMch cau be computed in N operations. The 

problem is to find the tour tt with lowest cost. 

As can be seen on this example, the basic ingredients of the optimization 
problems which will interest us are the following: 

• An integer N giving the size of the problem (in the TSP, it is the number of 
cities) . 

• A set of configurations, which typically scales like exp(A'^). 

• A cost function, which one can compute in polynomial time 0{N'^). 

Let me mention a few examples, beside the TSP [1]. 

In the assignment problem, one is given a set of N persons i = 1, . . . , A', a 
set of N tasks a = l,...,A, aAxA cost matrix c where Cia is the cost for having 
task a performed by person i. An assignment is a permutation n E assigning 
task a = 7t(z) to person z, and the problem is to find the lowest cost assignment, 
i.e. the permutation which minimizes C = Ci 7 r(i)- 

In the spin glass problem[2], one is given a set of N spins, which could 
be for instance Ising variables G {il}, the total energy of the configuration 
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is a sum of exchange interaction energies between all pairs of spins, E{{a}) = 
~ 'I2i<i<j<N seeks the ground state of the problem, i.e. the con- 

figuration (among the 2^ possible ones) which minimizes the energy. 

In physical terms, optimization problems consist in finding ground states. 
This task can be non trivial if a system is frustrated, which means that one cannot 
get the ground state simply by minimizing the energy locally. This is typically what 
happens in a spin glass. In some sense, statistical physics addresses a more general 
question. Assuming that the system is at thermal equilibrium at temperature 
T, every configuration C is assigned a Boltzmann probability, P{C) = 

Beside finding the ground state, one can ask also interesting questions about which 
are the typical configurations at a given temperature, like counting them (which 
leads to the introduction of entropy), or trying to know if they are located in one 
single region of phase space, or if they build up well separated clusters, as often 
happens in situation where ergodicity is broken. The generalization introduced 
by using a finite temperature, beside leading to interesting questions, can also be 
useful for optimization, both from the algorithmic point of view (for instance this 
is the essence of the simulated annealing algorithm[3], which is a general purpose 
heuristic algorithm that can be used in many optimization problems), but also 
from an analytic point of view [2] . Conversely, smart optimization algorithms turn 
out to be very useful in the study of frustrated physical systems like spin glasses 
or random field models, and the cross-fertilization between these two fields (and 
also with the related domain of error correcting codes for information transmission 
[4]) has been very fruitful in the recent years [5]. 

Before proceeding with one such example, let us briefly mention a few im- 
portant results in optimization which will provide the necessary background and 
motivation. One of the great achievements of computer science is the theory of 
computational complexity. It is impossible to present it in any details here and I 
will just sketch a few main ideas, the interested reader can study it for instance in 
[ 6 ], 

Within the general framework explained above, we can define three types 
of optimization problems: the optimization problem in which one wants to find 
the optimal configuration, the evaluation problem in which one wants to compute 
the optimal cost (i.e. the ground state energy), the decision problem in which one 
wants to know, given a threshold cost Co, if there exists a configuration of cost 
less than Cq. 

One classification of decision problems is based on the scaling with N of 
the computer time required to solve them in the worst case. There are two main 
complexity classes: 

• Class P = polynomial problems: they can be solved in a time t < N^. 
The assignment is an example of a polynomial problem, as is the spin glass 
problem in 2 dimensions. 

• Class NP = non-deterministic polynomial: Given a ’yes’ solution to a NP 
problem, it can be checked in polynomial time. Roughly speaking this means 
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that the energy is computable in polynomial time, so this class contains a 
wide variety of problems, including most of the ones of interest in physics. 
All problems mentioned above are in NP. 

One nice aspect of focusing on polynomiality is that it allows to forget about the 
details of the definition of N, the implementation, language, machine, etc. . . : any 
’reasonable’ such change (for instance one could have used the number of possible 
links in the assignment) will change the exponent of N appearing in the computer 
time of a problem in P, but not transform it into an exponential behaviour. A 
problem A is said to be at least as hard as a problem B if there exists a polynomial 
algorithm which transforms any instance of B into an instance of A. 

This allows to define the very important class: 

• NP-complete A problem is NP complete if it is at least as hard as any 
problem in NP. 

So the NP-complete are the hardest problems in NP. If one such problem can be 
solved in polynomial time, then all the problems in NP are solved in polynomial 
time. Clearly P is contained in NP, but it is not yet known whether P = NP , and 
this is considered as a major challenge. 

A great result was obtained in this field by Cook in 1971 [7]: The satisfiability 
problem, which we shall describe below, is NP-complete. Since then hundreds of 
problems have been shown to belong to this class, among which the decision TSP 
or the spin glass in dimension larger or equal to 3. 

The fact that 3-d spin glass is NP-complete while 2-d spin glass is P might 
induce one to infer that NP-completeness is equivalent with the existence of a 
glass transition. This reasoning is too naive and wrong; the reason is that the 
complexity classification relies on a worth-case analysis, while physicists study the 
behaviour of a typical sample. The development of a typical case complexity theory 
has become a major goal [8], also motivated by the experimental observations that 
many instances of NP-complete problems are easy to solve [5]. 

One way of addressing this issue of a typical case complexity is to define a 
probability measure on the instances (= the ’samples’) of the optimization problem 
which one is considering. Typical examples are: 

• TSP with independent random points uniformly distributed in the unit 
square 

• assignment with independent affinities uniformly distributed on [0, 1] 

• CuMn spin glass at one percent Mn 

Once this measure has been defined, one is interested in the properties of the 
generic sample in the N ^ oo limit. In most cases, global properties (e.g. extensive 
thermodynamic quantities, among which the ground state energy), turn out to be 
self averaging. This means for instance that the distribution of the ground state 
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energy density becomes more and more peaked around an asymptotic value in the 
large N limit: almost all samples have the same ground state energy density. In 
this situation, a statistical physics analysis is appropriate. Early examples of the 
use of statistical physics in such a context are the derivation of bounds for the 

optimal length of a TSP[9], the exact prediction of the ground state energy in the 

2 

random assignment problem defined above, where the result E’o derived in 

1985 through a replica analysis [10], was recently confirmed rigorously by Aldous 
[11], or the link between spin glasses and graph partitioning [12]. 

As statistical physics can be quite powerful at understanding the generic 
structure of an optimization problem, one may also hope that it can help finding 
better optimization algorithms. A successful example which was developed recently 
is the satisfiability problem, to which we now turn. 

As we have seen, satisfiability is a core problem in optimization and com- 
plexity theory. It is defined as follows [13]: A configuration is a set of N Boolean 
variables G {0, 1} i = 1, . . . , A'. One is given M constraints which are clauses, 
meaning that they are in the form of an OR function of some variables or their 
negations. For instance: xi V X27 V X3, xn V X2, are clauses (notation: X3 is the 
negation of 0:3). So the clause xi V X 27 V X 3 is satisfied if either = 1, or X 27 = 1, 
or X3 = 0 (these events do not exclude each other). The satisfiability problem is 
a decision problem. It asks whether there exists a choice of the Boolean variables 
such that all constraints are satisfied (we will call it a SAT configuration). This 
is a very generic problem, because one can see it as finding a configuration of the 
N Boolean variables which satisfies the logical proposition built from the AND of 
all the clauses (in our example (rri V X27 V X3) A {xn V X2) A . . .), and any logical 
proposition can be written in such a ’conjunctive normal form’. 

Satisfiability is known to be NP complete if it contains clauses of length > 3, 
but common sense and experience show that the problem can often be easy; for 
instance if the number of constraints per variable a = ^ is small, the problem is 
often SAT, if it is large, the problem is often UNSAT. 

This behaviour can be characterized quantitatively by looking at the typical 
complexity of the random 3-SAT problem, defined as follows. Each clause involves 
exactly three variables, chosen randomly in {xi, . . . ,xtv}; a variable appearing in 
the clause is negated randomly with probability 1/2. This defines the probability 
measure on instances for the random 3-SAT problem. The control parameter is 
the ratio Constraints/Variables a = ^ , 

The properties of random 3-SAT have been first investigated numerically 
[14, 15], and exhibit a very interesting threshold phenomenon at Oc ^ 4.26: a 
randomly chosen sample is generically SAT for a < ac (meaning that it is SAT 
with probability 1 when N oo), generically UNSAT for a > c^c. The time used 
by the best available algorithms (which have an exponential complexity) to study 
the problem also displays an interesting behaviour: For a well below ac, it is easy 
to find a SAT configuration; for a well above Oc, it is relatively easy to find a 
contradiction in the constraints, showing that the problem is UNSAT. The really 
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difficult region is the intermediate one where a ^ ac, where the computer time 
requested to solve the problem is much larger and increases very fast with system 
size. A lot of important work has been done on this problem, to establish the 
existence of a threshold phenomenon, give upper and lower bounds on etc, show 
the existence of finite size effects around a ~ etc with scaling exponents. We refer 
the reader to the literature [14, 15, 16, 17, 18, 19, 20], and just here extract a few 
crucial aspects for our discussion. The threshold phenomenon is a phase transition, 
and the neighborhood of the transition is the region where the algorithmic problem 
becomes really hard. 

This relationship between phase transition and complexity makes a statistical 
physics analysis of this problem particularly interesting. Monasson and Zecchina 
have been the first ones to recognize this importance and to use statistical physics 
methods for an analytic study of the random 3-SAT problem [21, 22]. Basically this 
problem falls into the broad category of spin glass physics. This is immediately seen 
through the following formulation. To make things look more familiar, physicists 
like to introduce for each Boolean variable Xi G {0, 1} an Ising spin ai G {-1,1} 
through Xi = A satisfiability problem like 



(a:i V X27 V X3)A(xii V X3 V X2)A. . . (1) 

can be written in terms of an energy function, where the energy is equal to one 
for each violated clause. Explicitely, in the above example, one would have 

1+CTi 1+(J27 1 — CT3 , 1— <^11 1+^3 1+^2 , 

2 ^ + ^ 2 

This is clearly a problem of interacting spins, with 1,2, and 3 spin interactions, 
disorder (in the choice of which variable interacts with which), and frustration 
(because some constraints are antagonist). More technically, the problem has a 
special type of three-spin interactions on a random hypergraph. 

Using the replica method, Monasson and Zecchina first showed the existence 
of a phase transition within the replica symmetric approximation, at Qc = 5.18, 
then showed that replica symmetry must be broken in this problem. Some varia- 
tional approximation to describe the replica symmetry breaking effects were de- 
veloped in particular in [23, 24]. 

Recently, in a collaboration with G.Parisi and R. Zecchina [25, 26], we have 
developed a new approach to the statistical physics of the random 3-SAT problem 
using the cavity method. While the cavity method had been originally invented to 
deal with the SK model where the interactions are of infinite range [2] , it was later 
adapted to problems with ’finite connectivity’, in which a given variable interacts 
with a finite set of other variables. While this is easily done for systems which 
are replica symmetric (like the assignment, or the random TSP with independent 
links) , it turned out to be considerably more difficult to develop the corresponding 
formalism and turn it into a practical method, in the case where replica symmetry 
is broken. This has been done in the last two years in joint works with G.Parisi 
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Figure 1: A pictorial description of the phase diagram for random 3-SAT obtained 
in the cavity method: clusters of SAT (in green/light gray) or UNSAT (in red/dark 
grey) configurations. One finds three qualitatively different phases, the EASY-SAT 
phase for a < the HARD-SAT phase for < o < Oc, the UNSAT phase 
obtained for ac < ol (see text) 



[28], and has opened the road to the study of finite connectivity optimization 
problems with replica symmetry breaking like random K-sat. Curiously, while the 
cavity method is in principle equivalent to the replica method (although it proceeds 
through direct probabilistic analysis instead of using an analytic continuation in 
the number of replicas), it turns out that it is easier to solve this problem with 
the cavity method. 

The analytic study of [25, 26] for the random 3-SAT problem at zero tem- 
perature shows the following phase diagram (see fig.l): 

• For a < ad = 3.921, the problem is generically SAT; the solution can be 
found relatively easily, because the space of SAT configurations builds up a 
single big connected cluster. A T = 0 Metropolis algorithm, in which one 
proposes to flip a randomly chosen variable, and accepts the change iff the 
number of violated constraints in the new configuration is less or equal to 
the old one, is able to find a SAT configuration. We call this the EASY-SAT 
phase 

• For ad < a < ac = 4.267, the problem is still generically SAT, but now 
it becomes very difficult to find a solution (we call this the HARD-SAT 
phase). The reason is that the configurations of variables which satisfy all 
constraints build up some clusters. Each such cluster, which we call a ’state’, 
is well separated from the other clusters (passing from one to the other 
requires flipping an extensive number of variables). But there also exist many 
“metastable states”: starting from a random configuration, a local descent 
algorithm will get trapped in some cluster of configurations with a given 
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finite energy (they all have the same number of violated clauses, and it is 
impossible to get out of this cluster towards lower energy configurations 
through any descending sequence of one spin flip moves). The number of 
SAT clusters M is exponentially large in N , it behaves as A/^ ~ exp(AE); 
but the number of metastable clusters is also exponentially large with a 
larger growth rate, behaving like exp(A'Ems) with Tims > T. The most 
numerous metastable clusters, which have an energy density Cth, will trap 
all local descent algorithms (zero temperature Metropolis of course, but also 
simulated annealing, unless it is run for an exponential time). 

• For a > ac, the problem is typically UNSAT. The ground state energy 
density eo is positive. Finding a configuration with lowest energy is also very 
difficult because of the proliferation of metastable states. 

A more quantitative description of the thermodynamic quantities in the var- 
ious phases is shown in fig. 2. The most striking result is the existence of an in- 
termediate SAT phase where clustering occurs. A hint of such a behaviour had 
been first found in a sophisticated variational one step replica symmetry breaking 
approximation developed in [23]; however this approximation predicted a second 
order phase transition (clusters separating continuously) , while we now think that 
the transition is discontinuous: an exponentially large number of macroscopically 
separated clusters appears discontinuously dX a = otd- Another point which should 
be noticed is the fact that the complexity, and the energy eth in the HARD- 
SAT phase are rather small: around eth ~ 3 10“^ violated clause per variable for 
a = 4.2. This shows that until one reaches problems with at least a few thousands 
variables, one cannot feel the true complexity of the problem. This can explain why 
the existence of the intermediate phase went unnoticed in previous simulations. 

The second type of results found in [25, 26, 27] is a new class of algorithms 
dealing with the many states structure. Basically this algorithm amounts to using 
the cavity equations on one given sample. Originally the cavity method was de- 
veloped to handle a statistical distribution of samples. For instance in the random 
3- SAT case its basic strategy is to add one extra variable and connect it randomly 
to a number of new clauses which has the correct statistical distribution. In the 
large N limit, the statistics of the local field on the new variable should be identical 
to the statistics of the local fields on any other variable in the absence of the new 
one. It turns out that this strategy can be adapted to study a single sample: one 
considers a given clause a, which involves three variables cri , ct 2 , < 73 . The cavity field 
on <Ji is the field felt by in the absence of a. In the catse where there exist many 
states, there is one such field for each possible state of the system, and the order 
parameter is the survey of these fields, in all the states of fixed energy density e: 

(3) 

a ^ ^ 
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Figure 2: Thermodynamic quantities for the random 3-SAT problem: eo is the 
ground state energy density (minimal number of violated clauses per variable); eth 
is the energy density of the most numerous metastable states, which trap the local 
descent algorithms; S is the complexity of SAT states with E = 0 



One can then write a recursion recursion between these surveys. Looking for in- 
stance at the structure of fig. 3, one gets the following iteration equation: 



PS (h) = C*J P:, (91 )dgi PS, {hi)dh^ {g2)dg2 PS, {h2)dh2 

d{h - f{gi,hi,g2,h2))exp (^-^w{gi,hi,g2,h2)j (4) 



The function / just computes the value of the new cavity field on (Jq in 
terms of the four cavity fields pi, hi, ^ 2 ,^ 2 . It is easily computed by considering 
the statistical mechanics problem of the five-spin system {o"o, <Ji, ti, ct 2 , T 2 } and 
summing over the 16 possible states of the spins <ti, ri, <J 2 , T 2 . The function w 
computes the free energy shift induced by the addition of ao to the system with 
the four spins ai , ri , (J 2 , T 2 . The exponential reweighting term in (4) is the crucial 
piece of survey propagation: it appears because one considers the survey at a fixed 
energy density e. As the number of states at energy E = Ne -\- 6 E increases in 
exp(y(5£’), where y = this favours the states with a large negative energy shift. 

The usual cavity method for the random 3-SAT problem determines the 
probability, when a new variable is added at random, that its survey Pq (/i) is a 
given function P(h): the order parameter is thus a functional, the probability of 
a probability. Because the fields are distributed on integers, this object is not as 
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Figure 3: The basic iterative structure of survey propagation. This subgraph is 
a part of a 3-SAT problem, in which the variable <Ji belongs to three 3-clauses, 
involving respectively the variables {<Ji, ri}, {< 72 , T 2 }, {< 73 , rs}. When the clause 3 
is switched off, the local cavity field survey Pq (/i) on spin ctq can be computed in 
terms of the cavity field surveys on each of the spins (Ji, n, ct 2 , T 2 . 



terrible as it may look and it is possible to solve the equation and compute the 
’complexity curve’ S(P), giving the phase diagram described above. 

The algorithm for one given sample basically iterates the survey propagation 
equation on one given graph. It is a message passing algorithm which can be 
seen as a generalization of the belief propagation algorithm familiar to computer 
scientists[29]. The belief propagation is a propagation of local magnetic fields, 
which is equivalent to using a Bethe approximation [30]. Unfortunately, it does 
not converge in the hard- SAT region because various subparts of the graph tend 
to settle in distinct states, and there is no way to globally choose a state. In this 
region, the survey propagation, which propagates the information on the whole set 
of states (in the form of a histogram), does converge. 

Based on the surveys, one can detect some strongly biased spins, which are 
fixed to one in almost all SAT configurations. The “Survey Inspired Decimation” 
(SID) algorithm fixes the spin which is most biased, then it re-runs the survey 
propagation on the smaller sample so obtained, and then iterates. . . An example 
of the evolution of the complexity as a function of the decimation is shown in 
4. This algorithm has been tested in the hard SAT phase. It easily solves the 
’large’ benchmarks of random 3sat at a; = 4.2 with N = 1000, 2000 available at 
[31]. It turns out to be able to solve typical random 3-SAT problems with up to 
N — 10^ at Of == 4.2 in a few hours on a PC, which makes it much better than 
available algorithms. The main point is that the set of surveys contains a lot of 
detailed information on a given sample and can probably be used to find many 
new algorithms, of which SID is one example. 
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Figure 4: The total complexity (= the In of the number of states) as a function of 
the total energy (= the number of violated clauses) for one given sample with N = 
10"^ variables and M = 4.2 10"^ constraints. The top curve is the original complexity. 
The next curves are the various complexities obtained for the decimated samples, 
plotted here every 200 decimations. One sees a global decrease of the number of 
metastable states, and also a global decrease of the threshold energy. In the end the 
problem has no more metastable states and can be solved by simple algorithms. 



To summarize, the recent statistical physics approaches to the random 3-SAT 
problem give the following results: 

• An analytic result for the phase diagram of the generic samples 

• An explanation for the slowdown of algorithms near Oc = 4.267: this is due to 
the existence of a HARD-SAT phase at a G [3.921, 4.267], with exponentially 
many metastable states. 

• An algorithm for single sample analysis: Survey propagation converges and 
yields very non trivial information on the sample. It can be used for instance 
to decimate the problem and get an efficient SAT-solver in the hard-SAT 
region. 

This whole set of results calls for a lot of developments in many directions. 

On the analytical side, the cavity method results quoted above are believed 
to be correct, but they are not proven rigorously. It would be very interesting 
to turn these computations into a rigorous proof. A very interesting step in this 
direction was taken recently by Franz and Leone who showed that the result for 
the critical threshold Oc obtained by the cavity method on random K-SAT with K 
even actually give a rigorous upper bound to the critical Oc [32]. The whole con- 
struction of the cavity method with the clustering structure, the many states and 
the resulting reweighting, has actually been checked versus rigorous computations 
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on a variant of the SAT problem, the random XORSAT problem, where rigorous 
computations [33] have confirmed the validity of the approach. 

On the numerical side, one needs to develop convergence proofs for survey 
propagation, and to derive the generalization of the algorithm for the case in 
which there exists local structures in the interaction graphs. This will amount 
to generalizing from a Bethe like approximation (with many states) to a cluster 
variational method with larger clusters (and with many states). 

The techniques can also be extended to other optimization problems like col- 
oring [34]. Beside the problems mentioned here, there exist many other fascinating 
problems of joint interests to physicists and computer scientists, like e.g the dy- 
namical behaviour of algorithms in optimization or error correcting codes, which 
I cannot survey here. Let me just quote a few recent references to help the readers 
through the corresponding bibliography [35, 36, 37, 38]. 
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of Nonequilibrium Statistical Mechanics? 



David Ruelle 



Abstract. This is a discussion of a certain number of questions that occur naturally in 
the perspective of creating a unified theory of nonequilibrium statistical mechanics. 



Introduction 

As to the question in the title of this talk, the answer is currently: NO! There 
is a fairly satisfactory theory of nonequilibrium close to equilibrium, with the 
Onsager reciprocity relations, and the Green-Kubo formula, and many important 
results have been obtained recently far from equilibrium, but not a unified theory. 
In fact, nonequilibrium covers a variety of different physical phenomena, such as 
hysteresis and decay of metastable states, and one does not really expect a unified 
description for all those. But in situations like the study of nonequilibrium steady 
states, one might hope for a unification of the different idealizations currently 
studied (small system interacting with stochastic reservoirs, small system with 
isokinetic thermostat, etc.). This unification has up to now not been attained, and 
one may thus be concerned that perhaps some of the idealizations used are not 
doing justice to the physics of the problem. 

But let us not be too pessimistic. We do know a certain number of things 
about the physics of nonequilibrium, and we also have some ideas of where to look 
in order to progress. This is what I would like to discuss in this talk, and also why 
one would like to have a more conceptual and unified theory. 

One thing that characterizes nonequilibrium is (strictly positive) entropy pro- 
duction. It may be surprizing that one can speak of entropy production even though 
entropy itself is not very well defined far from equilibrium. Indeed there have been 
speculations that, because biological phenomena are far from equilibrium, they 
could defeat the second law. In fact if one considers any process starting and end- 
ing close to equilibrium (however far one may go in between), the entropy at the 
end is bigger than at the beginning (if this were not the case on could build a 
device that would convert heat into mechanical work, an the evidence is against 
the existence of such a device). So, we can speak of entropy production, and com- 
pute it in different idealizations of nonequilibrium far from equilibrium. This does 
not mean that we have a good understanding of all the situations of interest. In 
particular we shall want to come back to the discussion of biological phenomena 
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which are currently treated by ad hoc methods that remain unsatisfactory from a 
fundamental point of view. 

One mathematical description of entropy production is as en increase of the 
volume of phase space “occupied by the system”. The idea is that microscopic 
states that are indistinguishable macroscopically occupy a volume of phase space 
that increases with time. Let us give only the briefest discussion of this somewhat 
messy subject, where there is still disagreement between experts. The problem 
is that Hamiltonian evolutions preserve phase space volume. One way to have 
nevertheless an apparent volume increase is that the (Hamiltonian) time evolution 
spreads an initial blob of phase space thinly so that it becomes dense in a larger 
volume of phase space. But this leads one to introduce “coarse graining” which 
is hard to handle in a mathematically satisfactory manner. One is then led to 
studying infinite Hamiltonian systems, which are technically difficult and for which 
the concept of volume no longer makes natural sense. For such infinite Hamiltonian 
systems the entropy production arises from the fact that there are correlations in 
the description of the system which, in the course of time, become spread out over 
larger and larger regions of space, to be finally lost “at infinity” . 

A mathematically convenient way to obtain nontrivial entropy production 
is to use the non-Hamiltonian dynamics corresponding to one of the “Gaussian 
thermostats” introduced by Hoover and Evans, for instance the isokinetic (IK) 
thermostat. Here one subjects a finite Hamiltonian system (“small system”) to 
an non-Hamiltonian force, and and adds a deterministic term corresponding to a 
“thermostat” L The small system gives entropy to the thermostat, so that there is 
volume contraction in the (compact) phase space of the small system. The average 
rate of phase space contraction of the phase space volume of the small system is 
in fact the average entropy production. It may be computed for a nonequilibrium 
steady state represented by a singular measure invariant under time evolution. 
Gaussian thermostats were introduced for the purpose of computer simulations, 
but turn out to be very well suited to theoretical investigations. Not everyone is 
happy with their physical interpretation, but they are mathematically very conve- 
nient, and allow an in depth discussion of otherwise inaccessible questions. Among 
the results obtained is the important Fluctuation Theorem of Gallavotti and Cohen 
[4] , other developments are reviewed in [9] . 

While this is not the place to give an extensive bibliography we must also 
mention here the rigorous results of Eckmann and coworkers [3] on a classical 
small system (anharmonic chain) coupled to infinite reservoirs. The difficulty of 
the mathematical problems encountered in nonequilibrium is illustrated by the 
fact that no proof of Fourier’s law has been obtained at this time. Quantum spin 
systems spin systems coupled to infinite reservoirs are relatively accessible math- 
ematically, and have been studied by Ruelle [11], and by Jaksic and Pillet [6]. 

^for the casual reader, let us insist that Gaussian thermostats have nothing to do with Gaus- 
sian stochastic forces, the Gaussian thermostat is strictly deterministic, as we shall see below in 
the example of the IK thermostat. 
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Let us now look at the dynamical processes that can lead to entropy produc- 
tion. One has of course tried to study heat conduction through a chain of harmonic 
oscillators, and the results obtained are pathological: Fourier’s law is violated. Why 
is simple dynamics inadequate for nonequilibrium? Using the formalism of Gaus- 
sian thermostats where entropy production is the rate of volume contraction, or 
equivalently minus the sum of all Lyapunov exponents, one sees that nontrivial 
entropy production entails some non zero Lyapunov exponents, which means some 
kind of dynamical chaos. Completely integrable systems, like harmonic chains, are 
thus excluded. Chaotic dynamics appears thus to be linked with irreversible be- 
havior. This remark applies without regard to the spatial extension of the system. 
For systems with spatial extension, irreversibility is typically linked with diffusive 
behavior. This is perhaps best seen in the parabolic character of the macroscopic 
evolution equations (for material diffusion, heat propagation, of hydrodynamics) . 
At the microscopic level, diffusive behavior appears in the study of quantum spin 
systems where the dynamics is deterministic. But microscopic diffusive behavior 
also arises in classical systems as a consequence of chaos, because the dynamically 
unstable time evolution approximates a stochastic motion. 

We are loosely using the terms “nonequilibrium statistical mechanics” and 
“irreversibility” to describe the microscopic behavior underlying the well known 
macroscopic irreversible behavior described by nonequilibrium thermodynamics. 
The microscopic dynamics may be chaotic, diffusive, or stochastic, and we have 
seen the physical relations between these different types of dynamics. But at the 
mathematical level we certainly do not have a unified theory. 

In spite of this lack of unity one can observe a common concern for the anal- 
ysis of fluctuations (as in the Gallavotti-Cohen Fluctuation Theorem), and also 
of linear response of a system to small changes in the forces applied to it. The 
two questions are connected. In fact, close to equilibrium we have a Fluctuation- 
Dissipation Theorem, which relates the linear response {= dissipation) to the spon- 
taneous fluctuations in the system. Technically, the fluctuations are expressed in 
terms of time (auto-) correlation functions. The Fourier transform of a time auto- 
correlation function is a so-called power spectrum. The linear response is given by a 
response function, which has a Fourier transform known as susceptibility. We shall 
concern ourselves below with the problem of extending the fluctuation-dissipation 
theorem to situations far from equilibrium. At this point let us remark that the 
physical diversity of nonequilibrium systems expresses itself at the level of fluctua- 
tions. For instance, under the Gallavotti-Cohen chaotic hypothesis a small system 
with Gaussian thermostat has in principle exponentially decreasing time correla- 
tion functions. But an extended system with diffusion in d dimensions has time 
correlations decaying like 
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Nonlocal fluctuations in stochastic models 

The matrix method of Derrida permits a rigorous study of a one-dimensional lat- 
tice system (chain) where particles jump independently from any occupied site to 
one of the two neighboring sites provided the latter is empty. Using this method, 
Derrida, Lebowitz and Speer [2] have studied the continuous limit of a chain for 
which the densities of occupied sites at the two ends are maintained at different 
values. As could be expected one finds a linear density profile along the chain, 
interpolating between the the values imposed at the endpoints. Interestingly one 
can compute exactly the probability of density fluctuations, and the expression ob- 
tained turns out to be nonlocal, ie., it is not an integral over the chain of a local 
function of the density and its derivatives. This is quite different from the behav- 
ior of fluctuations in equilibrium statistical mechanics (and contradicts some older 
ideas). Jona-Lasinio and coworkers [1] have obtained more general exact formulas 
of the same type, but their approach is less rigorous. It is interesting that nonlocal- 
ity is observed even though the dynamics is stochastic and thus not quite realistic. 
For a more realistic (Hamiltonian) system, one does expect nonlocal fluctuations 
because entropy production corresponds to loss of information at infinity, as we 
have indicated earlier. Procaccia, who made an early study of the sort of model 
discussed here [7] , has convinced me that the nonlocality observed in the stochastic 
models is of a somewhat more trivial nature as that which would correspond to 
loss of information at infinity. 



Interlude: biology 

Nonequilibrium situations are, in many cases of physical interest, locally close to 
equilibrium, i.e., close to equilibrium near each point of space and time. This ig 
believed to be usually the case in transport phenomena like diffusion, heat propa- 
gation, and hydrodynamics. It is however not the case in chemical reactions, and 
in particular in most phenomena associated with life. Where biology is concerned 
one is thus confronted with problems of nonequilibrium far from equilibrium, an 
example being that of motor proteins^ which are currently the object of intense in- 
vestigation. Let me mention in this respect a recent book by Howard [5] which tries 
to dispel a certain number of misconceptions. The author insists that the proteins 
that play a role in biological motors have the stiffness of “hard plastic” , not jello, 
and that friction is intense: inertia is unimportant, motion is overdamped, nonoscil- 
latory. To study the physics of motor proteins, one uses the pheneomenological laws 
of friction and other ad hoc assumptions. These are physically reasonable, but can 
one not do better? If one had a fundamental theory of nonequilibrium statistical 
mechanics that one could apply, this would be esthetically more satisfactory. But 
more importantly, it would ensure a correct approach to the problem, and avoid 
possibly serious inconsistencies. 
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Chaotic models with Gaussian thermostats 

I want here to discuss the question of linear response in the framework of chaotic 
models with a Gaussian thermostat. This is not a new topic, and the reason to 
come back to it is that there has been some concern about the correctness of results 
announced in [8]. Indeed the proof given there is careless, but the critics of the 
earlier version (D. Dolgopyat and M. Jiang) accept the revised argument given 
in [10]. 

Consider the Hamiltonian equations 

Ifp) = 

dt \q) \p/m) 

where ^{q) = — gradl^ (q). A nonequilibrium time evolution is obtained if we replace 
^{q) by a non gradient term, in particular if ^ is a vector field which is locally but 
not globally a gradient. But then the energy is not conserved, so that the time 
evolution is no longer constrained to a compact submanifold of phase space. A 
nice way to constrain the system to a compact manifold is due to Gauss. In the 
special example where the kinetic energy is maintained at a fixed value (this is the 
IK thermostat), the Hamiltonian evolution is replaced by 

f^iq) - ap\ 
dt \qj V P/'rn ) 

where a = a{p,q) = P-^/p-P- For our purposes we may just as well consider a 
general smooth time evolution on a compact manifold and, for simplicity, we shall 
take the time variable to be discrete (in Z instead of R). Of the original physics we 
retain that the entropy production is the rate of volume contraction (which volume 
element is used will not be important). We shall impose that the time evolution 
is chaotic in the strong form of the “chaotic hypothesis” of Gallavotti and Cohen, 
ie., assume that the time evolution is uniformly hyperbolic (in the sense of Smale’s 
Axiom A) . Finally, as “physical measure” describing a nonequilibrium steady state 
we choose an SRB measure on an attractor for the Axiom A diffeomorphism / 
describing the time evolution. If we start with a measure absolutely continuous 
with respect to Lebesgue in the basin of the attractor, the SRB measure p/ is 
its large time limit. The problem of linear response is to study how pf changes, 
to first order, when / is varied. We write Sfof~^—X,so that X is the vector 
field associated with an infinitesimal change of /. On the attractor which supports 
pfj one can write X = X^ + X'^ the decomposition of X into components in the 
stable (contracting) and unstable (expanding) directions respectively. With these 
definitions we have now the following theorem [8], [10]: 

Let r > 3, and K C M be a mixing Axiom A attractor with SRB state pf 
for the (T diffeomorphism f : M ^ M. Then the map f pj is from the 

(7 diffeos of M to the space of distributions on M. To first order we 
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have 

oo 

n— 0 
oo 

= ^ p/[((grad$) o r, {TDX’>) - ($ o Ddiv“X“] 

n=0 

where div^ is the divergence on unstable manifolds computed with repect to the 
natural measure on those manifolds. 

The natural measure on unstable manifolds is that induced by the SRB mea- 
sure. Since is in general only Holder continuous, div^X^ is in principle only 
a distribution (as remarked by Dolgopyat). It turns out however that (because of 
absolute continuity of the stable foliation), div^X^ is in fact a Holder continuous 
function on K. Also, one needs to take the divergence with respect to the natu- 
ral measure on unstable manifolds, otherwise an extra term appears as noted by 
Jiang, 

The above formulas are important in view of physical applications. Let us 
introduce instead of X = Sf o a time dependent perturbation Xt = Stf o f~^ 
(where f remains time independent). The perturbed state p/ + Stp is now given 
(at time i = 0) by 

oo 

^ p/{grad($o/"),X_n) 

n=0 

oo 

= E P/K(g>^ad$) o r, (TDXl J - ($ o /")div“X“ J 

n=0 

and similarly for continuous time. (We do not try to be precise here: there should 
be conditions on which functions t Xt are allowed). Note that Jop(^) is the 
response at time 0 to the perturbations ôtf, and only the pertubations for t < 0 
play a role (causality). The two terms in the right-hand side both converge expo- 
nentially in n. The function t i— » p/[(^ o /^)div^X^] is a time correlation func- 
tion, and the fact that one term in the response function is a time correlation 
function may be viewed as a form of the fluctuation dissipation theorem (what re- 
mains of the fluctuation-dissipation theorem far from equilibrium). But the term 
p/((grad^) o /^, (T/^)X^) is of a different sort. On one hand, since the attractor 
contains the unstable directions, it is not too astonishing that the term contain- 
ing X^^ be expressible in terms of spontaneous fluctuations of the system. On the 
other hand the term containing X^ corresponds to shaking the system into direc- 
tions that it cannot spontaneously explore, this term is thus of a different nature. 
[Near equilibrium, ie., for Hamiltonian dynamics, the measure p/ is smooth in 
stable as well as unstable directions, which explains the more complete form of 
the fluctuation-dissipation theorem in that case]. 

A word of caution to finish. The above considerations are based on the chaotic 
hypothesis of uniform hyperbolicity of the dynamics. This will virtually never 
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be satisfied in practice, and we are still far from a more general understanding 
of dynamics which would permit a serious weakening of the chaotic hypothesis. 
Furthermore, Gaussian thermostats are just one idealization of nonequilibrium 
statistical mechanics, and it remains to be seen to what extent the results indicated 
above extend to other idealizations. 
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Finite-Dimensional Spin Glasses: 
States, Excitations, and Interfaces 

C.M. Newman and D.L. Stein* 



Abstract. We discuss the underlying connections among the thermodynamic prop- 
erties of short-ranged spin glasses, their behavior in large finite volumes, and the 
interfaces that separate different pure states, and also ground states and low-lying 
excitations. 



1 Introduction 

In this talk, we focus on the following question: assuming that there exists in 
finite dimensions a low-temperature spin glass phase with broken spin-inversion 
symmetry, what are the relationships among the possible types of ground states 
that might be present, their low-energy excitations, and the interfaces that separate 
those states? Answering this question provides a handle on the broader nature of 
the low- temperature spin glass phase, if it exists. 

While our results apply to a wide class of short-ranged models, we focus here 
for specificity on the Edwards- Anderson (EA) Ising model [1], with Hamiltonian: 

7^^ “ ^ ^ Jxy^x^y ^ ^ ^ 5 (1) 

<x,y> X 

where x is a site in a d-dimensional cubic lattice, <7a; = ±1 is the Ising spin at 
site X, h is an external magnetic field, and the first sum is over nearest neighbor 
pairs of sites. To keep things simple, we take h — 0 and the spin couplings Jxy to 
be independent Gaussian random variables whose common distribution has mean 
zero and variance one. The absence of a field and the symmetry of the coupling 
distribution results in an overall global spin inversion symmetry. We denote by J 
a particular realization of the couplings, corresponding physically to a specific spin 
glass sample. 

It is important to consider both equilibrium and nonequilibrium properties 
arising from (1), but we consider here only the former. We remain far from a 
comprehensive theory of the statistical mechanics of the EA Hamiltonian, but 
nonetheless there has in recent years been substantial analytical - and even some 
rigorous - progress in understanding what can and cannot occur in a putative 
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low-temperature spin glass phase. In this talk we examine the relationships among 
three properties of this phase: 

• Interfaces between ground states. A ground state is an infinite- volume spin 
configuration whose energy cannot be lowered by the flip of any finite subset of 
spins. It can be constructed as the limit of a sequence of finite- volume ground 
states, i.e., the lowest-energy spin configurât ion (s) consistent with some boundary 
condition in a finite volume. The interface between two ground states is the set of 
all couplings that are satisfied in one and not the other. 

• Thermodynamic voliune {V ^ oo) behavior. 

• Behavior in typical large finite volumes. 

Our emphasis will be on the interconnectedness of these three: conclusions 
about any one can provide important information about the others. However, these 
relationships may not be at all straightforward; one theme emerging from our work 
is that they are far more complicated than in homogeneous systems. 

2 ‘Observable’ vs. ‘Invisible’ States 

How might one “see” the structure of the spin glass phase? There are several 
possible procedures. As in many other statistical mechanical problems, one can 
study the effects of a change in boundary conditions. Consider a cube Al of side 
L centered at the origin with periodic boundary conditions. There will a spin-flip- 
related pair of spin configurations within Al of lowest energy. This finite- volume 
ground state pair will change in general with the boundary condition; for example, 
if one switches to antiperiodic boundary conditions, then there will be a new 
ground state pair with some interface relative to the old one. 

One important question is whether such interfaces are pinned or deflect to 
infinity: if for any fixed Lq, the relative interface eventually moves (and stays) 
outside of Alo as L ^ cx), the interface has ‘deflected to infinity’; if it remains 
inside Alq, it is ‘pinned’ (for an illustration, see Fig. 2 of [2]). Pinned interfaces 
imply the existence of infinite- volume multiple ground state pairs. 

More recent techniques are useful for studying excitations whose energies 
above a finite- volume ground state are of 0(1) independent of the volume size. 
Consider again a cube Al with periodic boundary conditions. The method of 
Krzakala and Martin (KM) [3] is to choose an arbitrary pair of spins, and force 
them to have an orientation opposite to that in the ground state pair. The method 
of Palassini and Young (PY) [4] is to add a perturbation that lowers the energy 
of a spin configuration by an amount proportional to the fraction of bonds in an 
interface relative to the ground state pair. Either way, there will be a new spin 
configuration pair that minimizes the energy. 

What these and most other widely-used procedures have in common is that 
they are not explicitly coupling-dependent; e.g., the boundary conditions used are 
independent of the coupling realization J. We call ‘observable’ any interfaces (and 
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resulting states) that can be constructed in this way; interfaces or states that can 
only be seen using coupling- dependent boundary conditions we call ‘invisible’. A 
detailed discussion of the reasons behind these designations is given in [5] (see also 
the discussion in Sec. 3 of [6]). 

3 Interfaces 

We now focus on interfaces separating spin configurations. The analysis is at T = 0 
but can be extended to nonzero temperature. That is, we confine our attention 
here to interfaces separating either different ground states or ground states and 
excitations. For now, we won’t worry how the interfaces arise: they might have 
arisen between ground states in a single volume under a switch from periodic to 
antiperiodic boundary conditions, or from using the KM or PY procedures, or 
through some other coupling-independent method. Our discussion below applies 
to all. 

The main features we study include: 

• Spatial Structure. In other words, is the interface ‘space-filling’ or ‘zero- 
density’? By space-filling we mean the following. Consider in d dimensions a se- 
quence of cubes Al, each containing an interface generated through a common 
procedure (e.g., switching from periodic to antiperiodic boundary conditions). If, 
for each L, the number of bonds in the interface scales as with dg = d, then 
the interface is space-filling. If dg < d, it is zero-density. 

One of the interesting possibilities arising in spin glasses is the possibility of 
space-filling interfaces separating ground or pure states; such a possibility cannot 
arise, e.g., in ferromagnets. Space-filling interfaces (suitably redefined) are believed 
[7] to separate different pure states in the low-temperature phase of the infinite- 
ranged Sherrington-Kirkpatrick (SK) model [8]. 

• Energetics. Spin glass interfaces can also have unusual energetic properties. 
In a ferromagnet, whether homogeneous or disordered, the energy of an interface 
scales linearly with the number of bonds comprising it. This is not necessarily so in 
a spin glass. One intriguing possibility is that - as happens in the SK model - an 
interface might have 0(1) energy independently of its size. The other possibility is 
that the energy of an interface in the volume A^ is O(L^), with d > 0. (We ignore 
unlikely special cases such as logarithmic and other non-power-law dependences.) 

• Pinning. A property crucial to the nature of states separated by an interface 
is whether the interface is pinned or deflects to infinity, as discussed in Sec. 2. As 
shown in [2, 9], which of these occurs is not independent of the spatial structure, for 
interfaces separating observable states: a space-filling interface must be pinned, and 
a zero-density one (generated by a coupling-independent procedure) must deflect 
to infinity. 

Each of these interface properties corresponds to a major scenario proposed 
for the low- temperature spin glass phase of the EA model in finite dimensions. 
This is summarized in Fig. 1, and each will be discussed in turn. 
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Figure 1: Table illustrating the correspondence between a type of interface and a 
scenario for the structure of the low- temperature spin glass phase in finite dimen- 
sions. 



3.1 Mean-Field Picture 

The picture arising from Parisi’s solution [7, 10, 11, 12, 13] of the infinite-ranged SK 
model is known as the replica symmetry breaking (RSB) theory. There have been 
many papers written in support of the notion that the RSB theory should apply 
as well to short-ranged spin glasses. We will not discuss all of RSB’s features here; 
given the subject of this talk, we focus on the prediction [14, 15] of RSB theory that 
interfaces separating ground states are both space-filling and can have energies of 
0(1), i.e., that don’t scale with the size of the system. Hence the designation ‘RSB 
interfaces’ in the upper left-hand corner of the table in Fig. 1. 

A central result of [2] was a proof that the existence of space-filling interfaces 
(regardless of how their energies scale, as long as the interfaces are generated 
by coupling- independent methods) is a sufficient condition for the existence of 
multiple thermodynamic pure state pairs. In order to have a nontrivial overlap 
function (at nonzero temperature) of the sort characteristic of RSB theory [7], 
we have shown that the set of all states generated through sequences of finite- 
volume Gibbs states must partition into a union of thermodynamic mixed states 
r, with each F being a nontrivial collection of infinitely many pure states with 
certain weights. Different F’s would appear in different finite volumes; for details, 
see [16, 17, 18, 19]. One can also prove that there must be an uncountable infinity 
of pure states in the union of all F’s [20]. 

So in this picture the connections among the three properties listed in Sec. 1 
is that RSB interfaces imply (and are implied by) this complex thermodynamic 
structure, and by nontrivial overlap structure in large finite volumes. However, 
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thermodynamic arguments show [2, 16, 17, 18, 19, 21] that this thermodynamic 
structure cannot be supported in finite-dimensional, short-ranged spin glasses, in 
turn ruling out the possibility that space-filling, 0(1) energy interfaces can arise 
in these systems. 



3.2 Chaotic Pairs Picture 

The upper right-hand corner of Fig. 1 raises the possibility of space-filling interaces 
whose energy scales as with ^ > 0. (For reasons that won’t be discussed here, 
there is an upper bound [22, 23] of ^ < {d— l)/2 for observable states.) This leads 
directly to the chaotic pairs picture [16, 17, 18, 19, 23] in which many pure state 
pairs exist, but only a single spin- reversed pair appears in any large finite volume, 
leading to a trivial overlap structure and no ultrametricity or related features of 
the RSB scenario. 



3,3 Droplet /Scaling Picture 

Continuing clockwise around the table in Fig. 1, we come to interfaces that are zero- 
density and with energy scaling as L^, with 0 > 0. These interfaces characterize 
droplet excitations in a two-state picture developed by Macmillan [24], Bray and 
Moore [25], and Fisher and Huse [22]. This picture is well-known and won’t be 
discussed in detail here. 



3.4 ‘TNT’ Picture 

We come finally to the last entry, which conjectures interfaces that are both zero- 
density and whose energies remain 0(1) independently of lengthscale. This picture 
was proposed by Krzakala and Martin [3] (who denoted the picture ‘TNT’ for 
trivial link overlap and nontrivial spin overlap) and Palassini and Young [4] . It can 
be shown that zero- density interfaces cannot separate observable pure or ground 
states [9], so these states, should they exist, would necessarily be excitations, as 
in the droplet- scaling picture. 



4 Conclusion 

The purpose of this talk has been to demonstrate that there are deep connections 
among the thermodynamic structure of spin glass states, their behavior in large 
finite volumes, and the interfaces that separate these states (and/or their excita- 
tions). Tools developed for any one can lead to strong conclusions about the other 
two. This interconnection has allowed substantial analytical, and even rigorous, 
progress on short-ranged spin glasses. 
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The Functional Renormalization Group 
Treatment of Disordered Systems, a Review 



Kay Jorg Wiese 



Abstract. We review current progress in the functional renormalization group treat- 
ment of disordered systems. After an elementary introduction into the phenomenol- 
ogy, we show why in the context of disordered systems a functional renormalization 
group treatment is necessary, contrary to pure systems, where renormalization of a 
single coupling constant is sufficient. This leads to a disorder distribution, which af- 
ter a finite renormalization becomes non-analytic, thus overcoming the predictions 
of the seemingly exact dimensional reduction. We discuss, how a renormalizable field 
theory can be constructed, even beyond 1-loop order. We then discuss an elastic 
manifold embedded in N dimensions, and give the exact solution for A” ^ oo. This 
is compared to predictions of the Gaussian replica variational ansatz, using replica 
symmetry breaking. We finally review depinning, both isotropic and anisotropic, 
and the scaling function for the width distribution of an interface. 



1 Introduction 

Statistical mechanics is by now a rather mature branch of physics. For pure sys- 
tems like a ferromagnet, it allows to calculate so precise details as the behavior of 
the specific heat on approaching the Curie-point. We know that it diverges as a 
function of the distance in temperature to the Curie-temperature, we know that 
this divergence has the form of a power-law, we can calculate the exponent, and 
we can do this with at least 3 digits of accuracy. Best of all, these findings are in 
excellent agreement with the most precise experiments. This is a true success story 
of statistical mechanics. On the other hand, in nature no system is really pure, 
i.e. without at least some disorder (“dirt”). As experiments (and theory) seem to 
suggest, a little bit of disorder does not change the behavior much. Otherwise ex- 
periments on the specific heat of Helium would not so extraordinarily well confirm 
theoretical predictions. But what happens for strong disorder? By this I mean that 
disorder completely dominates over entropy. Then already the question: “What is 
the ground-state?” is no longer simple. This goes hand in hand with the appear- 
ance of so-called metastable states. States, which in energy are very close to the 
ground-state, but which in configuration-space may be far apart. Any relaxational 
dynamics will take an enormous time to find the correct ground-state, and may 
fail altogether, as can be seen in computer-simulations as well as in experiments. 
This means that our way of thinking, taught in the treatment of pure systems, has 
to be adapted to account for disorder. We will see that in contrast to pure systems, 
whose universal large-scale properties can be described by very few parameters, 
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Figure 1: An Ising magnet at low temperatures forms a domain wall described by 
a function u{x) (right). An experiment on a thin Cobalt film (left) [14]; with kind 
permission of the authors. 



disordered systems demand the knowledge of the whole disorder-distribution func- 
tion (in contrast to its first few moments). We show how universality nevertheless 
emerges. 

Experimental realizations of strongly disordered systems are glasses, or more 
specifically spin- glasses, vortex-glasses, electron- glasses and structural glasses (not 
treated here). Furthermore random- field magnets, and last not least elastic systems 
in disorder. 

What is our current understanding of disordered systems? It is here that 
the success story of statistical mechanics, with which I started, comes to an end: 
Despite 30 years of research, we do not know much: There are a few exact solutions, 
there are phenomenological methods (like the droplet-model), and there is the 
mean- field approximation, involving a method called replica- symmetry breaking 
(RSB). This method is correct for infinitely connected systems, e.g. the SK-model 
(Sherrington Kirkpatrick model), or for systems with infinitely many components. 
However it is unclear, to which extend it applies to real physical systems, in which 
each degree of freedom is only coupled to a finite number of other degrees of 
freedom. 

Another interesting system are elastic manifolds in a random media, which 
have the advantage of being approachable by other (analytic) methods, while still 
retaining all the rich physics of strongly disordered systems. Here, I review recent 
advances obtained in collaboration with Pierre Le Doussal [1, 2, 3, 4, 5, 6, 7, 8, 9, 
10, 11, 12]. This review is an extended version of [13]. 

2 Physical realizations, model and observables 

Before developing the theory to treat elastic systems in a disordered environment, 
let us give some physical realizations. The simplest one is an Ising magnet. Im- 
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posing boundary conditions with all spins up at the upper and all spins down at 
the lower boundary (see figure 1), at low temperatures, a domain wall separates a 
region with spin up from a region with spin down. In a pure system at temperature 
T = 0, this domain wall is completely flat. Disorder can deform the domain wall, 
making it eventually rough again. Two types of disorder are common: random 
bond (which on a course-grained level also represents missing spins) and random 
field (coupling of the spins to an external random magnetic field). Figure 1 shows, 
how the domain wall is described by a displacement field u(x). Another example is 
the contact line of water (or liquid Helium), wetting a rough substrate, see figure 2. 
(The elasticity is long range). A realization with a 2-parameter displacement field 
u(x) is the deformation of a vortex lattice: the position of each vortex is deformed 
from X to X -h u(x). A 3-dimensional example are charge density waves. 

All these models have in common, that they are described by a displacement 

field 

— » u{x) € . (2.1) 

For simplicity, we set AT = 1 in the following. After some initial coarse-graining, 
the energy 7i = 7ie\ + Hdo consists out of two parts: the elastic energy 

Hei[u]= J d'^x^(Vu{x)f ( 2 . 2 ) 

and the disorder 

^doM = J d"^xV{x,u{x)) . (2.3) 




Figure 2: A contact line for the wetting of a disordered substrate by Glycerine 
[15]. Experimental setup (left). The disorder consists of randomly deposited is- 
lands of Chromium, appearing as bright spots (top right). Temporal evolution of 
the retreating contact-line (bottom right). Note the different scales parallel and 
perpendicular to the contact-line. Pictures courtesy of S. Moulinet, with kind per- 
mission. 
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Figure 3: Cartoon of an elastic lattice (e.g. a vortex lattice) deformed by disorder. 
This is described by a vector u{x). 



In order to proceed, we need to specify the correlations of disorder. Suppose that 
fluctuations u in the transversal direction scale as 

{u{x) - u{y)f ~ |a; - (2.4) 

with a roughness- exponent C < 1- Starting from a disorder correlator 

V (u, x)V (u', x') = f{x — x')R{u — u') (2.5) 

and performing one step in the RG- procedure, one has to rescale more in the 
a:-direction than in the ix-direction. This will eventually reduce f{x — x') to a 
^-distribution, whereas the structure of R{u — u') remains visible. We therefore 
choose as our starting-model 

V{u,x)V{u',x') := S^{x - x')R{u - u') . (2.6) 

There are a couple of useful observables. We already mentioned the roughness- 
exponent C- The second is the renormalized (effective) disorder. It will turn out that 
we actually have to keep the whole disorder distribution function R{u), in contrast 
to keeping a few moments. Other observables are higher correlation functions or 
the free energy. 

3 Treatment of disorder 

Having defined our model, we can now turn to the treatment of disorder. The 
problem is to average not the partition-function, but the free energy over disorder: 
= \nZ. This can be achieved by the beautiful replica-trick. The idea is to write 

lnZ= lim - - l) = lim - (2” 1) (3.1) 

n^O n ^ ' n^o n 

and to interpret as the partition-function of an n times replicated system. 
Averaging disorder then leads to the replica- Hamiltonian 

'H[u\ = f ^ - Ub(x)) . (3.2) 
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Let us stress that one could equivalently pursue a dynamic or a supersymmetric 
formulation. We therefore should not, and in fact do not encounter, problems 
associated with the use of the replica-trick. 

4 Dimensioned reduction 

There is a beautiful and rather mind-boggling theorem relating disordered systems 
to pure systems (i.e. without disorder), which applies to a large class of systems, 
e.g. random field systems and elastic manifolds in disorder. It is called dimensional 
reduction and reads as follows [16]: 

Theorem: A d- dimensional disordered system at zero temperature is equivalent to 
all orders in perturbation theory to a pure system in d — 2 dimensions at finite tem- 
perature. Moreover the temperature is (up to a constant) nothing but the width 
of the disorder distribution. A simple example is the 3-dimensional random-field 
Ising model at zero temperature; according to the theorem it should be equiva- 
lent to the pure 1-dimensional Ising-model at finite temperature. But it has been 
shown rigorously by Imbrie [17], that the former has an ordered phase, whereas 
we have all solved the latter and we know that there is no such phase at finite 
temperature. So what went wrong? Let me stress that there are no missing dia- 
grams or any such thing, but that the problem is more fundamental: As we will 
see later, the proof makes assumptions, which are not satisfied. Nevertheless, the 
above theorem remains important since it has a devastating consequence for all 
perturbative calculations in the disorder: However clever a procedure we invent, 
as long as we do a perturbative expansion, expanding the disorder in its moments, 
all our efforts are futile: dimensional reduction tells us that we get a trivial and 
unphysical result. Before we try to understand why this is so and how to overcome 
it, let me give one more example. Dimensional reduction allows to calculate the 
roughness-exponent defined in equation (2.4). We know (this can be inferred 
from power-counting) that the width u oî a. d-dimensional manifold at finite tem- 
perature in the absence of disorder scales as u ^ Making the dimensional 

shift implied by dimensional reduction leads to 

(u(x) — u{0)Ÿ ~ i-e. C = 



5 The Larkin-length 

To understand the failure of dimensional reduction, let us turn to an interesting 
argument given by Larkin [18]. He considers a piece of an elastic manifold of size 
L. If the disorder has correlation length r, and characteristic potential energy /, 
this piece will typically see a potential energy of strength 







(5.1) 
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R{Ua{x) - Uh[x)) 

a b 



C(x -y) = 



a b 

X 

0R{Ua - Ub) = 

y i---é 

a h 



- 2 



a b 




a a 



= f C{x-yf[R”{Ua 

Jx-y 



- Uhf - 2R''{Ua - Ub)R”{Ua ~ Ma)] 



Figure 4: The disorder vertex R(ua{x) — Uh{x)) and the correlation- function C{x — 
y), with Fourier- transform C{k) = which is diagonal in replica-space (left). 
Contracting two disorder- vertices with two correlation- functions leads to the two 1- 
loop contributions 5R to the disorder-correlator R (right). The integral j^_y C[x — 

yY = where L is some IR-cutoff. 

On the other hand, there is an elastic energy, which scales like 

Ee\=cL^-‘^. (5.2) 

These energies are balanced at the Larkin-length L — Lc with 




More important than this value is the observation that in all physically interesting 
dimensions d < 4, and at scales L > Tc, the membrane is pinned by disorder; 
whereas on small scales elastic energy dominates. Since the disorder has a lot of 
minima which are far apart in configurational space but close in energy (metasta- 
bility), the manifold can be in either of these minimas, and the ground-state is no 
longer unique. However exactly this is assumed in e.g. the proof of dimensional 
reduction; as is most easily seen in its supersymmetric formulation [19]. 

6 The functional renormalization group (FRG) 

Let us now discuss a way out of the dilemma: Larkin’s argument suggests that 4 
is the upper critical dimension. So we would like to make an e = 4 — d expansion. 
On the other hand, dimensional reduction tells us that the roughness is ^ 

(see (4.1)). Even though this is systematically wrong below four dimensions, it 
tells us correctly that at the critical dimension d = 4, where disorder is marginally 
relevant, the field u is dimensionless. This means that having identified any rele- 
vant or marginal perturbation (as the disorder), we find immediately another such 
perturbation by adding more powers of the field. We can thus not restrict ourselves 
to keeping solely the first moments of the disorder, but have to keep the whole 
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-R"(u) A 



Figure 5: Change of —R"{u) under renormalization and formation of the cusp. 



disorder-distribution function R(u). Thus we need a functional renormalization 
group treatment (FRG). Functional renormalization is an old idea going back to 
the seventies, and can e.g. be found in [20]. For disordered systems, it was first 
proposed in 1985 by D.S. Fisher [21]. Performing an infinitesimal renormalization, 
i.e. integrating over a momentum shell à la Wilson, leads to the flow deR{u), with 
(e = 4-d) [22] 

deR{u) = (e - 4C) R{u) + CuR'{u) + ^R"{uf - R"(u)R"(0) . (6.1) 

The first two terms come from the rescaling of R and u respectively. The last two 
terms are the result of the 1-loop calculations, which are sketched in figure 4. 

More important than the form of this equation is it actual solution, sketched 
in figure 5. After some finite renormalization, the second derivative of the disorder 
R"{u) acquires a cusp at = 0; the length at which this happens is the Larkin- 
length. How does this overcome dimensional reduction? To understand this, it is 
interesting to study the flow of the second and forth moment. Taking derivatives 
of (6.1) w.r.t. u and setting u to 0, we obtain 

diR'\0) - (e - 2C) R"{0) + R"'{0f — > {e - 2Q R"{0) (6.2) 

diR""{0) = eR""(0) + 3R""{0f + 4R'"{0)R"'"{0) — > eR""{0) + 3R""{0f . (6.3) 

Since R{u) is an even function, R'"{0) and R'""{Q) are 0, which we have already 
indicated in Eqs. (6.2) and (6.3) . The above equations for R"{0) and R""{0) are 
in fact closed. Equation (6.2) tells us that the flow of R"{0) is trivial and that in 
order to have a fixed point, ( = ef 2 = This is exactly the result predicted by 
dimensional reduction. The appearance of the cusp can be inferred from equation 
(6.3). Its solution is 

^""(0)1. = i_3e(e'! - li)/, - ^ (6-4) 

Thus after a finite renormalization R""{0) becomes infinite: The cusp appears. By 
analyzing the solution of the flow-equation (6.1), one also finds that beyond the 
Larkin- length R"{Q) is no longer given by (6.2) with R"'{0)^ = 0. The correct 
interpretation of (6.2), which remains valid after the cusp- format ion, is (for details 
see below) 



deR"{0) = {e- 2C) R"{0) + R'"{0^f . 



(6.5) 
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U 



Figure 6: The toy model (7.1) before (left) and after (right) the Larkin-scale. 



Renormalization of the whole function thus overcomes dimensional reduction. The 
appearance of the cusp also explains why dimensional reduction breaks down. The 
simplest way to see this is by redoing the proof for elastic manifolds in disorder, 
which in the absence of disorder is a simple Gaussian theory. Terms contributing to 
the 2-point function involve R"(0), TR""(0) and higher derivatives of R(u) at n = 
0, which all come with higher powers of T. To obtain the limit of T 0, one sets 
r = 0, and only R"(0) remains. This is the dimensional reduction result. However 
we just saw that R""(0) becomes infinite. Thus R""(0)T may also contribute, and 
the proof fails. 

7 Why is a cusp necessary? 

The appearance of a cusp might suggest that this approach is fatally ill. Let me 
present a simple argument due to Leon Balents [23], why a cusp is a physical 
necessity and not an artifact. To this aim, consider a toy model with only one 
Fourier-mode u = Uq 

n[u] = y^^ + V~eV{u) . (7.1) 

Since equation (6.1) has a fixed point of order R{u) ~ e for all e > 0, V{u) scales 
like y/e for e small and we have made this dependence explicit in (7.1) by using 
V{u) = y/eV{u). The only further input comes from the physics: For L < Lc, i.e. 
before we reach the Larkin length, there is only one minimum, as depicted in figure 
6. On the other hand, for L > Lc, there are several minima. Thus there is at least 
one point for which 

^ H[u] =q'^ + Vl V”(u) < 0 . (7.2) 

In the limit of e — » 0, this is possible if and only if ^R""(0), which a priori should 
be finite for e ^ 0, becomes infinite: 

= oo . (7.3) 

This argument shows that a cusp is indeed a physical necessity. 
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8 Beyond 1 loop? 

Functional renormalization has successfully been applied to a bunch of problems 
at 1-loop order. From a field theory, we however demand more. Namely that it 

• allows for systematic corrections beyond 1-loop order 

• be renormalizable 

• and thus allows to make universal predictions. 

However, this has been a puzzle since 1986, and it has even been suggested that the 
theory is not renormalizable due to the appearance of terms of order C 2 [24]. Why 
is the next order so complicated? The reason is that it involves terms proportional 
to R"'{0). A look at figure 3 explains the puzzle. Shall we use the symmetry of 
R{u) to conclude that R"'{0) is 0, leading us back to dimensional reduction? Or 
shall we take the left-hand or right-hand derivatives, related by 

i?"'(0+) := Urn R"\u) = - fim R"'{u) =: (8.1) 

u-fO 

Even worse, already the 2-point function at 1-loop order is proportional to 
and thus a priori undefined [4]. In the following, I will present our solution of this 
puzzle, obtained at 2-loop order in [1, 4] and then at large N in [2]. The latter 
approach allows for another independent control-parameter, and sheds further light 
on the cusp- formation. We refer the reader to these works for further references. 

9 Results at 2-loop order 

For the flow-equation at 2-loop order, we find [1] 

deR{u) = (e - 40 R(u) + CuR'(u) + ]^R”{uf - R"{u)R"(0) 

{R"{u) - R"{0)) R'"{u)^ - ^R'''{0+)^R"iu) . (9.1) 

The first line is the result at 1-loop order, already given in (6.1). The second line is 
new. The most interesting term is the last one, which involves R'"(0^)^ and which 
we therefore call anomalous. The hard task is to fix the prefactor (-|). We have 
up to now invented six algorithms to do so; one leads to inconsistencies and shall 
not be reported here. The other five algorithms are consistent with each other: 
The sloop- algorithm, recursive construction, reparametrization invariance, renor- 
malizability, and potentiality [1, 4]. For lack of space, we restrain our discussion 
to the last two ones. At 2-loop order the following diagram appears 

— > ~(R"{u)-R"{0))R'"{uf -^R''{u)R"'{0+)'^ (9.2) 
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d = 2 


0.417 


0.444 


0.42 ± 0.02 


0.41 ±0.01 


25 




d=l 


0.625 


0.687 


0.67 ±0.02 
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Figure 7: Results for ( in the random bond case. 



leading to the anomalous term. The integral (not written here) contains a sub- 
divergence, which is indicated by the box. Renormalizability demands that its 
leading divergence (which is of order 1/e^) be canceled by a 1-loop counter-term. 
The latter is unique thus fixing the prefactor of the anomalous term. (The idea is 
to take the 1-loop correction SR in figure 2 and replace one of the R" in it by SR" 
itself, which the reader can check to leading to the terms given in (9.2) plus terms 
which only involve even derivatives, which will even cancel at the end.) 

Another very physical demand is that the problem remain potential, i.e. 
that forces still derive from a potential. The force-force correlation function being 
— R"{u), this means that the flow of R'{0) has to be strictly 0. (The simplest way 
to see this is to study a periodic potential.) Prom (9) one can check that this does 
not remain true if one changes the prefactor of the last term in (9); thus fixing it. 

Let us give some results for cases of physical interest. First of all, in the case 
of a periodic potential, which is relevant for charge-density waves, the fixed-point 
function can be calculated analytically as (we choose period 1, the following is for 
u € [ 0 , 1 ]) 

•R*(w) = “ (^ + (9-3) 

This leads to a universal amplitude. In the case of random field disorder (short- 
ranged force- force correlation function) C = f • For random-bond disorder (short- 
ranged potential-potential correlation function) we have to solve (9.1) numerically, 
with the result ( — 0.20829804e -h 0.006858e^. This compares well with numerical 
simulations, see figure 7. 



10 Large N 



In the last section, we have discussed renormalization in a loop expansion, i.e. 
expansion in e. In order to independently check consistency it is good to have a 
non-perturbative approach. This is achieved by the large- A' limit, which can be 
solved analytically and to which we turn now. We start from 



— n[uj] = -^'^ Ua{x) (-V^+m^) Ua{x) - ^ / Ja{x)Ua\ 
è [ B{{Ua{x)-Ub{x)f) , 

a, 6=1 






( 10 . 1 ) 
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where in contrast to (3.2), we use an AT-component field u. For = 1, we identify 
B{v?) = R{u). We also have added a mass m to regularize the theory in the infra- 
red and a source j to calculate the effective action T{u) via a Legendre transform. 
For large N the saddle point equation reads [2] 

ê'iul,) = B' (ul, + 2Th + ^h[B'{ul,) - B'(0)]) (10.2) 

This equation gives the derivative of the effective (renormalized) disorder B as 
a function of the (constant) background field = {ua — in terms of: the 
derivative of the microscopic (bare) disorder B, the temperature T and the inte- 
grals In \= (/e2+^^2)n. 

The saddle-point equation can again be turned into a closed functional renor- 
malization group equation for B by taking the derivative w.r.t. m: 

diB(x) = = (e-4C)B(x) + 2CxÉ'{x) + \è'(x)^ - B'{x)B'{0) + 

(10.3) 

This is a complicated nonlinear partial differential equation. It is therefore sur- 
prising, that one can find an analytic solution. (The trick is to write down the 
flow-equation for the inverse function of B'{x)^ which is linear.) Let us only give 
the results of this analytic solution: First of all, for long-range correlated disor- 
der of the form B'{x) ~ the exponent C can be calculated analytically as 
C = 2 ( 1 + 7 ) ■ ^Lgrees with the replica-treatment in [26] and the 1-loop treatment 
in [24]. For short-range correlated disorder, C = 0- Second, it demonstrates that 
before the Larkin- length, B{x) is analytic and thus dimensional reduction holds. 
Beyond the Larkin length, B"(0) = oo, a cusp appears and dimensional reduction 
is incorrect. This shows again that the cusp is not an artifact of the perturbative 
expansion, but an important property even of the exact solution of the problem 
(here in the limit of large A^). 

11 Relation to Replica Symmetry Breaking (RSB) 

There is another treatment of the limit of large N given by Mezard and Parisi 
[26]. They start from (10.1) but without a source- term j. In the limit of large N, 
a Gaussian variational ansatz of the form 

^Bg[u\ = I Ua{x) (-V+m^)Ua(x) - ^ è aabUa{x)Üb{x) (11.1) 

a=l ^ a, 6—1 

becomes exact. The art is to make an appropriate ansatz for Uab- The simplest 
possibility, dab = ^ for sl\ a ^ b reproduces the dimensional reduction result (4.1), 
which breaks down at the Larkin length. Beyond that scale, a replica symmetry 
broken (RSB) ansatz for dab is suggestive. To this aim, one can break dab into four 
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blocks of equal size, choose one ( variât ionally optimized) value for the both outer 
diagonal blocks, and then iterate the procedure on the diagonal blocks, resulting 
in 



( 11 . 2 ) 



One finds that the more often one iterates, the better the results. In fact, one has 
to repeat this procedure infinite many times. This seems like a hopeless endeavor, 
but Paris! has shown ^hat the infinitely often replica symmetry broken matrix 
can be parameterized by a function [cr]{z) with 2: E [0, 1]. In the SK-model, 2; has 
the interpretation of an overlap between replicas. While there is no such simple 
interpretation for the model (11.1), we retain that 2: == 0 describes distant states, 
whereas 2 ; = 1 describes nearby states. The solution of the large- TV saddle-point 
equations leads to the curve depicted in figure 8. Knowing it, the 2-point function 
is given by 




(UkU-k) 



1 




r dz M jz) \ 

Jo M {z) + m2 y 



(11.3) 



The important question is: What is the relation between the two approaches, which 
both pretend to calculate the same 2-point function? Comparing the analytical 
solutions, we find that the 2-point function given by FRG is the same as that 
of RSB, if in the latter expression we only take into account the contribution 
from the most distant states, i.e. those for 2 : between 0 and Zm (see figure 8). To 




Figure 8: The function [a] (u) -h m^ £ls given in [26]. 
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understand why this is so, we have to remember that the two calculations were 
done under quite different assumptions: In contrast to the RSB-calculation, the 
FRG- approach calculated the partition function in presence of an external field j, 
which was then used to give via a Legendre transformation the effective action. 
Even if the field j is finally turned to 0, the system might remember its preparation, 
as is the case for a magnet: Preparing the system in presence of a magnetic field 
will result in a magnetization which aligns with this field. The magnetization will 
remain, even if finally the field is turned off. The same phenomena happens here: 
By explicitly breaking replica- symmetry through an applied field, all replicas will 
settle in distant states, and the close states from the Parisi-function [cr] (z) + m? 
(which describes spontaneous RSB) will not contribute. However, we found that 
the full RSB-result can be reconstructed by remarking that the part of the curve 
between Zm and Zc is independent of the infrared cutoff m, and then integrating 
over m [2] {rUc is the mass corresponding to Zc): 



RSB 

(UkU-k) 

k—0 



RLio) . n dfi;(o) 

fji^ ml 



(11.4) 



We also note that a similar effective action hats been proposed in [27]. While it 
agrees qualitatively, it does not reproduce the correct FRG 2-point function, as it 
should. 



12 Finite N 



Up to now, we have studied the functional RG in two cases: For one component 
N = 1 and in the limit of a large number of components, N ^ oo. The general 
case of finite N is more difficult to handle, since derivatives of the renormalized 
disorder now depend on the direction, in which this derivative is taken. Define 
amplitude u \u\ and direction u := u/\u\ of the field. Then deriving the latter 
variable leads to terms proportional to 1/u, which are diverging in the limit of 
u ^ 0. This poses additional problems in the calculation, and it is a priori not 
clear that the theory at N 1 exists, even if this is the case for TV = 1. At 1-loop 
order everything is well-defined [24]. We have found a consistent RG-equation at 
2- loop order [8]: 



deR{u)={e-4C)R{u)HuR'{u)+^R"{u)^-R"{0)R"{u)+Rf^^[^-2R"i0)] 



+ i [iî"(w)-iî"(0)] R'"{uf+^ 



-R'"{0+y 






( 12 , 1 ) 



The first line is the 1-loop equation, given in [24]. The second and third line repre- 
sent the 2-loop equation, with the new anomalous terms proportional to 
(third line). 
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C 




Figure 9: Results for the roughness C at 1- and 2-loop order, as a function of the 
number of components N. 



The fixed point equation (12.1) can be integrated numerically, order by order 
in e. The result, specialized to directed polymers, i.e. e = 3 is plotted on figure 9. 
We see that the 2-loop corrections are rather big at large TV, so some doubt on the 
applicability of the latter down to e = 3 is advised. However both 1- and 2-loop 
results reproduce well the two known points on the curve: (^ = 2/3 for N = 1 
and C = 0 for TV = oo. The latter result has been given in section 10. Via the 
equivalence [28] of the directed polymer problem in TV dimensions treated here 
and the KPZ-equation of non-linear surface growth in TV dimensions, which relate 
the roughness exponent C of the directed polymer to the dynamic exponent 2 ;kpz 
in the KPZ-equation via ( = we know that C(TV=l) = 2/3. 

The line (^ = 1/2 given on figure 9 plays a special role: In the presence of 
thermal fluctuations, we expect the roughness-exponent of the directed polymer 
to be bounded by > 1/2. In the KPZ-equation, this corresponds to a dynamic 
exponent zkpz = 2, which via the exact scaling relation 2 :kpz + Ckpz = 2 is an 
upper bound in the strong-coupling phase. The above data thus strongly suggest 
that there exists an upper critical dimension in the KPZ-problem, with due ~ 2.4. 
Even though the latter value might be an underestimation, it is hard to imagine 
what can go wrong qualitatively with this scenario. Strong objections might arize 
from numerical simulations, such as [29]. However the latter use a discrete RSOS 
model, and the exponents are measured for interfaces, which in large dimensions 
have average thickness of the discretization size, suggesting that the data are far 
from the asymptotic regime. We thus strongly encourage numerical simulations on 
a continuous model, in order to settle this issue. 
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13 Depinning transition 

Another important class of phenomena for elastic manifolds in disorder is known 
under the name “depinning transition”: Applying a constant force to the elas- 
tic manifold, e.g. a constant magnetic field to the ferromagnet mentioned in the 
introduction, the latter will only move, if a certain critical threshold force fc is 
surpassed, see figure 10. (This is fortunate, since otherwise the magnetic domain 
walls in the hard-disc drive onto which this article is stored would move with 
the effect of deleting all information, depriving the reader from his reading.) At 
/ = /c, i.e. at the so-called depinning transition, the manifold has a distinctly dif- 
ferent roughness exponent ( (defined in Eq. (2.4)) from the equilibrium (/ = 0). 
For f > fci fhe manifold moves, and close to the transition, new observables and 
corresponding exponents appear: 

• the dynamic exponent 2 : relating correlation functions in spatial and temporal 
direction 

t ^ 

• a correlation length ^ set by the distance to fc 

e-iz-Zci-" 

• furthermore, the new exponents are not all independent, but satisfy the 
following exponent relations [30, 31] 

0 = i'{z-C) 

The equation describing the movement of the interface is 

dtu{x,t) = 7n^)u{x,t) F{u{x,t),x) , F{u,x) = -duV{u,x) (13.2) 

This model has been treated at 1-loop order by Natterman et al. [30, 31] and by 
Narayan and Fisher [32]. The 1-loop flow-equations are identical to those of the 

V 



fc f 

Figure 10: Velocity of a pinned interface as a function of the applied force. Zero 
force: equilibrium, f = f^: depinning. 
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statics. This is surprising, since physically, the phenomena at equilibrium and at 
depinning are quite different. There is even the claim by [32], that the roughness 
exponent in the random field universality class is exactly ( == e/3, as it is in the 
equilibrium random field class. After a long debate among numerical physicists, 
the issue is today resolved: The roughness is significantly larger, and reads e.g. for 
the driven polymer ( = 1.25, instead of ^ = 1. Clearly, a 2-loop analysis [1, 3] is 
necessary, to resolve these issues. Such a treatment starts from the dynamic action 



5= / û{x,t){dt — rn^)u{x,t) -\- / ü{x,t)A{u{x,t) - u{x,t'))û{x,t') , 

Jx,t 

(13.3) 

where the “response field” û{x,t) enforces the equation of motion (13.2) and 

F{u,x)F{u',x') = A{u - u')0^{x - x') = -R'\u - u')5'^{x - x') (13.4) 



is the force-force correlator, leading to the second term in (13.3). As in the statics, 
one encounters terms proportional to A'(0“^) = Here the sign- problem 

can uniquely be resolved by supposing that the membrane only jumps ahead. 



t > t' u{x,t) > u{x,t') . 



(13.5) 



Practically this means that when evaluating diagrams containing A{u{x,t) — 
u{x,t')), one splits them into two pieces, one with t < t' and one with t > t' . 
Both pieces are well defined, even in the limit of t ^ As the only tread-off of 
this method, diagrams can become complicated and difficult to evaluate; however 
they are always well-defined. 

Physically, this means that we approach the depinning transition from above. 
This is reflected in (13.5) by the fact that u{x, t) may remain constant; and indeed 
correlation-functions at the depinning transition are completely independent of 
time[3j. On the other hand a theory for the approach of the depinning transition 
from below (/ < /c) has been elusive so far. 

At the depinning transition, the 2-loop functional RG reads [1, 3] 

deR{u) = {e - 4QR{u) + CuR'{u) + ^R''{u)^-R"{u)R"(0) 

+ \ [R"{u) - -R"(0)1 R"'{uf + i R"'{0+fR"{u) . (13.6) 

First of all, note that it is a priori not clear that the functional RG equation, 
which is a flow equation for A{u) = —R''{u) can be integrated to a functional 
RG-equation for R{u). We have chosen this representation here, in order to make 
the difference to the statics evident: The only change is in the last sign on the 
second line of (13.6). This has important consequences for the physics: First of 
all, the roughness exponent ( for the random-field universality class changes from 
C = I to 

C = ^(l+0.1433l€+...) . 
o 



(13.7) 
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Second, the random-bond universality class is unstable and always renormalizes to 
the random-field universality class, as is physically expected: Since the membrane 
only jumps ahead, it always experiences a new disorder configuration, and there is 
no way to know of wether this disorder can be derived from a potential or not. This 
non-potentiality is most strikingly observed in the random periodic universality 
class, which is the relevant one for charge density waves. The fixed point for a 
periodic disorder of period one reads (remember A{u) = —R"{u)) 

= à -(1 + ^ 



Integrating over a period, we find (suppressing in F{u, x) the dependence on the 
coordinate x for simplicity of notation) 



[ duA*{u)= [ duF{u)F{u') = 
Jo Jo 



ÏÔ8 



(13.9) 



In an equilibrium situation, this correlator would vanish, since potentiality requires 
LMuF(w) = 0. Here, there are non-trivial contributions at 2-loop order (order 
e^), violating this condition and rendering the problem non-potential. This same 
mechanism is also responsible for the violation of the conjecture C = f ^ which could 
be proven on the assumption that the problem remains potential under renormal- 
ization. Let us stress that the breaking of potentiality under renormalization is a 
quite novel observation here. 





d 


e 




estimate 


simulation 




“3“ 


0.33 


0.38 


0.38T0.02 


0.34±0.01 


c 


2 


0.67 


0.86 


0.82T0.1 


0.75±0.02 




1 


1.00 


1.43 


1.2±0.2 


1.25±0.01 






0.89 


0.85 


0.84ib0.01 


0.84±0.02 


p 


2 


0.78 


0.62 


0.53T0.15 


0.64T0.02 




1 


0.67 


0.31 


0.2±0.2 


0.25 ...0.4 




Z] 


0.58 


0.61 


0.62T0.01 




u 


2 


0.67 


0.77 


0.85T0.1 


0.77T0.04 




Ij 


0.75 


0.98 


1.25±0.3 


1±0.05 





e 




estimate 


simulation 


c 


0.33 


0.47 


0.47±0.1 


0.39T0.002 


ZJ 


0.78 


0.59 


0.6T0.2 


0.68T0.06 


2 : 


0.78 


0.66 


0.7±0.1 


0.74T0.03 




1.33 


1.58 


2±0.4 


1.52T0.02 



Figure 11: The critical exponents at the depinning transition, for short range 
elasticity (left) and for long range elasticity (right). 
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The other critical exponents mentioned above can also be calculated. The 
dynamical exponent 2 ; (for RF-disorder) reads [1, 3] 

z = 2-^e- 0.04321e2 + . . . . (13.10) 

All other exponents are related via the relation (13.1). That the method works 
well even quantitatively can be inferred from figure 11. 

14 Universal width distribution 

Exponents are not the only interesting quantities: In experiments and simulations, 
often whole distributions can be measured, as e.g. the universal width distribution 
of an interface [11, 12]. Be (u) the average position of an interface for a given 
disorder configuration, then the spatially averaged width 

~jd J (“(^) “ ( 14 - 1 ) 

is a random variable, and one can calculate and measure its distribution P{w‘^). 
The rescaled function ^{z), defined by 

P{w^) - l/w"^ ^ (14.2) 

will be universal, i.e. independent of microscopic details and the size of the system. 

Supposing all correlations to be Gaussian, ^(z) can be calculated analytically. 
It depends on two parameters, the roughness exponent ( and the dimension d. 
Numerical simulations displayed on figure 12 show spectacular agreement between 
analytical and numerical results. As expected, the Gaussian approximation is not 




Figure 12: Scaling function ^{z) for the (1 + 1) -dimensional harmonic model, 
compared to the Gaussian approximation for = 1.25. Data from [11]. 
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exact, but to see deviations in a simulation, about 10 ^ samples have to be used. 
Analytically, corrections can be calculated: They are of order and small. 

Physically, the distribution is narrower than a Gaussian. 

15 Anisotropic depinning, directed percolation, 
branching and all that 

We have discussed in section 13 isotopic depinning, which as the name suggest is a 
situation, where the system is invariant under a tilt. This isotropy can be broken 
through an additional anharmonic elasticity 

-^elastic ^ j \ + C4 [Vw(x)]'‘ , (15.1) 

leading to the distinct anisotropic depinning universality class, as found recently 
in numerical simulations [33]. It has been observed in simulations [34, 35], that 
the drift-velocity of an interface is increased, which can be interpreted as a tilt- 
dependent term, leading to the equation of motion in the form 

dtu{x, t) = V‘^u{x, t) + A \^u{x, t)Ÿ + F{x, u{x, t)) f . (15.2) 

However for long time it was unclear, how this new term (proportional to A), 
which usually is referred to as a KPZ-term, is generated, especially in the limit of 
vanishing drift- velocity. In [5] we have shown that this is possible in a non- analytic 
theory, due to the diagram given in figure 13. 

For anisotropic depinning, numerical simulations based on cellular automa- 
ton models which are believed to be in the same universality class [36, 37], indicate 
a roughness exponent Ç, ~ 0.63 in d = 1 and Ç, ~ 0.48 in d = 2. On a phenomeno- 
logical level it has been argued [36, 37] that configurations at depinning can be 
mapped onto directed percolation in d = 1 -f - 1 dimensions, which yields indeed a 
roughness exponent Cdp = = 0.630 ± 0.001, and it would be intriguing to 

understand this from a systematic field theory. 

This theory was developed in [5], and we review the main results here. A 
strong simplification is obtained by going to the Cole-Hopf transformed fields 

Z{x,t) 



0 

Figure 13: The diagram generating the irreversible nonlinear KPZ term with one 
disorder vertex and one C 4 vertex (the bars denote spatial derivatives). 



. - IniZix.t)) /.Kox 

u[x,t) = . (15.3) 
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The equation of motion becomes after multiplying with XZ{x,t) (dropping the 
term proportional to /) 

dtZ{x,t) = V^Z(x,t) + XF 

and the dynamical action (after averaging over disorder) 



ln(Z(x, t)) 

Â 



Z(x, t) 



(15.4) 



'= [ Z{x,t){dt-V^)Z{x,t) 

J xt 

J xtt' 



2 



Z{x,t')Z{x,t') (15.5) 



This leads to the FRG flow equation at 1-loop order 

diA{u) = {e- 2C)A(u) + (uA'{u) - A"(u) (A(u) - A(0)) - A'{uf 

+2AA(i^)A'(0+) + 2A^ (A(u)2 -h A(u)A(0)) (15.6) 



The first line is equivalent to (6.1) using A{u) = —R"{u). The second line is new 
and contains the terms induced by the KPZ term, i.e. the term proportional to A 
in (15.2). 

Equation (15.6) possesses the following remarkable property: A three param- 
eter subspace of exponential functions forms an exactly invariant subspace. Even 
more strikingly, this is true to all orders in perturbation theory [5]! The subspace 
in question is (0 < < 1/A) 

A(w) = ^(a + 6e-^“ + ce^“) (15.7) 

The FRG-flow (15.6) closes in this subspace, leading to the simpler 3-dimensional 
flow: 



dia = a-\- 4a^ + 4ac + 46c (15.8) 

dib = 6(1 + 6n + 6 + 5c) (15.9) 

^£C = c(l + 6a + 6 + 5c) (15.10) 

This flow has a couple of fixed points, given on figure 14. They describe diflPerent 
physical situations. The only globally attractive fixed point is SAP, describing self- 
avoiding polymers. This fixed point is not attainable from the physically relevant 
initial conditions, which lie (as fixed point RP) between the two séparatrices given 
on figure 14. All other fixed points are cross-over fixed points. 

In the Gole-Hopf representation, it is easy to see why the exponential manifold 
is preserved to all orders. Let us insert (15.7) in (15.5). The complicated functional 
disorder takes a very simple polynomial form [5] . 



: f z{x,t){dt-v^)z{x,t)- 

J xt 

I j Z{x,t)Z{x,t') [aZ{x,t)Z{x,t') + bZ{x,t)‘^ -\- cZ{x,t'Ÿ) . (15.11) 

J X J t<t' 
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Figure 14: Fixed point structure for A = 2, which is a typical value. The ratio 
cjh is not renormalized, see (15.9)-(15.10), such that cjh is a parameter, fixed by 
the boundary conditions, especially A. The fixed points are Gaussian G, Random 
Periodic RP (the generalization of the RP fixed point for A = 0), Self- Avoiding 
Polymers SAP, and Unphysical U. 



The vertices are plotted on figure 15. It is intriguing to interpret them as particle 
interaction (a) and as different branching processes (5 and c): Z destroys a particle 
and Z creates one. Vertex h can e.g. be interpreted as two particles coming to- 
gether, annihilating one, except that the annihilated particle is created again in the 
future. However, if the annihilation process is strong enough, the reappearance of 
particles may not play a role, such that the interpretation as particle annihilation 
or equivalently directed percolation is indeed justified. 

One caveat is in order, namely that the fixed points described above, are all 
crossover fixed points, and nothing can a priori be said about the strong coupling 
regime. However this is the regime seen in numerical simulations, for which the 
conjecture about the equivalence to directed percolation has been proposed. Thus 
albeit intriguing, the above theory is only the starting point for a more complete 
understanding of anisotropic depinning. Probably, one needs another level of FRG, 
so as standard FRG is able to treat directed polymers, or equivalently the KPZ- 
equation in the absence of disorder. 



time 



a 




Figure 15: The three vertices proportional to a, 6 and c in equation (15.11). 
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16 Perspectives 

More interesting problems have been treated by the above methods, and much 
more remains to be done. We have applied our techniques to the statics at 3- 
loop order [6] and to the random field problem [9]. An expansion in 1/A', (by 
now we have obtained the effective action [7]), should allow to finally describe 
such notorious problems as the strong-coupling phase of the Kardar-Parisi- Zhang 
equation. Many questions are still open. Some have already been raised in these 
notes, another is wether FRG can also be applied to spin-glasses. We have to leave 
this problem for future research and as a challenge for the reader to plunge deeper 
into the mysteries of functional renormalization. 
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Abstract. This talk briefly reviews the current experimental and theoretical de- 
velopments in high Tc superconductivity, focusing on the pseudogap regime of the 
underdoped region of the phase diagram. We argue that the appearance of supercon- 
ductivity cannot be explained based on the traditional approach of quasiparticles 
exchanging some bosons, and that something fundamentally different is called for. 



I shall frame my talk by asking a list of questions. 

1. Is high Tc superconductivity a genuinely novel phenomenon? Is this a deep 
unsolved problem? 

2. Is there a consensus on a qualitative physical picture? 

3. Is there a consensus on a detailed theory? If not, what are the candidates? 

4. Are there experiments which can distinguish different candidates? 

The phenomenon of high temperature superconductivity in cuprates is generally 
agreed to be associated with doping into a Mott insulator. The undoped material 
is an antiferromagnet with a large exchange energy J of order 1500 K. The doped 
holes hop with a matrix element t, which is estimated to be approximately 3J. 
However, the Néel state is not favorable for hole hopping, because after one hop the 
spin finds itself in a ferromagnetic environment. Thus it is clear that the physics is 
that of competition between the exchange energy J and the hole kinetic energy per 
hole xt. Apparently the superconducting state emerges as the best compromise, 
but how and why this occurs is the central question of the high Tc puzzle. In the 
underdoped region this competition results in physical properties that are most 
anomalous. The metallic state above the superconducting Tc behaves in a way 
unlike anything we have encountered before. Essentially, an energy gap appears 
in some properties and not others, and this metallic state has been referred to 
as the pseudogap state. We will focus our attention on this region because the 
phenomenology is well established and we have the best chance of sorting out the 
fundamental physics. Indeed, it is now apparent that there are sufficiently many 
novel experimental facts to answer the first question in the positive. Furthermore, 
I will argue that a consensus is emerging on the framework needed to describe 
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this phenomenon. While detailed proposals are still hotly debated, some of these 
proposals are amenable to experimental tests. 

The pseudogap phenomenon is most clearly seen in the uniform susceptibility. 
For example, Knight shift measurement in the YBCO 124 compound shows that 
while the spin susceptibility is almost temperature independent between 700 K 
and 300 K, as in an ordinary metal, it decreases below 300 K and by the time 
the Tc of 80 K is reached, the system has lost 80% of the spin susceptibility. [1] 
Similarly, the linear T coefficient of the specific heat shows a marked decrease be- 
low room temperature. It is apparent that the spins are forming into singlets and 
the spin entropy is gradually lost. On the other hand, the frequency dependent 
conductivity behaves very differently depending on whether the electric field is in 
the ab plane {(Tab) or perpendicular to it (ac). At low frequencies (below 500 cm“^) 
{aab) shows a typical Drude-like behavior for a metal with a width which decreases 
with temperature, but an area (spectral weight) which is independent of temper- 
ature. [2] Thus there is no sign of the pseudogap in the spectral weight. This is 
surprising because in other examples where an energy gap appears in a metal, such 
as the onset of charge or spin density waves, there is a redistribution of the spec- 
tral weight from the Drude part to higher frequencies. On the other hand below 
300 K a c{co) is gradually reduced below 500 cm~^ and a deep hole is carved out of 
ac{cu) by the time Tc is reached. Finally, angle-resolved photoemission shows that 
an energy gap (in the form of a pulling back of the leading edge of the electronic 
spectrum from the Fermi energy) is observed near momentum (0, tt) and the onset 
of superconductivity is marked by the appearance of a small coherent peak at this 
gap edge. 

The pseudogap phenomenology is well explained by a cartoon picture which 
emerges from the RVB (resonating valence band) theory of Anderson. [3] The spins 
are paired into singlet pairs. However, the pairs are not static but are fluctuating 
due to quantum mechanical superposition, hence the term quantum spin liquid. 
The singlet formation explains the appearance of the spin gap and the reduction 
of spin entropy. The doped holes appear as vacancies in the background of singlet 
pair liquid and can carry a current without any energy gap. However in c-axis 
conductivity and electron is removed from one plane and placed on the next. The 
intermediate state is an electron which carries spin 1/2 and therefore it is neces- 
sary to break a singlet pair and pay the spin-gap energy. The same consideration 
applies to the photoemission experiment. Finally, according to RVB theory, su- 
perconductivity emerges when the holes become phase coherent. The spin singlet 
familiar in the BCS theory has already been formed. 

The original Anderson proposal assumed that the undoped antiferromagnet 
is a spin liquid state. This was soon shown not to be the case experimentally. 
However it may still be possible that RVB physics emerges upon the introduction 
of a finite but small concentration of holes. The notion of spin charge separation 
is an integral part of RVB physics, i.e., unlike Landau’s quasiparticle, the electron 
fractionalizes into spin and charge components. Then the question arises as to 
whether this exotic state of affairs applies only at finite temperature or whether it 
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describes the ground states. In the former case, we are considering only a cross-over 
phenomenon, whereas in the latter case precise statements can be made which have 
experimental consequences. Senthil and Fisher [4] have spearheaded this latter 
point of view. Unfortunately, the experiment they proposed to test spin-charge 
separation as a ground state property has so far turned out to be negative. [5] 
Thus the current opinion seems to disfavor the existence of these exotic ground 
states in high Tc cuprates. 

Nevertheless, the RVB proposal and the search for spin-charge separation 
have inspired and produced significant advances in theoretical physics. Early work 
based on mean-field theory [6] or large N expansion [7] of frustrated Heisenberg 
models showed that fractionalization is possible based on the deconfined phase 
of Z(2) gauge theory. [8] Further, the concept of topological order (i.e., the clas- 
sification of ground states based on the degeneracy under topologically distinct 
boundary conditions) introduced by Wen [6] has proven to be essential. More re- 
cently, the Z(2) gauge theory was greatly elucidated by the work of Senthil and 
Fisher. [4] Furthermore, in the past two years, a number of Hamiltonians which 
support fractionalized excitations have been discovered. While none of these are 
Heisenberg models, the list is growing rapidly and the plausibility of experimental 
realization is also improving. Here I can mention the dimer model on the triangular 
lattice by Moessner and Sondhi [9] and the analytic theory on the Kagome lat- 
tice by Misguich, Serban and Pasquier. [10] Senthil and Motrunich [11] introduced 
bosonic models which can be realized by specially designed Josephson junction 
arrays. Special anisotropic spin models on the Kagome, [12] honeycomb [13] and 
square [14] lattice (the latter two are exactly soluble) have been shown to sup- 
port fractionalized states. As far as the Heisenberg model itself, up to now only 
numerical work is available which indicates the existence of spin liquid states in 
the Kagome lattice [15] and the triangular lattice with a surprisingly small ring 
exchange term. [16] 

Returning to the high Tc problem, while the RVB picture is appealing, there 
has been another popular explanation which at first sight sounds less exotic. The 
idea is that the pseudogap phenomenology can be understood as a superconductor 
with robust amplitude but strong phase fluctuation. The superfluid density ps 
which controls the phase stiffness is proportional to the doping concentration x 
and becomes small in the underdoped region. As emphasized by Uemura [17] and 
by Emery and Kivelson, [18] Tc is controlled by ps and is much lower than the 
energy gap. We shall now argue that these two points of views are not unrelated 
to each other. Setting aside the question of where the strong pairing amplitude 
comes from in the first place, that the phase fluctuation scenario is incomplete 
can be seen from the following argument. In two dimensions the destruction of 
superconducting order is via the Berezinskii-Kosterlitz-Thouless (BKT) theory of 
vortex unbinding. Above Tc the number of vortices proliferate and the normal 
metallic state is reached only when the vortex density is so high that the cores 
overlap. (There is considerable latitude in specifying the core radius, but this 
does not affect the conclusion.) At lower vortex density, transport properties will 
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resemble a superconductor in the flux flow regime. In ordinary superconductors, 
the BKT temperature is close to the mean field temperature, and the core energy 
rapidly becomes small. However, in the present case, it is postulated that the 
mean field temperature is high, so that a large core energy is expected. Indeed, in 
a conventional core the order parameter and energy gap vanish, costing Aq/Ef 
per unit area of energy. Using a core radius of = Vf/Aq, the core energy of 
a conventional superconductor is Ef- In our case, we may replace Ef by J. If 
this were the case, the proliferation of vortices would not happen until a high 
temperature ~ J independent of x is reached. Thus for the phase fluctuation 
scenario to to work, it is essential to have “cheap” vortices, with energy cost of 
order Tc- Then the essential problem is to understand what the vortex core is 
made of. Put another way, there has to be a competing state with energy very 
close to the d-wave superconductors which constitute the core. The vortex core 
indeed offers a glimpse of the normal state reached when H exceeds Hc 2 , and is 
an important constituent of the pseudogap state above Tc- 

To summarize, the pseudogap phenomenology is so clearly defined by exper- 
iments that I believe a broad consensus on the theoretical framework is emerging. 
The following points are common to a number of investigators. 

1. The key issue is how holes are accommodated in a doped Mott insulator. 

2. The superconducting state is characterized by a large energy gap and a small 
Ps. The transition temperature is controlled by phase coherence. 

3. The superconducting state does not evolve out of a normal state with well 
defined quasiparticles. Therefore the question of pairing mechanisms in the 
traditional sense of identifying the boson being exchanged between quasipar- 
ticles is not the right question. 

4. What is needed is to identify the competing state which lives inside the 
vortex core. 

What are the candidates for the competing order? A candidate which has at- 
tracted a lot of attention is the stripe phase. [19, 20] In the LSCO family, dynamical 
stripes (spin density waves) are clearly important, especially near x = There 
are recent report of incommensurate SDW nucleating around vortices. However, 
until now there has been little evidence for stripes outside of the LSCO family. 
On the theoretical side, as a competing state it is not clear how the stripes are 
connected to d-wave superconductivity and it is hard to understand how the nodal 
quasiparticles turn out to be most sharply defined on the Fermi surface, since these 
have to transverse the stripes at a 45° angle. 

A second candidate for the vortex core is the antiferromagnetic state. This 
possibility was first proposed several years ago in the context of the SO (5) theory 
[21]. This theory is phenomenological in that it involves only bosonic degrees of 
freedom (the SDW and pairing order parameters). The quasiparticles are out of 
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the picture and indeed this theory applies equally well to d-wave or a fully gapped 
s-wave superconductor. Thus the fundamental question of how the holes are ac- 
commodated has not really been addressed. The nucléation of SDW around the 
core in the LSCO family lends support to the existence of antiferromagnetic order 
inside the core. Furthermore, there are reports of enhanced antiferromagnetic spin 
fluctuations, and perhaps even static order, using NMR. [22, 23] I shall argue next 
that other considerations also lead to antiferromagnetic fluctuations and possibly 
static orders inside the vortex core, so that the observation of antiferromagnetic 
cores does not necessarily imply the existence of SO (5) symmetry. 

Finally, I come to the candidate which we favor - the staggered flux state 
with orbital currents. [24] Indeed, Lee and Wen have successfully constructed a 
“cheap” vortex state. [25] The staggered flux phase has an advantage over other 
possibilities in that its excitation spectrum is similar to the d-wave superconductor 
and the SU(2) theory allows us to smoothly connect it to the superconductivity. 
We also regard the staggered flux phase as the precursor to Néel order, so that 
antiferromagnetic fluctuations or even SDW order are accommodated naturally. 
Of course, it is experiments which have the final say as to which candidate turns 
out to be realized. Our strategy is to work out as many experimental consequences 
as we can and propose experiments to confirm or falsify our theory. 

The staggered flux state was first introduced as a mean field solution at half- 
filling [26] and later was extended to include finite doping. [24] At half-filling, due 
to the constraint of no double occupation, the staggered flux state corresponds 
to an insulating state with power law decay in the spin correlation function. It is 
known that upon including gauge fluctuations which enforce the constraint, the 
phenomenon of confinement and chiral symmetry breaking occurs, which directly 
corresponds to Néel ordering. [27, 28] The idea is that with doping, confinement is 
suppressed at some intermediate energy scale, and the state can be understood as 
fluctuating between the staggered flux state and the d-wave superconducting state. 
Finally, when the holes become phase coherent, the d-wave superconducting state 
is the stable ground state. Thus the staggered flux state may be regarded as the 
“mother state” which is an unstable fixed point due to gauge fluctuations. It flows 
to Néel ordering at half-filling and to the d-wave superconductor for sufficiently 
large x. Thus the staggered flux state plays a central role in this kind of theory. We 
should point out that the staggered flux state (called the D-density wave state) 
has recently been proposed as the ordered state in the pseudogap region. [29] As 
explained elsewhere, [30] we think that this view is not supported by experiment 
and we continue to favor the fluctuation picture. 

The above picture finds support from studies of projected wavefunctions, 
where the no-double-occupation constraint is enforced by hand on a computer. 
With doping the best state is a projected d-wave state. Recently we calculated 
the current-current correlation function of this state Cj{k,£) =< j{k)j{i) > where 
j(k) is the physical electron current on the bond k. The average current < j{k) > 
is obviously zero, but the correlator exhibits a staggered circulating pattern. [31] 
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Such a pattern is absent in the d-wave BCS state before projection, and is 
a result of the Gutzwiller projection. Our result for Cj is consistent with exact 
diagonalization of two holes in 32 sites. [32] 

The staggered current generates a staggered physical magnetic field (esti- 
mated to be 10-40 gauss) [27, 31] which may be detected, in principle, by neutron 
scattering. In practice the small signal makes this a difficult, though not impossible 
experiment and we are motivated to look for situations where the orbital current 
may become static or quasi-static. Recently, we analyzed the structure of the hcj2e 
vortex in the superconducting state within the SU(2) theory and concluded that 
in the vicinity of the vortex core, the orbital current becomes quasi-static, with 
a time scale determined by the tunneling between two degenerate staggered flux 
states. [25] It is very likely that this time is long on the neutron time scale. Thus 
we propose that a quasi-static peak centered around (tt, tt) will appear in neutron 
scattering in a magnetic field, with intensity proportional to the number of vor- 
tices. The time scale may actually be long enough for the small magnetic fields 
generated by the orbital currents to be detectable by /i-SR or Yttrium NMR. 
Again, the signal should be proportional to the external fields. (The NMR exper- 
iment must be carried out in 2-4-7 or 3 layer samples to avoid the cancellation 
between bi-layers.) We have also computed the tunneling density of states in the 
vicinity of the vortex core, and predicted a rather specific kind of period doubling 
which should be detectable by atomic resolution STM. [33] The recent report [34] 
of a static field of dzl8 gauss in underdoped YBCO which appears in the vortex 
state is promising, even though muon cannot distinguish between orbital current 
or spin as the origin of the magnetic field. We remark that in the underdoped 
antiferromagnet, the local moment gives rise to a field of 340 gauss at the muon 
site. Thus if the 18 gauss signal is due to spin, it will correspond to roughly l/20th 
of the full moment. 

We remark that our analytic model of the vortex core is in full agreement 
with the numerical solution of unrestricted mean field and Xij t>y Wang, Han 
and Lee. [35] Recently we found that for small doping in the t-J model a small 
moment SDW co-exists with orbital currents in the vortex core. [36] More generally, 
we expect (tt, tt) spin fluctuations to be enhanced [37] so that the tendency to 
antiferromagnetism is fully compatible with the staggered flux picture. This vortex 
solution is also interesting in that the tunneling density of states show a gap, with 
no sign of the large resonance associated with Caroli-deGennes-type core levels 
found in the standard BCS model of the vortex. There exists a single bound state 
at low lying energy, [36] in agreement with STM experiments. The low density 
of states inside the vortex core has an important implication. In the standard 
Bardeen-Stephen model of flux-flow resistivity, the friction coefficient of a moving 
vortex is due to dissipation associated with the vortex core states. Now that the 
core states are absent, we can expect anomalously small friction coefficients for 
underdoped cuprates. The vortex moves faist transverse to the current and gives 
rise to large flux-flow resistivity. Since the total conductivity is the sum of the 
flux-flow conductivity and the quasiparticle conductivity, it is possible to get into 
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a situation where the quasiparticle conductivity dominates even for H < Hc 2. 
Thus the “cheap” and “fast” vortex opens the possibility of having vortex states 
above the nominal Tc and Hc 2 , when the resistivity looks like that of a metal, 
with little sign of flux-flow contribution. Prom this point of view, the large Nerst 
effect observed by Ong and co-workers [38] over a large region in the H-T plane 
above the nominal Tc and Hc 2 (as determined by resistivity) may be qualitatively 
explained. 

Finally, I would like to mention my recent work with N. Nagaosa on new 
collective modes in the superconductor state. [39] In ordinary superconductors, 
there is a single pairing order parameter A and associated with it are the amplitude 
mode and the phase modes. In our theory the hopping matrix element Xij is also a 
dynamical degree of freedom. The basic reason is that the hole hopping amplitude 
depends sensitively on the spin configuration, as we explained earlier, so that Xij 
encodes the effect due to the spin degree of freedom. With the pairing and hopping 
amplitudes as dynamic degrees of freedom, it is natural that we can expect new 
collective modes. These can be evaluated in a direct way within mean field theory 
in analogy with the standard RPA treatment, but the SU(2) theory allows us to 
classify them. We find two new classes of collective modes. The first is called the 
^-mode, which describes the local fluctuation towards the staggered flux phase, 
and has the effect of generating the orbital current correlation described earlier. 
The second is called the 0-gauge mode. This mode involves the relative oscillation 
of the amplitudes of Xij and Aij in such a way that \xij P + | A^j p is constant, and 
this oscillation is modulation in a staggered way. Thus this mode is most important 
near (tt, tt), but it disperses throughout k space in a way which we have computed. 
The frequency of the mode is estimated to be of order J for x = 0.15. This mode 
couples to Raman scattering in the standard way so that optical Raman scattering 
may observe the mode at k = 0. X-Ray Raman scattering should be able to map 
out the entire dispersion, but higher energy resolution than what is currently 
available will be needed. It turns out the LSCO offers a special opportunity to 
couple to this mode. In the LTO phase the copper oxygen bonds are buckled 
in a staggered way. A uniform motion of the oxygen along the c-axis leads to a 
staggered modulation of the hopping matrix element. Thus we predict that a new 
collective mode at (7r,7r) will show up as a side band of the buckling mode in 
c-axis conductivity. Unfortunately, the relative spectral weight of one side band is 
estimated to be very small (of order 5 x 10~^), making its detection a challenging 
one for experimentalists. 
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Network Models for Chiral Symmetry Classes 
of Anderson Localisation 

Marc Bocquet and J.T. Chalker 



Abstract. We consider localisation problems belonging to the chiral symmetry 
classes, in which sublattice symmetry is responsible for singular behaviour at a 
band centre. As an account for the talk given in TH2002, we focus here on the 
chiral time-reversal invariant symmetry class. We formulate a model which has the 
relevant symmetries and which is a generalisation of the network model introduced 
previously in the context of the integer quantum Hall plateau transition. We show 
that this generalisation can be re-expressed as corresponding to the introduction 
of absorption and amplification into the original network model. This model repre- 
sents a convenient starting point for analytic discussions and computational studies. 
It exhibits both localised and critical phases, and band-centre singularities in the 
critical phase approach more closely in small systems the expected asymptotic form 
than in other known realisations of the symmetry class. In addition, we demonstrate 
that by imposing appropriate constraints on disorder, a lattice version of the Dirac 
equation with a random vector potential can be obtained. 



1 Introduction 

The classification by symmetry of Hamiltonians for disordered systems provides 
an important framework in the study of Anderson localisation. Three standard 
symmetry classes are long-established, and represented in the zero-dimensional 
limit by the three Wigner-Dyson random matrix ensembles [1]. The existence of 
additional symmetry classes has been implicit in work which also has an exten- 
sive history, but a complete classification has been set out only rather recently, 
by Altland and Zirnbauer [2, 3]. Hamiltonians belonging to one of these addi- 
tional symmetry classes are characterised by a discrete symmetry which relates 
eigenvalues and eigenfunctions in pairs. This contribution is concerned with the 
chiral symmetry classes. We shall focus on the chiral symmetry class which de- 
scribes time- reversal invariant systems, namely class AIII. It can be realised as a 
tight-binding Hamiltonian for time-reversal invariant systems in which sites can 
be divided into two sublattices and non-zero matrix elements connect only sites 
from opposite sublattices. In these terms, the discrete symmetry operation is mul- 
tiplication of wavefunction components on one sublattice by a factor of minus one: 
applied to an eigenstate with energy this generates a second eigenstate, with 
energy —E. 

Early work on disordered Hamiltonians with chiral symmetry followed a va- 
riety of independent directions. A one-dimensional model of this type was solved 
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exactly by Dyson [4] , who found a divergence in the density of states at the band 
centre, of a form shown by Ovchinnikov and Erikhman [5] to be characteristic for 
the symmetry class in one dimension, A closely related feature of this and similar 
models is that, while states away from the band centre are localised, both the 
mean free path and the localisation length diverge on approaching the band cen- 
tre, as revealed in early treatments by Eggarter and Riedinger [6] and by Ziman 
[7]. Two-dimensional systems with chiral symmetry were studied by Gade and 
Wegner, [8, 9] using a mapping to a non-linear sigma model: their calculations for 
the class AIII with weak disorder indicate a strongly divergent density of states 
at the band centre, also accompanied by a divergence of the localisation length. 
Chiral random matrix ensembles, investigated by Nagao and Slevin [10] and by 
Verbaarschot and Zahed, [11] likewise show band centre anomalies in the density 
of states, albeit on energy scales comparable with the level spacing. And in a sep- 
arate development, extensive analytical results have been obtained by Nersesyan 
et al [12] and by Ludwig et al [13] for a particular two-dimensional model with 
chiral symmetry, the massless two-dimensional Dirac equation with random vector 
potential. 

Much subsequent work has helped to establish the range of behaviour possi- 
ble in these systems. In one dimensional models, a finite localisation length at zero 
energy and a modification of the Dyson singularity in the density of states can be 
induced [5, 14, 15] by terms in the Hamiltonian that in the absence of disorder 
would generate a spectral gap. For two-dimensional systems, it is recognised that 
a random vector potential in the Dirac equation constitutes a special choice of 
disorder, and scaling flow from this towards a generic fixed point, with properties 
similar to those derived by Gade, has been studied in a field- theoretic framework 
[16] and by using results on random Gaussian surfaces [17]. Moreover, while early 
numerical results [18, 19, 20] were quite different from analytical predictions, care- 
ful tuning of model parameters and study of large systems has recently brought 
calculations closer to expected asymptotic behaviour [17, 21]. Numerical work has 
also shown that a finite value for the zero-energy localisation length can be pro- 
duced in two-dimensional chiral models, by the same mechanism that is effective 
in one dimension. [17] 

Our aim in the following is to introduce a representative of the chiral unitary 
symmetry class, in the form of a network model [22]. In place of a Hamiltonian, it 
uses the ideas of scattering theory and may be specified by a transfer matrix, [22] 
or by a unitary evolution operator for one step of discretised time [23, 24]. The 
version we set out here for class AIII is connected in two distinct ways to a partic- 
ular network model without chiral symmetry which has been studied previously, 
the U(l) network model [22] investigated in the context of the integer quantum 
Hall plateau transition. The model with chiral symmetry is constructed by cou- 
pling two copies of the partner model. This coupling can be re-expressed after a 
transformation as equivalent to introduction of absorption and coherent amplifi- 
cation in the original models. This equivalence parallels the established link [25] 
between chiral symmetry classes and non-Hermitian random operators. Extensions 
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to the two remaining chiral symmetry classes (BDI and CII) are made explicit in 
Ref. [26]. 

The remainder of this paper is organised as follows. We show in Sec. 2 how to 
construct a network model belonging to the chiral symmetry class AIII, and discuss 
in Sec. 3 some basic aspects, including the relation to non-Hermitian models, the 
continuum limit, and a lattice version of disorder analogous to a random vector 
potential in the Dirac equation. We present results from a numerical study of the 
two-dimensional version of the model in Sec. 4, and conclude in Sec. 5. 

2 Construction of a network model with chiral symmetry 

In this section we formulate a network model for the chiral unitary class (AIII). 
Our strategy is simply to construct the two-dimensional internal space associated 
with chiral symmetry using two related copies of a network model without that 
symmetry. We focus on a two-dimensional model; models in quasi-one and three 
dimensions can be constructed in the same way, starting for example from Eq. (7). 

The symmetry of a disordered system may be discussed in terms of a Hamil- 
tonian H, a scattering matrix S', or a transfer matrix T. We are concerned with 
systems which conserve probability density, so that the Hamiltonian is Hermi- 
tian and the scattering matrix is unitary: = H and S^ = S“h The equiva- 

lent condition for the transfer matrix involves the current operator J and reads 
T~^ = JT^J. Chiral symmetry is implemented on a two-dimensional internal 
space in which the Pauli operator acts: for the Hamiltonian it is the require- 
ment that = —H. Taking the scattering matrix to have the symmetry of 

this implies that TixST,x = S“h For the transfer matrix, it appears at first 
that one has a choice, taking flux in the pairs of scattering channels on which 
acts to propagate either in opposite directions or in the same direction. In fact 
only the former is tenable. It implies that the anticommutator {J, Sx} = 0 and 
that the chiral symmetry condition is = T, a property preserved under 

matrix multiplication. By contrast, the latter choice would lead to the commuta- 
tor [Sx,T] = 0 and the chiral symmetry condition = T~^ . Since the last 

condition is not in general preserved under matrix multiplication, connection in 
series of two systems of this kind would generate a sample without chiral symme- 
try, and we therefore reject this alternative. Systems in class AIII have no other 
relevant discrete symmetry. 

We recall first the essential features of the U(l) network model for the integer 
quantum Hall plateau transition. A wavefunction in this model takes complex 
values zi on the links / of the lattice illustrated by the full lines of Fig. 2. The forms 
of the transfer matrix and of the evolution operator follow from the properties of 
the elementary building units, shown in Fig. 1. A particle acquires a phase 0/ on 
traversing link I, so that in a stationary state amplitudes at either end are related 

by 
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Figure 1: Units from which the network model is constructed: a link (left) and a 
node (right). 



In a similar way, stationary state amplitudes on the four links which meet at a 
node are related by a 2 x 2 transfer matrix 

{ \ = f cosh(a) sinh(a) \ f Z 3 \ 

\ Z2 J \ sinh(a) cosh(a) J \ Z4 J ^ ^ ^ 

where a is real and all phase factors are associated with links. This equation may 
re-written in terms of a scattering matrix as 

/ Z 3 \ _ f cos(a) - sin(a) \ f 2 i \ /o', 

\ Z 2 J y sin(a) cos(a) J \ Z 4 J ' ^ 

with sin(a) = tanh(a). The transfer matrix that results from assembling these units 
is described in detail in Ref. [22], and the time evolution operator in Refs. [23] and 
[24]. 

Introduction of a two-dimensional internal space associated with chiral sym- 
metry results in a doubling of the number of wavefunction components. In this 
way, starting from two copies of the U(l) model the link amplitudes become two- 
component complex numbers, z/. In place of Eq. (1), the scattering properties of 
a link are characterised by a 2 x 2 transfer matrix T, with 



z' =Tz. 



Requiring T>xTT>x = T, T has the form 

T = e^'t> ( cosh(6) sinh(&) \ 

sinh(6) cosh(6) y ’ ^ ^ 

where (/> is a real phase and 6 is a real hyperbolic angle. It remains to discuss 
scattering at nodes of the doubled system. We replace Eq. (2) by 

I «('“ÏW “ti"! ] f . (6) 

\ J \ smh(a) cosh(a) J\Z 4 j ^ ^ 
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where 1 is here the unit matrix in the two-component space introduced on links. 
This choice amounts to the most general one compatible with chiral symmetry, 
since all scattering within the two-component space may be included in the link 
transfer matrices, T. 

Combining these elements to make a two-dimensional system, we arrive at 
the model shown schematically in Fig. 2. The transfer matrix for the system as 
a whole acts in the [1, 1] (or [1,1]) direction and may be written as a product of 
factors relating amplitudes on successive slices of the system. Alternate factors in 
the product represent links and nodes, and consist respectively of repeated versions 
of the 2x2 and 4x4 blocks appearing in Eqs. (5) and (6). We introduce disorder 
by taking the phase 0 in Eq. (5) to be an independent, uniformly distributed 
random variable on each link, and take the parameter a, characterising scattering 
at nodes, to be non-random. We have considered two ways to set the value of the 
coupling between the chiral subspaces, which we parameterize in terms of either 
the hyperbolic angle appearing in Eq. (5) or the compact angle j3 related to b by 
sin{P) = — tanh(6): we take /3 either uniformly distributed, or b to have a normal 
distribution of variance g. We ensure that the system is statistically invariant under 
7 t/ 2 rotations of the lattice, which places requirements on the node parameter a, 
exactly as in the U(l) model: nodes lie on two distinct sub-lattices, and the node 
parameter a on one sublattice is related to the parameter a' on the other sublattice 
by the duality relation sinh(a) sinh(a') = 1. 

The same system can alternatively be described using a time evolution op- 
erator, S', in place of a transfer matrix. In order to specify this operator, which 
is unitary and has the symmetry of a scattering matrix, it is convenient first to 
consider the special case 6 = 0, in which the two copies of the network model are 
uncoupled. Let U denote the time evolution operator for one copy. Then, since 
from Eq. (5) link phases are the same in both copies but propagation directions 
are opposite, the evolution operator for the other copy is . The dimension of 
U is equal to the number of links in the system. It is useful to define a diagonal 
matrix of the same dimension, with the angles /3i for each link I as diagonal entries: 
we use /3 to denote this matrix. The time-evolution operator for the system with 
chiral symmetry can then be written 

/ cos{0) -U sin{0) W \ . . 

y sin{0) cos{0) W J ' ^ ^ 

It is straightforward to check that T.xS'Ex = and that = S~^. 

3 Discussion of the model 

In this section we discuss some basic aspects of the AIII model. These include: 
symmetry of the spectrum of the time-evolution operator; the relation to the U(l) 
model which has absorption and amplification; the continuum limit and a lattice 
analogue of random vector potential disorder in the Dirac equation. Additional 
properties are also treated in Ref. [26] . 
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Figure 2: Structure of the AIII network model. Full and dashed lines indicate the 
two U(l) models from which the system is constructed, with nodes located at the 
vertices of the two lattices. Scattering that couples the sub- systems is represented 
schematically by square boxes, shown only in the upper left plaquette. In one step 
of time evolution, flux propagates between successive points marked with fllled 
circles, in the directions indicated with arrows. 

3.1 Spectral symmetries 

The eigenphases and eigenvectors of the time evolution operator S play 
the same role for a network model as do energy eigenvalues and eigenvectors for a 
model defined using a Hamiltonian. They satisfy 

Sj/ja = • ( 8 ) 

Chiral symmetry, has the consequence that is an eigenvector 

of S with opposite eigenphase: 

ST.^^Ipa = • (9) 

Because of this, the spectrum of eigenphases is symmetric around the point E — 0 
and their density p{E) may develop a singularity there. 

The symmetries described in the preceding paragraph are the ones of fun- 
damental interest in this work. However, all network models based on the lattice 
of Fig. 2 have in addition a bipartite structure. This structure leads to symmetry 
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of the eigenphase spectrum under the translation E tx E. When combined 
with chiral symmetry, E ^ —E, the consequence is that behaviour near E = 0 is 
mirrored exactly aX E = tt, and that similar behaviour is repeated at E = =b7r/2. 



3.2 Chiral symmetry and network models with absorption 
and amplification 

Any Hamiltonian H which satisfies the chiral symmetry condition T.xH'Ex = —H 
has a block structure that is most clearly displayed after making the rotation 
R = (Ea; + In the rotated basis, the chiral symmetry condition involves 

Y>z rather than E^^, since E^ = RE,xR~^ • We have 

= ( ht 5 ) • (10) 

This establishes a decomposition of the Hermitian operator H into the operators 
h and h^, which are in general non-Hermitian. We show in this section that for a 
network model with chiral symmetry there exists a similar decomposition into a 
pair of systems, each with half the number of degrees of freedom. A single member 
of the pair, taken in isolation, can be interpreted as a system in which there is 
absorption and amplification of flux. We discuss this separation first in the context 
of the transfer matrix, and then in terms of the time evolution operator. 

The logic can also be applied in the opposite direction. In this case, just as 
Eq. (10) provides a useful relation [27] between a non-Hermitian operator h, and 
the Hermitian operator H, so the arguments below can be used to relate a network 
model without flux conservation to a doubled system in which flux is conserved. 



3.2.1 Transfer matrices with absorption and amplification 



For our model all factors entering the transfer matrix are diagonal in the two- 
dimensional space on which E^^ acts, except for the 2x2 matrices appearing in 
Eqs. (5). These matrices are diagonalised by the rotation R, which is disorder 
independent and which mixes counter-propagating channels: 



/ cosh(6) sinh(6) \ r>-i ^ ( e’’ 0 \ 

Y sinh(6) cosh(6) ) y 0 / 



( 11 ) 



Because this rotation is independent of disorder, we can apply it to the transfer 
matrix T for the system as a whole. The result reads symbolically 



RTR-^ 



Tip) 0 \ 

0 T{-h) ) • 



(12) 



Therefore T{h) is obtained from the transfer matrix for the corresponding U(l) 
model by including imaginary parts in the link phases: 



4>i ^ (f>i- ibi . 



(13) 
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3.2.2 Time evolution operators with absorption and amplification 

It is interesting to re-examine this separation of the chiral model into subsystems, 
starting from the time-evolution operator in place of the transfer matrix. We begin 
by writing the action of the time-evolution operator 5 in a way that emphasises 
the 2x2 block structure evident in Eq. (7): 




(14) 



The dimensions of the vectors Z and Z appearing in Eq. (14) are the same as those 
of the matrix U in Eq. (7). This expression can be reorganized into the form 



where K is 






^3 

^4 



U 0 

0 u 





)■ 


(15) 


cosh (6) 


sinh(6) \ 


(16) 


sinh(6) 


cosh(6) J 



Properly speaking, since Z\ and Z 2 are amplitudes of oppositely propagating 
fluxes, K should be interpreted as a transfer matrix. Despite this, our approach is 
to apply to K the rotation R, and to discuss the rotated matrix as if it were an 
evolution operator. Doing this, we find 



RKR-' - ( ^ (17) 

where 

U{±b) = U exp(dz6) . (18) 

For the special case 6 = 0, the subsystems represented by full and dashed lines in 
Fig. 2 are uncoupled. In this case one has h({0) = U, which is the time evolution 
operator for one subsystem, and its inverse for the other subsystem. For general 
6, one can regard U{±b) as time evolution operators for systems with absorption 
and amplification. However, it is only the stationary states of these systems which 
have a simple physical significance. Such states exist either for an open system with 
scattering boundary conditions (the situation represented by the transfer matrix 
T), or for a closed system if a pair of eigenphases are zero (which can be arranged 
by fine-tuning system parameters). In this case, we can define eigenvectors 



= U{b)Z+ and Z_ = U{-b)Z . . (19) 

Undoing the rotation R, the stationary states of S satisfy ^p± = Sxp±, with 







and 




(20) 
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3.3 Continuum limit 

The network model representing the quantum Hall plateau transition is known [24] 
to have a continuum limit in which the time evolution operator is that for a Dirac 
Hamiltonian in two space dimensions, with random vector and scalar potentials 
and a uniform mass which varies through zero on crossing the transition. 

Although less obvious, there is also such a a continuum limit for the AHI 
network model. We obtain the continuum limit by taking all link phases and all 
scattering amplitudes within the chiral space to be small, [28] so that in Eq. (5) 
\(f)\ 1 and |6| 1, and by taking the node parameter close to its self-dual value, 

so that in Eq. (3) a = tt/ 4 -f m/2, with |m| < 1. 

After overcoming the complication arising from the fact that the operator 
related to the bipartite structure of the network does not commute with the chiral 
symmetry operator, we obtain a Dirac Hamiltonian H by interpreting the part of 
5^ that acts within one sublattice as exp(— iJT), and expanding to lowest order in 
0/ , bi ^ Pi and m. Using the numbering of links around a plaquette specified in 
Fig. 2, we define 



y — (01 + 02 + 03 + 04)/2 , 

Ax = (04 - 02)/2 and Ay = (0i - 03)/2 (21) 

as in Ref. [24]. We also define 

W = (Æ+Æ + Æ + /34)/2, 

Bx = {Pi - Ps)/2 and By = (04 - p 2)/2 . (22) 

Finally, to put the result in a standard form, we obtain H from H using a rotation 

which takes {a ^ in H to (— in H. We have finally 

H = [{-idx - Ax)cr'^ + (~^^y ” Ay)ay + ma^ -\-V]^'Ez 

+ [Bxcr^ + Bya^ -\- W] <S>T>y . (23) 

As expected, the Hamiltonian H fulfills the chiral symmetry condition, TjxHY>x = 
—H. It can be cast in the form of Eq. (10), with 

h = {—idx — Ax)cr^ -h {—idy - Ay)a'^ + ma^ -f- V . (24) 

Here, = Ax -i- iBx^ Ay = Ay iBy and V = U -h iW are, respectively, complex 
random vector and scalar potentials. If disorder is also introduced in the mass 
m and as an additional term BzCf^ 0 Ey, if is the most general form for a four- 
component Dirac Hamiltonian with randomness in symmetry class AHI [16]. 

3.4 Models with only vector potential disorder 

There has been very extensive interest in Dirac Hamiltonians on two space dimen- 
sions, with only vector potential disorder. In this subsection we show how lattice 
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versions of such problems can be realised, using the network model. A similar treat- 
ment for tight-binding Hamiltonians has been described recently by Motrunich et 
al [17]. 

Our starting point is a special choice for the chiral scattering angles /3i, de- 
rived from a potential function as follows. The potential is defined at the points 
which are marked with filled circles on the lattice shown in Fig. 2. Suppose that, 
on the network drawn with (say) full lines in this figure, the link I runs from point 
i to point j, and let the potential take the values and at these points. The 
hyperbolic angle b is then the gradient of the potential, with 

bi = . (25) 

Introducing a diagonal matrix with as entries, one finds that the evolution 
operator U{b) appearing in Eq. (18) is a similarity transform of the operator U 

U{b) = e'^Ue-'*’ . (26) 

Likewise, one has 

u(-b) = e-'^Ue^ . (27) 

These facts enables us to determine the stationary states for a system with 
non- zero $ in terms of those for a system with $ = 0, if boundary conditions or 
fine-tuning of the disorder realisation allow such states. Specifically, suppose that 
the system without chiral scattering has a stationary state '0o, so that 

(28) 

Then the system with chiral scattering has stationary states that, in the notation 
of Eq. (19), are given by 

and Z- = e~^^o . (29) 

There are two separate applications of these ideas which are of interest. The 
first is to take the operator U to be without disorder, setting all link phases (f)i = 0. 
Choosing in addition the node parameter to have its critical value a — ttI A, U has a 
stationary state 'ipo which is constant in space. Then the wavefunctions and Z_ 
are lattice versions of the eigenstates known for the Dirac equation with random 
vector potential, whose statistical properties have been very extensively studied. 
This is supported by the fact that in this case the continuum limit of the model 
Eq. (23) simplifies to a pair of Dirac fermions with real vector potential, equivalent 
to the ones described in Ref. [17]. 

A second application is to include disorder both via $ and in U itself. Taking 
all link phases to be random, and again setting A = tt/ 4, is a critical wavefunc- 
tion for the quantum Hall plateau transition. The combinations Z± = will 

therefore have multifractal fiuctuations that are determined from an interplay of 
the distributions of 'ipo and of It seems very likely that this will result in a new 
type of critical behaviour, but we leave details for a future investigation. 
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4 Numerical results for the two-dimensional AIII model 



4.1 Introduction 



In this section we present results from a numerical investigation of the two- 
dimensional AIII network model, as defined in Fig. 2 and Eq. (7). Most of our 
results are for systems in which the hyperbolic angles bi have a Gaussian distri- 
bution of zero mean and variance g. For these systems, we examine behaviour 
as a function of the node parameter a and g, parameterizing the latter with 7, 
defined by sin(7) = tanh(y^). Hence we consider the parameter space spanned by 
0 < q: < 7 t /4 and 0 < 7 < 7 t/ 2. We also present some data for systems in which 
the angles f3i are uniformly distributed between 0 and 2tt. To set the scene, we first 
summarize what can be anticipated for behaviour of the model, from a consider- 
ation of limiting values for the parameters a and 7 and from existing analytical 
results. 

We start by recalling that at 7 = 0 the system consists of two decoupled 
copies of the U(l) network model. States in each copy are localised for all a in 
the range 0 < a < tt/4, with a localisation length that diverges at a = tt/4. 
One expects the localised phase to be stable against small perturbations, so that 
it will extend over a region in parameter space with 0 < a < tt/4 and 7 small. 
Conversely, critical states in the two U(l) models at a = tt/4 are easily mixed by 
non-zero chiral coupling bi , so that it is reasonable to guess that the critical phase 
predicted by Gade and Wegner [8, 9] will be found in the region with a close to 
7t/4 and 7 > 0. 

Properties along the (critical) line a = tt/4 follow from studies of the non- 
linear sigma model: [8, 9, 30] the renormalisation group equations 



dl 16 ’ 



(30) 



are argued [16] to be exact to all order of perturbation theory. Note that when a 
is close to but not exactly equal to 7t/4, a non-zero mean mass for the underlying 
random Dirac fermions is generated. In that case, Eq. (30) should be extended to 
incorporate it, which is beyond the scope of the present work. This non-zero mass 
leads to localisation when large enough on a scale set by 7. 

The coupling constant A parameterizes the conductivity cr, with cr= 1/(87 tA^). 
Since A does not flow under renormalisation, the critical phase is described by a 
line of fixed points and may have a range of values for its conductivity. Flow of the 
second coupling constant, A^, determines properties at non-zero energy, giving in 
particular for the density of states p{E) at small energy E the scaling law [8, 9] 



P{E) ~ ^ exp (^-Ky/ln{Eo/EŸj 



(31) 



where Eq is a microscopic energy scale. Using the results of Ref. [16], k, = 4a/2/A^. 
As emphasised in Ref. [17] such behaviour for p{E) sets in only below a crossover 
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energy scale Ec^ from Ref. [16], we estimate this scale to be given by 

Ec = Eo exp [-aa^) , (32) 

where a = (32\/27r)^. We note also that Motrunich et al have argued that the 
power law in the exponent of Eq. (31) is modified at very small energy scales, [17] 
and that their results have been reproduced within a field- theoretic approach [31]. 

Since the derivations of these results depend on a number of technical as- 
sumptions, some checks are very desirable. Unfortunately, computational work is 
challenging because it is hard to reach scales smaller than E^ and at intermediate 
energies the same analysis predicts an approximately power-law dependence of the 
density of states on energy, with an effective exponent which depends on disorder 
strength. 

In our numerical work testing these expectations, we use calculations of two 
types, both applied to quasi-one dimensional systems of width M plaquettes and 
length L plaquettes (so that the time evolution operator S appearing in Eq. (7) is a 
AML X 4ML matrix, and the transfer matrix T appearing in Eq. (12) is a 4M x 4M 
matrix). For such samples, we find the localisation length as the inverse of the 
smallest positive Lyapunov exponent for the transfer matrix, calculated using the 
standard approach [32]. Applied directly to the transfer matrix as formulated in 
Sec. 2, the resulting localisation length is that for eigenstates of the time evolution 
operator at eigenphase zero; by associating an extra phase with every link 
(breaking chiral symmetry), we are also able to find the localisation length for 
states with non-zero eigenphase. Separately, we determine the density of states for 
eigenphases of the time evolution operator S, applying a version of the recursion 
method, [33] adapted to treat a unitary operator in place of the Hermitian one for 
which the approach was originally formulated. We use the terms eigenphase and 
energy interchangeably in describing our results. 

4.2 Phase diagram 

We determine a phase diagram for the model in the a, 7 plane from a study 
of the zero-energy localisation length in quasi-one dimensional geometry, and 
its dependence on M. The quantity of central interest is the ratio ^m/M. In a 
localised phase, approaches a finite limit the bulk localisation length, for 
large M, and so the ratio <^m/M decreases with increasing M, varying as ^/M 
for sufficiently large M. By contrast, in a critical phase, the ratio approaches for 
large M a finite constant whose value can be identified with the conductivity of 
the model [34]. 

Representative data for ^m/M at M = 8, 16, 32 and 64 are shown as a func- 
tion of 7, in Fig. 3 for a = 7 ttI32. 

Two regimes of behaviour are observed: for small 7, ^m/M decreases with 
increasing M, indicating a localised phase, while for 7 close to 7t/2, ^m/M is in- 
dependent of M for the system widths studied, suggesting a critical phase. By 
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Figure 3: ^mIM at a = 77 t/ 32 as a function of 7. 



identifying the value of 7 which divides the two regimes and studying its depen- 
dence on a, we arrive at the phase diagram displayed in Fig. 4. 

Properties in the critical phase are illustrated in more detail by the behaviour 
of the ratio as a function of 7 with a = tt/ 4, shown in Fig. 5. On this line in 

parameter space, behaviour at all points is critical rather than localised, and the 
data show a finite limiting value a for the ratio at large M, with a dependent on 
7: such behaviour is expected from Eq. (30) and the identification of the coupling 
constant A as being a function only of conductivity, which is given in turn by 
^mIM. This figure also illustrates crossover of behaviour with increasing M at 
small 7, from that of the U(l) model to that of the AIII model. The crossover is 
responsible for discontinuous variation of a with 7 at 7 = 0. 

We show in Fig. 6 the energy dependence of the localisation length at a point 
in the localised phase (left panel) and at a point in the critical phase (right panel) , 
with data at L = 2.5 x 10^ for a range of system widths M, as well as an extrapo- 
lation to the two-dimensional limit. It is clear from the data that the localisation 
length in the critical phase decreases very rapidly with E away from E = 0, 

4.3 Density of states 

Our interest in the (average) eigenphase density of the time evolution operator is 
focussed mainly on behaviour near zero energy, where we expect in the critical 
phase singular behaviour approaching that of Eq. (31), and in the localised phase 
either a density vanishing quadratically with energy, as for the random matrix 
ensemble in this symmetry class, [2] or singularities arising from Griffiths strings, 
as discussed in Ref. [17]. 
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Figure 5: ^m/M aX a = tt/ 4 as a function of the 7 . Inset: the limiting value, rr, of 
this ratio for large M 
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Figure 6: Localisation length at o; = tt/ 8, ^ = 1 equivalent to 7 = 0.557 t /2 
(left) and at a = tt/ 4, ^ = 9 equivalent to 7 = 0.9367t/2 (right) as a function of E 
in a log-log scale for increasing transverse size M=2,4,8,16,24 and extrapolated to 
M = oc, indicated by the symbols 0, A, □, y, Q ^ respectively. 



In Fig. 7 we show p{E) as a function of E^ with logarithmic scales for both 
axes, for a sequence of models, with a = tt/ A and 6/ Gaussian distributed in each 
case. Members of the sequence are chosen to have successively smaller values of a, 
and hence larger values for the energy scale Ec, since by increasing 7 we decrease 
cr, as is evident from the inset to Fig. 3. The data were obtained using systems of 
width M = 16, and lengths in the range L = 10^ to L = 10^, in order to achieve 
satisfactory self- averaging. Over the energy range accessible, the variation of p{E) 
with E is approximately power- law, and can be characterised following Ref. [17] 
using an effective dynamical exponent 2:, with 

p{E) ~ . (33) 

As one approaches the asymptotic low-energy behaviour, the effective exponent 2) 
increases, ultimately to infinity: the large values of 2: reached here indicate close 
approach to the limiting low-energy behaviour. 

Finally, we examine the evolution of the density of states as one moves from 
the critical phase into the localised phase. Results are presented in Fig. 8. At the 
broad scale to which these calculations are restricted, behaviour in the localised 
phase is consistent with the power law of Eq. (33), with a power — 1 + 2/2: which, 
deep in the localised phase, is positive and increases as the localisation length 
decreases. Such behaviour was proposed as generic for localised systems with chiral 
symmetry in Ref. [17]. 
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Figure 7: Density of states p{E) as a function of E for small E, with logarithmic 
scales on both axes. Three cases of increasing disorder are plotted (with symbols 
A for = 400, 0 for p = 25 and v for p = 9). The related dynamical exponents z 
in this range of energy are respectively z = 26.0 ± 2.0, ^ = 8.0 ± 0.1 and 5.3 ±0.1. 
Symbols □ show the case j3i uniformly distributed in [0, 27 t] with 2 ; = 3.02 ± 0.03. 




Figure 8: Density of states p{E) as a function of E in the range [0,7t/16] for fixed 
p = 1, with linear scales on both axes. Four cases of decreasing a (moving into the 
localised phase) are plotted with A for o = tt/4 (critical case), 0 for a = 37t/16, 
V for a = 7t/8 and □ for o; = tt/16. 
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Generalisation 


Network 


Physics 


with absorption 
and amplification 


A (U(l) model) 


Integer QHE 


AIII 


D 


Thermal H-Tc Supra 


BDI 


C 


Spin QHE 


CII 



Table 1: The three network models in two dimensions, representatives of the three 
chiral symmetry classes, and their correspondence with the underlying network 
models. 

5 Conclusions and developments 

We have introduced a network model which is a realisation of the chiral unitary 
symmetry class AIII. We have argued that it is interesting from several points 
of view. It is useful as a starting point for numerical studies of this symmetry 
class, offering access in the two-dimensional case we have examined to both crit- 
ical and localised phases, and displaying a band-centre singularity in the critical 
phase which approaches quite closely the expected asymptotic form. The model 
also has a striking connection to the U(l) network model, but with absorption and 
amplification. Moreover, by imposing constraints on the disorder, it serves as a lat- 
tice version of problems with randomness entering only through a vector potential. 
The whole construction and study can be adapted [26] to the the chiral orthogonal 
symmetry class (BDI) as well as to the chiral symplectic symmetry class (CII). 
They are connected to two known network models (D and C) [35, 36, 37] but with 
absorption and amplification terms. This is summarized in table 1. 
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Many Body Physics: Unfinished Revolution 



Piers Coleman 



Abstract.The study of many body physics has provided a scientific playground of 
surprise and continuing revolution over the past half century. The serendipitous 
discovery of new states and properties of matter, phenomena such as superfluidity, 
the Meissner, the Kondo and the fractional quantum hall effect, have driven the 
development of new conceptual frameworks for our understanding about collective 
behavior, the ramifications of which have spread far beyond the confines of terres- 
trial condensed matter physics - to cosmology, nuclear and particle physics. Here 
I shall selectively review some of the developments in this field, from the cold- war 
period, until the present day. I describe how, with the discovery of new classes of 
collective order, the unfolding puzzles of high temperature superconductivity and 
quantum criticality, the prospects for major conceptual discoveries remain as bright 
today as they were more than half a century ago. 



1 Emergent Matter: a new Frontier 

Since the time of the Greeks, scholars have pondered the principles that govern 
the universe on its tiniest and most vast scales. The icons that exemplify these 
frontiers are very well known - the swirling galaxy denoting the cosmos and the 
massive accelerators used to probe matter at successively smaller scales - from 
the atom down to the quark and beyond. These traditional frontiers of physics are 
concerned with reductionism: the notion that once we know the laws of nature that 
operate on the smallest possible scales, the mysteries of the universe will finally 
be revealed to us [1]. 

Over the last century and a half, in a period that stretches back to Dar- 
win and Boltzmann ™ scientists have also become fascinated by another notion: 
the idea that to understand nature, one also needs to understand and study the 
principles that govern collective behavior of vast assemblies of matter. For a wide 
range of purposes, we already know the microscopic laws that govern matter on 
the tiniest scales. For example, a gold atom can be completely understood with 
the Schroedinger equation and the laws of quantum mechanics established more 
than seventy years ago. Yet, a gold atom is spherical and featureless - quite un- 
like the lustrous malleable and conducting metal which human society so prizes. 
To understand how crystalline assemblies of gold atoms acquire the properties of 
metallic gold, we need new principles - principles that describe the collective be- 
havior of matter when humungous numbers of gold atoms congregate to form a 
metallic crystal. It is the search for these new principles that defines the frontiers 
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of many body physics in the realms of condensed matter physics and its closely 
related discipline of statistical mechanics. 

In this informal article, I shall talk about the evolution of our ideas about 
the collective behavior of matter since the advent of quantum mechanics, hoping 
to give a sense of how often unexpected experimental discovery has seeded the 
growth of conceptually new ideas about collective matter. Given the brevity of the 
article, I must apologize for the necessarily selective nature of this discussion. 

The past seventy years of development in the many body physics has seen 
a period of major unexpected conceptual and intellectual developments. Experi- 
mental discoveries of remarkable new phenomena, such as superconductivity, su- 
perfluidity, criticality, high temperature superconductivity, and the quantized Hall 
effect, have each prompted a renaissance in areas once thought to be closed to fur- 
ther fruitful intellectual study. Indeed, the history of the field is marked by the 
most wonderful and unexpected shifts in perspective and understanding that have 
involved a close linkage between experiment, new mathematics and new concepts. 

I shall discuss three eras - the immediate aftermath of quantum mechan- 
ics - many body physics in the cold war - and the modern era of correlated 
matter physics. Over this period, the physicists view of the matter has evolved 
dramatically - witness the evolution in our view of “electricity” from the idea of 
the degenerate electron gas, to the concept of the Fermi liquid, to new kinds of 
electron fluid, such as a the Luttinger liquid or fractional quantum Hall state. 
Progress was not smooth and gradual, but often involved the agony, despair and 
controversy of the creative process. Even the notion that an electron is a fermion 
was controversial. Wolfgang Pauli - inventor of the the exclusion principle could 
not initially envisage that this principle would apply beyond the atom, to macro- 
scopically vast assemblies of degenerate electrons: indeed, he initially preferred the 
idea that electrons were bosons. Pauli arrived at the realization that the electron 
fluid is a degenerate Fermi gas with great reluctance, and at the end of 1925 [2] 
gave way, writing in a short note to Schroedinger that 

“With a heavy heart, I have decided that Fermi Dirac, not Einstein is the 
correct statistics, and I have decided to write a short note on paramagnetism.’’ 

Wolfgang Pauli, letter to Schrodinger, November 1925 [2]. 



2 Unsolved riddles of the 1930s 

The period of condensed matter physics between the two world- wards was charac- 
terized by a long list of unsolved mysteries in the area of magnetism and 
superconductivity [3]. Ferromagnetism had emerged as a shining triumph of the 
application of quantum mechanics to condensed matter. So rapid was the progress 
in this direction, that Neel and Landau quickly went on to generalize the idea, pre- 
dicting the possibility of staggered magnetism, or antiferromagnetism in 1933 [4]. 
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Figure 1: Two mysteries of the early era, whose ultimate resolution 30 years later linked them 
to profound new concepts about nature, (a) The Meissner effect, whose ultimate resolution led 
to an understanding of superconductivity and the discovery of the Anderson-Higgs mechanism, 
(b) The Kondo resistance minimum, which is linked to the physics of confinement. 



In a situation with many parallels today, the experimental tools required to real- 
ize the predicted phenomenon, had to await two decades, for the development of 
neutron diffraction [5]. During this period, Landau became pessimistic and came 
to the conclusion that quantum fluctuations would most probably destroy antifer- 
romagnetism, as they do in the antiferromagnetic ID Bethe chain - encouraging 
one of his students, Pomeranchuk, to explore the idea that spin systems behave as 
neutral fluids of fermions [6]. 

By contrast, superconductivity remained unyielding to the efforts of the finest 
minds in quantum mechanics during the heady early days of quantum mechanics 
in the 1920s, a failure derived in part from a deadly early misconception about 
superconductivity [3]. It was not until 1933 that a missing elements in the puzzle 
came to light, with the Meissner and Ochensfeld discovery that superconductors 
are not perfect resistors, but perfect diamagnets. [7] It is this key discovery that 
led the London brothers [8] to link superconductivity to a concept of “rigidity” in 
the many body electron wavefunction, a notion that Landau and Ginzburg were 
to later incorporate in their order parameter treatment of superconductivity [9] . 

Another experimental mystery of the 1930’s, was the observation of a mys- 
terious “resistance minimum” in the temperature dependent resistance of copper, 
gold, silver and other metals [10]. It took 25 more years for the community to 
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link this pervasive phenomenon with tiny concentrations of atomic size magnetic 
impurities - and another 15 more years to solve the phenomenon - now known as 
the Kondo effect - using the concepts of renormalization and asymptotic freedom. 

3 Many Body Physics in the Cold War 

3.1 Physics without Feynman diagrams 

Many body physics blossomed after the end of the second world war, and as the 
political walls between the east and west grew with the beginning of the cold war, 
a most wonderful period of scientific and conceptual development, with a frequent 
exchange of new ideas across the iron curtain, came into being. Surprisingly, the 
Feynman diagram did not really enter many body physics until the early 60s, 
yet without Feynman diagrams, the many body community made a sequence of 
astonishing advances in the 1950’s [11]. 



high energy 
pi as mon modes 



e-quasiparticlcs 




Figure 2: Illustrating the Pines-Bohm idea, that the physics of the electron fluid can be divided 
up into high energy collective “plasmon modes” and low energy electron quasiparticles. 



The early 1950s saw the first appreciation by the community of the impor- 
tance of collective modes. One of the great mysteries was the reason why the 
non-interacting Sommerfeld model of the electron fluid worked so well, despite in- 
teractions that are comparable to the kinetic energy of the electrons. In a landmark 
early paper, David Bohm and his graduate student, David Pines [12] realized that 
they could separate the strongly interacting gas via a unitary transformation into 
two well-separated sets of excitations - a high energy collective oscillations of the 
electron gas, called plasmons, and low energy electrons. The Pines-Bohm paper 
is a progenitor of the idea of renormalization: the idea that high energy modes of 
the system can be successively eliminated, to give rise to a renormalized picture 
of the residual low energy excitations. 

Feynman diagrams entered many body physics in the late 1950s [11]. The 
first applications of the formalism of quantum field theory to many body physics 
to bulk electronic matter, made by Brueckner [13, ] were closely followed by Gold- 
stone and Hubbard’s elegant re-derivations of the method using Feynman dia- 
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grams [14, 15]. A flurry of activity followed: Gell-Mann and Brueckner used the 
newly discovered “linked cluster theorem” to calculate the correlation energy of 
the high density electron gas [16], and Galitskii and Migdal [17, 18] in the USSR 
applied the methods to the spectrum of the interacting electron gas. Around the 
same time, Edwards [19] made the first applications of Feynman’s methods to the 
problem of elastic scattering off disorder. 

One of the great theoretical leaps of this early period was the invention of the 
concept of imaginary time. The earliest published discussion of this idea occurs in 
the papers of Matsubara [20] . Matsubara noted the remarkable similarity between 
the time evolution operator of quantum mechanics 



U{t) = e-**^/'* 


(1) 


and the Boltzmann density matrix 




p{!3) = = U{-ih0), 


(2) 



where l3 — X/iksT) and ks is Boltzmann’s constant. This parallel suggested that 
one could convert conventional quantum mechanics into finite temperature quan- 
tum statistical mechanics by using a time-evolution operator where real time is 
replaced by imaginary time. 



t —irh. (3) 

Matsubara’s ideas took a further leap into the realm of the practical, when 
Abrikosov, Gorkov and Dzyaloshinski (AGD) [21] who showed that the method 
was dramatically simplified by Fourier transforming the imaginary time electron 
Green function into the frequency domain. They noted for the first time that the 
antiperiodicity of the Green function G{t -\- 0) = —G{r) meant that the continu- 
ous frequencies of zero temperature physics are replaced by the discrete frequencies 
ujn = (2n -h 1)7 tT, that we now call the “Matsubara frequency”. In their paper, 
the finite temperature propagator 

G{u )„ , ^ = [iuj„ + (4) 

for the electron makes its first appearance. 

Another great conceptual leap of the early cold war, was Landau’s devel- 
opment of the Quasiparticle excitation concept. Landau [22], stimulated by early 
measurements on liquid He — 3, realized that interacting fermi gases could be un- 
derstood with the concept of “adiabaticity”- the notion that when interactions are 
turned on adiabatically, the original single-particle excitations of the Fermi liquid, 
evolve without changing their charge or spin quantum numbers, into “quasipar- 
ticle” excitations of the interacting system. Landau’s Fermi liquid theory is the 
foundation for the modern “standard model” of the electron fluid. 
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3.2 Broken Symmetry 

Two monumental achievements of the cold- war era deserve separate mention: the 
discovery of “broken symmetry” and the renormalization group. In 1937, Lan- 
dau [23] formulated the concept of broken symmetry - proposing that phase tran- 
sitions take place via the process of symmetry reduction, which he described in 
terms of his order parameter concept. In the early fifties, Onsager and Penrose [24], 
refined Landau’s concept of broken symmetry to propose that a superfluidity could 
be understood as a state of matter in which the two-particle density matrix 

p{r,r') = (5) 

can be factorized: 

p{r,r') = 'ip*{r)'ip{r) -h small terms (6) 

where 

i>{r) = = (AT - l|^(r)|7V). (7) 

is the order parameter of the superfiuid, ps is the superfluid density and 0 the phase 
of the condensate. This concept of “off-diagonal long-range order” soon became 
generalized to fermi systems as part of the BCS theory of superconductivity [25, 
26] , where the off-diagonal order parameter 

F(x-x') = {N-2\i^^{x)Mx')\N), (8) 

defines the wavefunction of the Cooper pair. 

One of the remarkable spin-offs of superconductivity, was that it led to an 
understanding of how a gauge boson can acquire a mass as a result of symmetry 
breaking. This idea was first discussed by Anderson in 1959 [27], and in more detail 
in 1964 [28, 29], but the concept evolved further and spread from Bell Laboratories 
to the particle physics community, ultimately re-appearing as the Higg’s mecha- 
nism for spontaneous symmetry breaking in a Yang Mills theory. The Anderson- 
Higgs mechanism is a beautiful example of how the study of cryogenics led to a 
fundamentally new way of viewing the universe, providing a mechanism for the 
symmetry breaking between the electrical and weak forces in nature. 

Another consequence of broken symmetry concept is the notion of “gener- 
alized rigidity” [30], a concept which has its origins in London’s early model of 
superfluidity, according to which, if the phase of a boson or Cooper pair develops 
a rigidity, then it costs a phase bending energy 

U{x) ~ ip,(V0(x))2, (9) 

from which we derive that the “superflow” of particles is directly proportional to 
amount of phase bending, or the gradient of the phase 



js = Ps"^(t>- 



(10) 
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Anderson noted [30] that we can generalize this concept to a wide variety of bro- 
ken symmetries, each with their own type of superfiow (see table 1). Thus broken 
translation symmetry leads to the superflow of momentum, or sheer stress, bro- 
ken spin symmetry leads to the superflow of spin or spin superflow. There are 
undoubtedly new classes of broken symmetry yet to be discovered. 



Table 1. Order parameters, broken symmetry and rigidity 



Name 


Broken Symmetry 


Rigidity / Supercurrent 


Crystal 


Translation Symmetry 


Momentum superflow 
(Sheer stress) 


Superfluid 


Gauge symmetry 


Matter superflow 


Superconduct i vity 


E.M. Gauge symmetry 


Charge super flow 


Antiferromagnetism 


Spin rotation symmetry 


Spin superflow 
(x-y magnets only) 


? 


Time Translation Symmetry 


Energy superflow ? 



3.3 Renormalization group 

The theory of second order phase transitions was studied by Van der Waals in 
the 19th century, and thought to be a closed field. Two events - the experimental 
observation of critical exponents that did not fit the predictions of mean-field 
theory [31, 32], and the solution to the 2D Ising model [33], forced condensed 
matter physicists to revisit an area once thought to be closed. The revolution that 
ensued literally shook physics from end to end, furnishing us with a spectrum of 
new concepts and terms, such as 

• scaling theory [34, 35, 36], 

• universality - the idea that the essential long-distance physics is independent 
of all but a handful of short-distance details, such as the dimensionality of 
space and the symmetry of the order parameter. 

• renormalization - the process by which short-distance, high energy physics is 
absorbed by adjusting the parameters inside the Lagrangian or Hamiltonian. 
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• fixed points - the limiting form of the Lagrangian or Hamiltonian as short- 
distance, high energy physics is removed 

• running coupling constant, 

• cross-over, 

• upper critical dimensionality - the dimension above which mean- field theory 
is valid. 

that appeared as part of the new “renormalization group” [37, 38, 39, 40]. The 
understanding of classical phase transitions required the remarkable fusion of uni- 
versality, together with the new concepts of scaling, renormalization and the ap- 
plication of tools borrowed from quantum field theory. These developments are a 
main stay of modern theoretical physics, and their influence is felt far outside the 
realms of condensed matter. 

One of the unexpected dividends of the renormalization group concept, in the 
realm of many body theory, was the solution of the Kondo effect: the condensed 
matter analog of quark confinement. By the late fifties, the resistance minima in 
copper, gold and silver alloys that had been observed since the 1930s [10], had 
been identified with magnetic impurities, but the mechanism for the minimum 
was still unknown. In the early 60’s, Jun Kondo [41] was able to identify this 
resistance minimum, as a consequence of antiferromagnetic interactions between 
the local moments and the surrounding electron gas. The key ingredient in the 
Kondo model, is an antiferromagnetic interaction between a local moment and the 
conduction sea, denoted by 

Hi = Ja{0)-S (11) 

5 is a spin 1/2 and <j(0) is the spin density of the conduction electrons at the 
origin. Kondo [41] found that when he calculated the scattering rate of electrons 
off a magnetic moment to one order higher than Born approximation, 

\<x[Jp + 2{Jpf\n^]\ (12) 

where p is the density of state of electrons in the conduction sea and D is the 
width of the electron band. As the temperature is lowered, the logarithmic term 
grows, and the scattering rate and resistivity ultimately rises, connecting the re- 
sistance minimum with the antiferromagnetic interaction between spins and their 
surroundings. 

A deeper understanding of this logarithm required the renormalization group 
concept [42, 40, 43]. By systematically taking the effects of of high frequency virtual 
spin fluctuations into account, it became clear that the bare coupling J is replaced 
by a renormalized quantity 

Jp(A) = Jp + 2{Jpf In ^ 



(13) 
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that depends on the scale A of the cutoff, so that the scattering rate is merely 
given by 1/r cc (/oJ(A))^|a~t- The corresponding renormalization equation 

^ = p{Jp) = -2{Jpf + 0( J3) (14) 

contains a “negative jS function”: the hallmark of a coupling which dies away at 
high energies (asymptotic freedom), but which grows at low energies, ultimately 
reaching a value of order unity when the characteristic cut-off is reduced to the 
scale of the so called “Kondo temperature” Tk ~ . 

The “Kondo” effect is a manifestation of the phenomenon of “asymptotic 
freedom” that also governs quark physics. Like the quark, at high energies the 
local moments inside metals are asymptotically free, but at energies below the 
Kondo temperature, they interact so strongly with the surrounding electrons that 
they become screened or “confined” at low energies, ultimately forming a Landau 
Fermi liquid [43]. It is a remarkable that the latent physics of confinement, hiding 
within cryostats in the guise of the Kondo resistance minimum, remained a mystery 
for more than 40 years, pending purer materials, the concept of local moments and 
the discovery of the renormalization group. 

3.4 The concept of Emergence 

The end of the cold- war period in many body physics is marked by Anderson’s 
statement of the concept of emergence. In a short paper, originally presented as 
part of a Regent’s lecture entitled “More is different” at San Diego in the early 
seventies [44], Anderson defined the concept of emergence with the now famous 
quote 



“at each new level of complexity, entirely new properties appear, and 
the understanding of these behaviors requires research which I think is 
as fundamental in its nature as any other.” 

Anderson’s quote underpins a modern attitude to condensed matter physics - the 
notion that the study of the collective principles that govern matter is a frontier 
unto itself, complimentary, yet separate to those of cosmology, particle physics and 
biology. 

4 Condensed Matter Physics in the New Era 
4.1 New States of Matter 

Towards the end of the 1970’s few condensed matter physicists had internalized 
the consequences of emergence. In the early eighties, condensed matter physicists 
were by enlarge, content with a comfortable notion that they had outlined the 
principle constraints on the behavior and possible ground-states of dense mat- 
ter. Superconductivity was widely believed to be limited below about 25K [45]. 
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The “vacuum” state of metallic behavior was firmly believed to be the Landau 
Fermi liquid, and no significant departures were envisaged outside the realm of 
one-dimensional conductors. Tiny amounts of magnetic impurities were known to 
be anathema to superconductivity. These principles were so entrenched in the com- 
munity that the first observation [46] of heavy electron superconductivity in the 
magnetic metal UBeis was mis-identified as an artifact, delaying acceptance of 
this phenomenon by another decade. By the end of the 80 ’s all of these popularly 
held principles had been exploded by an unexpected sequence of discoveries, in 
the areas of heavy electron physics, the quantum Hall effect and the discovery of 
high temperature superconductivity. 





Figure 3: Illustrating the binding of two vortices to each electron, to form the v> = IjS Laughlin 
ground-state. 



4.1.1 Fractional Quantum Hall Effect 

In the 1930’s Landau had discussed the quantum mechanics of electron motion in 
a magnetic field [47], predicting the quantization of electron kinetic energy into 
discrete Landau levels 



h^{kl + kl) heB, , 1, 



(n = 0, 1, 2...). 



(15) 



Landau quantization had been confirmed in metals, where it produces oscillations 
in the field-dependent resistivity (Shubnikov de Haas oscillations) and magneti- 
zation (de Haas van Alphen oscillations), and the field was thought mature. In 
the seventies, advances in semiconductor technology and the availability of high 
magnetic fields, made it possible examine two dimensional electron fluids at high 
fields, when the spacing of the Landau levels is so large that the electrons drop into 
the lowest Landau level, so that their dynamics is entirely dominated by mutual 
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Coulomb interactions. Remarkably, the Hall constant of these electron fluids was 
found to be quantized with values 

V h 

= T = — ’ (1^) 

I i>e^ 

where at lower fields, i/ = 1, 2, 3 . . . is an integer, but at higher fields, v acquires 

a fractionally quantized values i/ = l/3, 1/5, 1/7 Laughlin [48] showed that 

the fractional quantum Hall effect is produced by interactions, which stabilize a 
new type of electron fluid where the Landau level has fractionally filling factor v — 
1 / (2M+1). In Laughlin’s approach, the electron fluid is pierced by “vortices” which 
identify zero’s in the electron wavefunction. Laughlin proposed that electrons bind 
to these vortices to avoid other electrons, and he incorporated this physics into 
his celebrated wavefunction by attaching each electron to an even number 2M of 
vortices. 



i>j 



\2M+1 



exp 






(17) 



where lo = y/h/eB is the magnetic length. The excitations in this state are 
gapped, with both fractional charge and fractional statistics: an entirely new elec- 
tronic ground state. Moreover, the wavefunction is robust against the details of 
the Hamiltonian from which it is derived. 

This break-through opened an entire field of investigation [49, 50] into the 
new world of highly correlated electron physics, bringing a whole range of new 
concepts and language, such as 

• fractional statistics quasiparticles - 



• composite fermions - 



• Chern- Simons terms. 



Equally importantly, the fractional quantum Hall effect made the community 
poignantly aware of the profound transformations that become possible in elec- 
tronic matter when the strength of interactions becomes comparable to, or greater 
than the kinetic energy. 



4.1.2 Heavy Electron Physics 

The discovery of heavy electron materials in the late seventies [51, 52] forced 
condensed matter physicists to severely revise their understanding about how local 
moments interact with the electron fluid. In the late seventies, electron behavior 
in metals was neatly categorized into 

1. “delocalized” behavior, where electrons form Bloch waves, and 

2. “localized” behavior, where the electrons in question are bound near a par- 
ticular atom in the material. Such unpaired spins form tiny atomic magnets 
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called “local moments” that tend to align at low temperatures and are ex- 
tremely damaging to superconductivity. 

Heavy fermion metals completely defy these norms, for they contain a dense array 
of magnetic moments, yet instead of magnetically ordering the moments develop a 
highly correlated paramagnetic ground-state with the conduction electrons. When 
this happens, the resistivity of the metal drops abruptly, form a highly correlated 
Landau Fermi liquid in which electron masses rise in excess of 100 times the bare 
electron mass [53]. 

Heavy electron physics is, in essence the direct descendant of the resistance 
minimum physics first observed in simple metals in the early 1930’s. Our current 
understanding of heavy fermions is based on the notion, due to Doniach [54], 
that the “Kondo effect” seen for individual magnetic moments, survives inside 
the dense magnetic arrays of in heavy fermion compounds to produce the heavy 
fermion state. The heavy electrons that propagate in these materials are really 
the direct analogs of nucleons formed from confined quarks. Curiously, one of 
the most useful theoretical methods for describing these systems was borrowed 
from particle physics. Heavy electrons are formed in f-orbitals which are spin-orbit 
coupled with a large spin degeneracy = 2j + 1. One of the most useful methods 
for developing a mean- field description of the heavy electron metal is the 1/N 
expansion, inspired by analogies with the 1/N expansion in the number of colors 
in Quantum Chromodynamics [55, 56, 57, 58]. 

4.1.3 High Temperature Superconductivity 

The discovery of high temperature superconductivity, with transition tempera- 
tures that have spiraled way above the theoretically predicted maximum possible 
transition temperatures, to its current maximum of 165X, stunned the physics 
community. These systems are formed by adding charge to an insulating state 
where electrons are localized in an antiferromagnetic array. Several aspects of these 
materials radically challenge our understanding of correlated electron systems, in 
particular: 

• The close vicinity between insulating and superconducting behavior in the 
phase diagram, which suggests that the insulator and superconductor may 
derive from closely related ground-state wavefunctions [59, 60]. 

• The “strange metal” behavior of the optimally doped materials. Many prop- 
erties of this state tell us that it is not a Landau Fermi liquid, such as the 
linear resistivity 



p — Po -\- AT (1^) 

extending from the transition temperature, up to the melting temperature. 
This linear resistivity is known to originate in an electron-electron scattering 
rate F(T) ~ ksT^ that grows linearly with temperature, which has been 
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called a “marginal Fermi liquid” [61]. In conventional metals, the inelastic 
scattering rate grows quadrat ically with temperature. Despite 15 years of 
effort, the origin of the linearity of the scattering rate remains a mystery. 

• The origin of the growth of a pseudogap in the electron spectrum for “under- 
doped” superconductors. This soft gap in the excitation spectrum signals the 
growth of correlations amongst the electrons prior to superconductivity, and 
some believe that it signals the formation of pairs, without coherence [62]. 

The radical simplicity of many of the properties of the cuprate superconductors 
leads many to believe that their ultimate solution will require a conceptually new 
description of the interacting electron fluid. 




x=0 19 Qcp? 



Figure 4: Schematic phase diagram for cuprate superconductors, where x is doping and T the 
temperature, showing the location of a possible quantum critical point. 



The qualitative phase diagram is shown in Fig(4), showing three distinct 
regions - the over-doped region, the fan of “marginal Fermi liquid behavior” and 
the under-doped region. The theoretical study of this phase diagram has proven 
to be a huge engine for new ideas, such as 

• Spin charge separation- the notion that the spin-charge coupled electron 
breaks up into independent collective charge and spin excitations, as in one 
dimensional fluids. 

• Hidden order - the notion that the pseudo-gap is a consequence of the for- 
mation of an as- yet unidentified order parameter, such as orbital magnetism 
(d-density waves) [63, 64] or stripes [65]. 
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• Quantum criticality - the notion that the strange-metal phase of the cuprates 
is a consequence of a “quantum critical point” around a critical doping of 
about Xc ~ 0.2 [66] In this scenario, the pseudo-gap is associated with the 
growth of “hidden order” and marginal Fermi liquid behavior is associated 
with the quantum fluctuations emanating from the quantum critical point. 

• Pre-formed pairs - the idea that the under-doped pseudo-gap region of the 
phase diagram is a consequence of the formation of phase- incoherent pairs 
which form at the pseudo-gap temperature [62]. 

• Resonating Valence Bonds - the idea that superconductivity can be regarded 
as a fluid of spinless charged holes, moving in a background of singlet spin 
pairs. [59] 

• New forms of gauge theory, including Z 2 [67], SU {2) [68] and even super- 
symmetric gauge theories [69] that may describe the manifold of states that 
is highly constrained by the strong coulomb interactions between electrons 
in the doped Mott insulator. 

Many of these ideas enjoy some particular realization in non-cuprate materials, 
and in this way, cuprate superconductivity has stimulated a huge growth of new 
concepts and ideas in many body physics. 

4.2 Quantum Criticality 

The concept of quantum criticality: the idea that a zero temperature phase transi- 
tion will exhibit critical order parameter fluctuations in both space and time, was 
first introduced by John Hertz during the hey-days of interest in critical phenom- 
ena, but was regarded as an intellectual curiosity. [70] Discoveries over the past 
decade and a half have revealed the ability of zero-temperature quantum phase 
transitions to qualitatively transform the properties of a material at finite tem- 
peratures. For example, high temperature superconductivity is thought to be born 
from a new metallic state that develops at a certain critical doping in copper- 
perovskite materials. [66] Near a quantum phase transition, a material enters a 
weird state of ’’quantum criticality”: a new state of matter where the wavefunc- 
tion becomes a fluctuating entangled mixture of the ordered, and disordered state. 
The physics that governs this new quantum state of matter represents a major 
unsolved challenge to our understanding of correlated matter. 

A quantum critical point (QCP) is a singularity in the phase diagram: a 
point X = Xc àt zero-temperature where the characteristic energy scale /csTo(x) of 
excitations above the ground-state goes to zero. (Fig. 1.). [71, 72, 73, 74, 75, 76] The 
QCP affects the broad wedge of phase diagram where T > To{x). In this region 
of the material phase diagram, the critical quantum fluctuations are cut-off by 
thermal fluctuations after a correlation time given by the Heisenberg uncertainly 
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principle 



T 



h 

1 ^' 



(19) 



As a material is cooled towards a quantum critical point, the physics probes 
the critical quantum fluctuations on longer and longer time-scales. Although the 
“quantum critical” region of the phase diagram where T > To{x) is not a strict 
phase, the absence of any scale to the excitations other than temperature itself 
qualitatively transforms the properties of the material in a fashion that we would 
normally associate with a new phase of matter. 



r ~ h/kuT 




Quantum Critical Point 



Figure 5: Quantum criticality in heavy electron systems. For x < Xc spins become ordered 
for T < To{x) forming an antiferromagnetic Fermi liquid ; for x > Xc, composite bound-states 
form between spins and electrons at T < To(x) producing a heavy Fermi liquid. “Non-Fermi 
liquid behavior”, in which the characteristic energy scale is temperature itself, and resistivity is 
quasi-linear, develops in the wedge shaped region between these two phases. The nature of the 
critical Lagrangian governing behavior at Xc is currently a mystery. 

Quantum criticality has been extensively studied in heavy electron materials, 
in which the antiferromagnetic phase transition temperature can be tuned to zero 
by the application of a pressure, held or chemical doping. Close to quantum criti- 
cality, these materials exhibit a number of tantalizing similarities with the cuprate 
superconductors [76]: 

• a predisposition to form anisotropic superconductors, 

• the formation of a strange metal with quasi-linear resistivity in the critical 
region 

• the appearance of temperature as the only scale in the electron excitation 
spectrum at criticality, reminiscent of “marginal Fermi liquid behavior” 
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Hertz proposed that quantum criticality could be understood by extending clgissi- 
cal criticality to order parameter fluctuations in imaginary time, using a Landau 
Ginzburg functional that includes the effects of dissipation: 

pl/T p 

F = j^ dT j d<^x{\{V + + U\iPŸ} + Fd (20) 

where Qo is the ordering vector of the antiferromagnet, ^ the correlation length 
which vanishes at the QCP and 

= ( 21 ) 

is a linear damping rate derived from the density of particle-hole excitations in the 
Fermi sea. An important feature of this Lagrangian is that the momentum 
dependence enters with twice the power of the frequency dependence, the time 
dimension counts as z = 2 space dimensions, and the effective dimensionality of 
the theory is 

D = d -\- z — d 2^ {22^ 

so that D = 5 for the three dimensional model, pushing it above its upper critical 
dimension. 

In heavy electron materials, there is a growing sense that the Hertz approach 
can not explain the physics of quantum criticality. Many of the properties of the 
QCP, such as the appearance of non-trivial exponents in the quantum spin cor- 
relations, with T as the only energy scale, suggest that the underlying critical 
Lagrangian lies beneath its critical dimension. Also, all experiments indicate that 
the energy spectrum of the quasiparticles in the Landau Fermi liquid either side 
of the QCP, telescopes to zero, driving the masses of quasiparticle excitations to 
infinity, and pushing the characteristic Fermi temperature to zero at the quantum 
critical point. Yet the Hertz model predicts that most the electron quasiparticle 
masses should remain finite at an antiferromagnetic QCP. 

This has led some to propose that unlike classical criticality, we can not use 
Landau Ginzburg theory as a starting point for an examination of the fluctuations: 
a a new mean-held theory must be found. One of the ideas of particular interest, 
is the idea of “local quantum criticality”, whereby the quantum fluctuations of 
the spins become critical in time, but not space at a QCP. Another idea, is that 
at a heavy electron quantum critical point, the heavy electron quasiparticle dis- 
integrates into separate spin and charge degrees of freedom. Both ideas require 
radically new kinds of mean-field theory, raising the prospect of a discovery of a 
wholly new class of critical phenomenon [76]. 

I should add that Chapline and Laughlin have suggested that quantum crit- 
icality may have cosmological implications, proposing that the event horizon of a 
black hole might be identified with a quantum critical interface where the char- 
acteristic scales of particle physics might, in complete analogy with condensed 
matter, telescope to zero [77]. 
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Figure 6: The “axis of complexity”. 



5 The nature of the Frontier 

This article has tried to illustrate how condensed matter physics has had a central 
influence in the development of our ideas about collective matter, both in the lab, 
and on a cosmological scale. Many simple phenomena seen in the cryostat, illus- 
trate fundamentally new principles of nature that recur throughout the cosmos. 
Thus, the discovery of superconductivity and the Meissner effect has contributed 
in a fundamental way to our understanding of broken symmetry and the Anderson 
Higg’s mechanism. In a similar way, the observation of the resistance minimum in 
copper, provides an elementary example of the physics of confinement, and required 
an understanding of the principles of the renormalization group for its understand- 
ing. The interchange between the traditional frontiers - and the emergent frontier 
of condensed matter physics is £ls live today, as it has been over the past four 
decades - for example - insights into conformal field theory gained from the study 
of 2D phase transitions [78] currently play a major role in the description D-brane 
solitons [79] in superstring theory. In the future, newly discovered phenomena, 
such as quantum criticality are likely to have their cosmological counterparts as 
well. 

One way of visualizing the frontier, is to consider that in the periodic table, 
there are about 100 elements. As we go out along the complexity axis(Fig. 6), 
from the elements to the binary, tertiary and quaternary compounds, the num- 
ber of possible ordered crystals exponentiates by at least a factor of 100 at each 
stage, and with it grows the potential for discovery of fundamentally new states 
of matter. Only two years ago - a new high temperature superconductor MgB 2 
was discovered amongst the binary compounds - and the vast phase space of qua- 
ternary compounds has barely been scratched by the materials physicist. This is a 
frontier of exponentiating possibilities, spanning from the simplest collective prop- 
erties of the elements, out towards the most dramatic emergent phenomenon of all 
- that of life itself. 

Curiously, this new frontier continues to preserve its links with technology and 
applications. During the past four decades, the size of semi-conductor memory has 
halved every 18 months, following Moore’s law [80]. Extrapolating this unabated 
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trend into the future, sometime around 2020, the number of atoms required to 
store a single bit of information will reach unity, forcing technology into the realm 
of the quantum. Just as the first industrial revolution of the early 19th century 
was founded on the physical principles of thermodynamics, and the wireless and 
television revolutions of the 20th century were built largely upon the understanding 
of classical electromagnetism, we can expect that technology of this new century 
will depend on the new principles - of collective and quantum mechanical behavior 
that our field has begun, and continues to forge today. 
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Multi-Terminal Carbon Nanotube Networks 

Reinhold Egger 



Abstract. Coulomb interaction effects have very pronounced consequences in car- 
bon nanotubes. I discuss some recent work on the theoretical description of these 
correlations and their effect on mesoscopic transport properties. The Luttinger liq- 
uid behavior of nanotubes could allow to detect fractional statistics and fractional 
charge. In general, multi-terminal setups will be necessary to probe these aspects, 
and a transport theory for such systems is only beginning to emerge. 

The Landauer-Büttiker approach to transport in mesoscopic systems has been 
very successful in describing non-interacting electrons by using a scattering matrix 
formulation [1]. It is an important challenge to generalize this approach to the 
case of strongly correlated electrons. In Ref. [2], such a theory was proposed for 
A^-terminal star-like networks of interacting ID quantum wires (QWs) described 
by Luttinger liquid (LL) theory. The LL is the generic state of the interacting 
electron liquid in one dimension [3]. Below I will discuss the proposal [2] in some 
detail. The two-terminal setup {N — 2) has been formulated and solved previously 
[4, 5]. However, the step from N = 2 to N = where three individually contacted 
QWs meet at a single node (“Y junction”), is non-trivial yet essential for the 
development of a practically useful transport theory for interacting electrons. In 
fact, very recently several theoretical works appeared where precisely this problem 
has been under study. While in Ref. [6] a weakly coupled “Kondo” node was 
considered, other authors used perturbation theory in the hopping [7] and/or the 
interaction [8]. At the same time, it has become clear that a more general approach 
is necessary to go beyond those special situations. Below we formulate boundary 
conditions that allow for the explicit solution of this transport problem. Progress 
along these lines is also likely to sharpen our understanding of conformal field 
theory with boundary conditions. 

This problem is not only of intellectual interest but also of relevance to trans- 
port experiments for carbon nanotubes [9]. As has been predicted [10] and observed 
in a series of beautiful experiments [11], single- wall nanotubes provide a realiza- 
tion of LL physics. Electron-electron interactions cause remarkably pronounced 
non-Fermi liquid behaviors characterized by the standard LL parameter g ^ 0.25 
(where ^ = 1 is the non-interacting value). Template-based chemical vapor de- 
position [12, 13] and electron beam welding methods [14] allow to fabricate and 
contact nanotube Y junctions, and the intrinsic nonlinear I — V characteristics 
of such a device has been observed recently [12]. In addition, the case N = A 
has been realized by several groups using two crossed nanotubes [15], providing 
another interesting application. 
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Eventually it could even be possible to access the fractional statistics of LL 
quasiparticles through the noise properties of such a device, thereby realizing a 
Hanbury-Brown-Twiss setup for fractionalized quasiparticles [7]. At this point it 
is important to stress that in a two-terminal device, neither fractional charge nor 
fractional statistics can be observed [16, 17, 18]. This remarkable fact is due to the 
non-chiral nature of the electron liquid in nanotubes, and has to be contrasted with 
the chiral LL in the fractional quantum Hall edge states [19]. The two-terminal 
shot noise of a nanotube with an impurity is due to backscattered quasiparticles 
constructed out of the zero-modes and plasmons of the bosonized LL theory [20] 
carrying a fractional charge e* = ge. Naively, one might expect that what happens 
at the impurity should be observable in the shot noise, namely that the two- 
terminal shot noise in the weak impurity limit can be written as e* times the 
backscattered current. On the other hand, as noted previously for the average 
current [4, 5], the coupling to Fermi liquid reservoirs needs to be incorporated 
explicitly in the model. To experimentally access the expected quasiparticle charge 
then requires more complicated multi- terminal setups. 

Following Ref. [2], we study N single- channel spinless QWs at — L < a: < 0 
described by LL theory merging at a common node at a: = 0. For simplicity, we 
assume the same interaction constant g and Fermi velocity uf in each QW, with 
straightforward generalization also to include spin and flavor degrees of freedom 

[10] . Theory then has to address (i) the inclusion of applied voltage sources and 

(11) how a consistent treatment of the node can be achieved. Let us start with the 
first issue. Like in the two-terminal case, adiabatically coupled external reservoirs 
held at electro-chemical potentials fii lead to radiative boundary conditions [4] (we 
put h = 1) 



2 



Pi,R{~L) -f 




pi,L{-L) 



pi 

2ttvf ’ 



(1) 



where Pi^r/l{^) is the right/left-moving part of the density in QW L These bound- 
ary conditions only depend on the current injected into each QW from the respec- 
tive reservoir, which in turn does not depend on the backscattering happening 
later on at the node or within the QW. Here we discuss the computation of the 
nonlinear conductance matrix normalized to 



- 



h dh 

e dfij ’ 



( 2 ) 



where the current Ii flowing through QW i is oriented towards the node, leaving 
noise properties to a future publication. Below applied voltages Ui are defined as 



elli = Pi - g , 




i 



(3) 



Under this definition, gauge invariance (conductance matrix is invariant under a 
uniform change of all /li) is automatically fulfilled if the Gij depend only on the Ui. 
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Next we address the physics arising at the node x - 0. Conservation of charge 
enforces the Kirchhoff node rule 

^/,=0. (4) 

i 

A second requirement at the node is the wavefunction matching via the S matrix 

[ 21 , 22 ] 

^l(O) = S^h{0), (5) 

where "^{x) = (^i , . . . , -0^) contains the wavefunctions for the N QWs, and 
the outgoing (left-moving) components ^ l are connected to the incoming (right- 
moving) states n via an appropriate N x N matrix see, e.g. Ref. [23], for 
specific choices at A" = 3. Note that the scattering matrix in Eq. (5) provides a 
“bare” (microscopic) description of the node properties, while interactions could 
dynamically generate some different boundary condition at low energy scales. 

Unfortunately, a boundary condition like Eq. (5) is very difficult to handle 
for correlated electronic systems and typically does not allow for progress. Here 
we proceed differently by constructing a wide class of practically important S 
matrices (albeit not all possible ones [23]) in the following way. We first consider an 
idea] system composed of impurity-free QWs symmetrically connected at the node. 
Microscopically, the corresponding non-interacting problem could be modeled as 
N tight-binding chains with equal hopping matrix element to, where the “last” site 
of each chain is connected to the common node site via the same and all on-site 
energies are equal. Such a node corresponds to the special highly symmetric S 
matrix 

/ {2-N)/N 2/N ••• 2/N 

2/N {2-N)/N ••• 2/N 

o — 

\ 2/N 2/N {2-N)/N 

For this S matrix, Eq. (5) directly implies 0i(O) = • • • = 0/v(O) for the components 
of ^ Since phases in 0^(0) can be gauged away, a density matching 

condition upon approaching the node results, 

pi(0) = ---=p/v(0), (7) 

where p0O) denotes the total electronic density in QW i close to the node. Remark- 
ably, the conditions (1), (4) and (7) then allow to explicitly solve this transport 
problem for arbitrary interactions because the condition (7) does not involve wave- 
function phases but only amplitudes. To arrive at more general S matrices, in a 
second step we then consider additional impurities in the different QWs displaced 
slightly away from the node. Inclusion of impurities does not cause conceptual 
difficulties, and such a modeling allows to construct all S matrices of practical 
interest. 
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Let US first discuss the ideal node defined by the bare scattering matrix 
(6). For ^ = 1, the conditions (1), (4) and (7) are in fact equivalent to the stan- 
dard Landauer-Biittiker approach. This is easily seen by using the usual scattering 
states, e.g. the state injected into QW i = 1: 



/ ^1 \ 


/ ^ikx _|_ ^^-ikx \ 




V'2 


= 


t^-ikx 


(8) 


\ i’N ) 




te~ikx j 





with reflection (transmission) amplitude r (t). For ÿ = 1, the boundary conditions 
(1) are equivalent to a Fermi distribution for occupying the states (8), while Eq. (4) 
gives 1 = r -(- (N — 1)L Furthermore, the density matching condition (7) yields 
|1 + rp = |t|^. Combining both equations immediately gives r = {2 — N)/N and 
t = 2/N, and hence reproduce Eq. (6). The conductance matrix is then 

= 2/Nf , = -(2/Nf. (9) 

Will the conductance matrix for this ideal case be affected by interactions {g < 1)? 
Based on the two-terminal case [4], one might suspect that there is no renormal- 
ization because of the Fermi-liquid character of the leads, and hence Eq. (9) would 
stay valid for arbitrary g. However, it turns out that for small applied voltages and 
low temperatures, the system always flows to the disconnected-node ûxed point, 

Gij = 0 . ( 10 ) 

Thus the only stable generic fixed point of this system for any g < 1 represents an 
isolated node weakly connected to N broken- up QWs, even for an arbitrary S ma- 
trix of the node. The corrections to Eq. (10) due to finite Ui or T are then sensitive 
to interactions. This phenomenon is a consequence of the strong correlations in the 
LL, which here induce asymptotically vanishing currents even in a perfectly clean 
(impurity- free) system. Equation (10) also implies that open boundary bosoniza- 
tion [24] allows to access the asymptotic low-energy transport properties of a QW 
network. 

For arbitrary g and sufficiently far away from the node such that Friedel 
oscillations [25] have decayed and the boundary conditions (1) hold, the left- and 
right-moving densities must combine to give 



evF{pi,R - Pi^l) = — ’ 



Pi — Pi,R + Pi,L = 



2ttvf 



(11) 

( 12 ) 



Here the matrix Tij has been defined whose entries depend on g and all The Tij 
reduce to standard Landauer-Biittiker transmission probabilities for p = 1 , but in 
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general cannot be interpreted as single-particle quantities. It is important to stress 
that Eqs. (11) and (12) are consistent with both the LL equation of motion and 
the boundary conditions (1) for arbitrary Tij. The Kirchhoff node rule can then 
be satisfied by requiring 

N 

i=l 

and we now use the density matching conditions (7) to obtain the Tij and hence 
the conductance matrix. We mention in passing that the usual relation Tij = 1 
should not be enforced since gauge invariance under uniform potential shifts in all 
reservoirs has been encoded in Eq. (3) already. 

At this point it is crucial to realize that close to the node, the density Pi(x) 
will deviate from the spatially homogeneous value pi in Eq. (12) due to Friedel 
oscillations. This happens already for p = 1, as can be easily checked from Eq. (8) 
and the subsequent solution. The total density very close to the node is Pz(0) = 
pi + <^Pi(0): where Spi{0) is the Friedel oscillation amplitude at the node location. 
The 2kF oscillatory Friedel contribution 6pi{x) in QW i arises due to interference 
of the incoming right-movers and the left-movers that are backscattered at the 
node. Importantly, left-movers that are transmitted from the other N — I QWs 
into QW i cannot interfere with the incoming right-mover and will therefore not 
contribute to Spi(x). This implies that Spi{x) is identical to the corresponding 
Friedel oscillation in a two-terminal setup with the same (bare) reflection coefficient 
as the one induced by the clean node, R= {1 — 2/N)^. Fortunately, this allows us 
to obtain 6pi{0) in an exact manner using the relation [4] 



SMO) = (14) 

TTVf 

where the “four-terminal voltage” parameter Vi is found from a self-consistency 
equation. This equation is explicitly given and solved for arbitrary g in Ref. [5], 
and using this solution, we also have the pinning amplitude (14) of the Friedel 
oscillation for any value of g. The relation to the two- terminal problem is not 
a necessary ingredient for our calculation, but quite convenient in allowing to 
compute ^Pi(O) from known results. 

For concreteness, in what follows we consider g — 1/2 where this self- 
consistency equation is quite simple {ks = 1), 



2Tb 



/I TB + ie{Uj-V,l2) 
V2 2ttT 



(15) 



where t/; is the digamma function and Tb = juOc for bandwidth ujc and impurity 
strength A of the corresponding two- terminal problem. (Specifically, for TV = 3, to 
match the reflection coefficient of the ideal Y junction, ttA/c^c = 1/2. The solution 
to Eq. (15) then yields as a function of IJi alone, which, however, itself depends 
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on all the chemical potentials, see Eq. (3). The density matching conditions (7) 
are then solved by enforcing 

{l + T,j)Uj-2Vi = TkjU, 



for all pairs k ^ j = 1, • • • A^, and with the Kirchhoff rule we find 






2-N 2{N-l)Vi 
N ^ mji 
2 2K 
N NUi 



(16) 

(17) 



Note that the Tij depend only on the applied voltages Ui but not on the fii. As 
a result, gauge invariance is ensured. The relations (16) and (17) represent the 
complete solution for the special S matrix (6). 

Before proceeding to more general S matrices, let us discuss these results for 
the ideal Y junction (N = 3) àt g = 1/2. Prom Eq. (2), we get 







( dVk 

\dUk 



where i ^ j k in the second equation. Note that the conductance matrix is 
symmetric. For eUi <C T, the linear conductances follow: 



Gij = {20ij - 2/3) 



1 — c'0'(c + 1/2) 
1 + cip'{c + 1/2) 



where c = Tb/2ttT and ip' is the trigamma function. As T ^ 0, the conductance 
matrix approaches the stable fixed point (10), Gij ~ {T/TbY> In general, for 
p < 1, we find Gij ~ as T 0. Since this power law coincides with 

the prediction of open boundary bosonization around (10), this also provides a 
consistency check for our calculation. Corresponding power laws also govern the 
nonlinear conductances. Clearly, interactions have a rather spectacular effect on 
the transport properties of this system. 

Next we briefly outline how to construct more general node S matrices based 
on this solution of the ideal junction for arbitrary g. The idea is to add impurities 
of strength in each QW close to the node, Xi — l//ci?, which will modify the 
bare S matrix; for explicit calculations, see Ref. [21]. One can then compute the 
Gij for this more general case, but still allowing for arbitrary p < 1, e.g. by using 
perturbation theory around the above solution of the ideal junction. Focusing on 
N = 3 and just one impurity, A2 = A3 = 0, to lowest order in Ai, a straightforward 
calculation gives 4 = 7° + Sli, where if is the current through QW i for A^ = 0 
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discussed above, and 



5h = 



X 



-2ec/+ — sin(7T5+) cos(7Tÿ+) 

Uc 

r(l-25+)(27r/0/ea;T"''', 



(18) 



where — 4^/3 and Obviously, this perturbative estimate 

breaks down at very small energy scales for gj^ < 1 but is valid for all energies 
at > 1. It is straightforward to perform similar calculations for more than one 
impurity, other N , and/or higher orders in the A^. From Eq. (18) and generaliza- 
tions, we infer that Gij = 0 is indeed the only stable fixed point. It is also clear 
that for at least one very strong impurity A^, the system will reduce to one of the 
special cases considered previously [6, 7, 8]. For g close to 1, we can also make 
explicit contact to Ref. [8]. For S matrices close to Eq. (6) and weak interactions, 
our solution indeed reproduces the results of Ref. [8] . Note that the restriction to 
weak interactions allows to easily treat more general S matrices [8] . 

To conclude, I have discussed a Landauer-type theory for strongly interact- 
ing electrons in branched quantum wires such as carbon nanotube Y junctions. 
A broad class of S matrices can be covered by formulating a suitable boundary 
condition (“density matching”) to describe an ideal symmetric junction, and on 
top adding effective impurities in the individual wires. The only stable fixed point 
corresponds to disconnected quantum wires, with corrections revealing the Lut- 
tinger liquid physics via various power laws. This setup may be important in future 
proposals for detecting electron fractionalization. 

I wish to thank S. Chen and B. Trauzettel for the collaboration on the topics 
of this article. Support by the DFG under the Gerhard-Hess program is acknowl- 
edged. 
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Dynamical Response of Quasi ID Mott Insulators 

Fabian H.L. Essler and Alexei M. Tsvelik 



Abstract. At low energies certain one dimensional Mott insulators can be described 
in terms of an exactly solvable quantum field theory, the U(l) Thirring model. 
Using exact results derived from integrability we determine dynamical properties 
like the frequency dependent optical conductivity and the single-particle Green’s 
function. We discuss the effects of a small temperature and the effects on interchain 
tunneling in a model of infinitely many weakly coupled chains. 



I Introduction 

The problem of the Mott metal-insulator transition, which is a paradigm for the 
importance of electron-electron interactions, has been the subject of great interest 
[1] since the pioneering works by Mott [2]. The intrinsic difficulty in describing 
the Mott transition quantitatively is that it occurs when the kinetic and potential 
energies are of the same order. This regime is difficult to access from either the 
“band” (where one diagonalizes the kinetic energy first and then takes electron- 
electron interactions into account perturbatively) or the “atomic” (where one di- 
agonalizes the electron-electron interaction first and then takes the hopping into 
account perturbatively) limits. In recent years much progress in understanding 
the Mott transition has been made based on the so-called Dynamical Mean Field 
theory, which considers a lattice in infinitely many dimensions D oo (see Ref. [3] 
and references therein). Here we are concerned with the rather different situation, 
where the tunneling along one lattice direction is much larger than along all others. 
The resulting system can be thought of in terms of weakly coupled chains of elec- 
trons and will be referred to as a quasi one dimensional Mott insulator. When the 
band is half-filled and the interchain tunneling is switched off, Umklapp processes 
dynamically generate a spectral gap M and we are dealing with an ensemble of 
uncoupled ID Mott insulating chains. The same Umklapp scattering mechanism 
can generate gaps at any commensurate filling e.g. quarter filling, but only if the 
interactions are sufficiently strong. There are two questions we want to address: 
(i) What is the dynamical response of the uncoupled Mott-insulating chains sys- 
tem? and (ii) What are the effects of a weak interchain tunneling? Examples of 
materials that are believed to fall into the general category of quasi- ID Mott in- 
sulators are the Bechgaard salts [4] and chain cuprates like SrCu02, Sr 2 Cu 03 ^ 
or PrBa 2 Cus 07 . They have been found to exhibit very rich and unusual physical 
properties such as spin-charge separation [5, 6]. 

^More precisely, these compounds are considered to be charge-transfer insulators. 
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The Models 

The simplest models used in the description of one dimensional Mott insulators 
are “extended” Hubbard models of the form 

H = [4,<rCn+i,CT + h.c.] + U ^nk,-\nk,i + EbE UkUk+j , (1) 

n,a k j>l 

where rik^a — c\. and Uk = n/c,| + nfc,| are electron number operators. The 
electron-electron interaction terms mimic the effects of a screened Coulomb inter- 
action. Two cases are of particular interest from the point of view of application to 
e.g. the Bechgaard salts [4, 7]: (1) Half filling (1 electron per site) and (2) Quarter 
filling (1 electron per 2 sites). We will discuss both these cases and emphasize 
similarities and differences in their dynamical response. 

II Field theory description of the low energy limit 

The field theory limit is constructed by keeping only the low-energy modes in the 
vicinity of the Fermi points ±/cf. We may express the lattice electron annihilation 
operator in terms of the slowly varying (on the scale of the lattice spacing ao) right 
and left moving electron fields R(x) and L{x) 

Ci^cr — vW Ra{x) -f- exp(-z/cF 2 ;) L^{x)] . (2) 

Here kF = 7r/2ao for the half-filled band and x = la^. The resulting Fermion 
Hamiltonian can then be bosonized by standard methods [8]. 

A. Half FiUed Band 

For the half-filled case we have 

^ exp , 

RUx) = [/3$, + i©c])exp(^^[$, + e,]), (3) 

where r]a are Klein factors with {r]a,r]b} = 2Sab and where /| = 1, /| = — 1 The 
canonical Bose fields and their respective dual fields 0s,c are given by 

T 0a , Ba = (f>a ~ , d ^ S,C , (4) 

where the chiral boson fields (j)a and 0a fulfill the following commutation relations 

[0a(a:),0a(2/)] = 27ri , a = c,s. (5) 

^The phase factors in (3) have been introduced in order to ensure the standard bosonization 
formulas foe the staggered magnetizations. 
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We choose a normalization such that for \x — y\ — > 0 the following operator 
product expansion holds 

exp {ia^c{x)) exp (i/3$c(?/)) — ' 1^: - exp {ia^c{x) + iP^dv)) • (6) 

Applying the bosonization identities we obtain the following bosonic form of the 
low energy effective Hamiltonian 



n 


= Hc + n, , 




Ws 


= ^ , 




Wc 


= ^ [(5cc0c)^ + + 2/icCOS/3$c • 


(7) 



Here the spin and charge velocities Vg^c and the parameters p and fic are functions 
of U and Vj. The Hamiltonian (7) exhibits spin-charge separation: Hc,s describe 
charge and spin degrees of freedom respectively, which are independent of one 
another. The pure Hubbard model corresponds to the limit P ^ 1 and the effect 
of Vj is to decrease the value of p. From the form (7) we can deduce a number 
of important properties. Firstly, the spin sector is gapless and is described by a 
free bosonic theory. Hence correlation functions involving (vertex operators of) the 
spin boson and its dual field ©s can be calculated by standard methods [8]. 
Excitations in the spin sector are scattering states of gapless, charge less spin ^ 
objects called spinons. The charge sector of (7) is a Sine-Gordon model (SGM). 
Here excitations in the regime P > are scattering states of gapped charge ±e 
excitations called soliton and antisoliton respectively. In the context of the half- 
filled Mott insulator these are also known as holon and antiholon. Soliton and 
antisoliton have massive relativistic dispersions, 

E{P) = ^/WTvfP^ , (8) 



where M is the single-particle gap. For Vj = 0, j > 3 the gap scales as 



M : 



St 






1 - X 
1 -h X 



{gx+2)l4gx 



(9) 



where we have fixed the constant factor by comparing to the exact result for the 
Hubbard model, and where 



ru -2 Vi-\-2V2\ 

\U + m + 2V2) 



g - {U + 6 V 1 V 2 V 2 )l 2 nt . (10) 

We note that the gap vanishes on the critical surface U — 2V\ + 2 V 2 = 0 separating 
the Mott-insulating phase with gapless spin excitations from another phase with 
a spin gap. 
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In the regime 0 < /5 < 1/v^, soliton and antisoliton attract and can form 
bound states. In the SGM these are usually known as “breathers” and correspond 
to excitons in the underlying extended Hubbard lattice model. There are 



N- 



-1-p 

/?2 



different types of excitons, where [x] in (11) 
exciton gaps are given by 

Mn = 2Msin(n7T^/2) , 



-J (11) 

denotes the integer part of x. The 
n = l,...,N, (12) 



where 




(13) 



B. Quarter Filled Band 



In the quarter filled case there are no simple Umklapp processes that can open a 
gap in the charge sector. As a result the quarter-filled extended Hubbard model 
is metallic in the weak coupling regime. However, integrating out the high energy 
degrees or freedom generates “double Umklapp” processes involving four electron 
creation and annihilation operators each [9, 10]. For small U, Vj these processes are 
irrelevant, but increasing U, Vj decreases their scaling dimension until the double 
Umklapp term eventually becomes relevant. The low-energy effective Hamiltonian 
is identical to (7), but the relations between the Fermi operators and the Bose 
fields are different 



= 

4 (^) = 






exp 






exp 



(-) 






C ) exp (-^ - 0 *]) 



(14) 



Although the low-energy effective Hamiltonian is the same as for the half-filled 
Mott insulator, the physical properties in the quarter- filled case are rather dif- 
ferent. Firstly, the insulating state emerging for sufficiently large U, Vj at quarter 
filling is generated by a different physical mechanism (double Umklapp scattering) 
as compared to half filling (Umklapp scattering) and concomitantly is referred to 
as a Akp charge-density wave insulator in the literature [11]. We adopt this ter- 
minology here. Secondly, the quantum numbers of elementary excitations in the 
charge sector are different. Like for the half- filled case the elementary excitations in 
the charge sector are a soliton/antisoliton doublet, but now they carry fractional 
charge ±|. A simple way to see this is to recall that the conserved topological 
charge in the SGM is defined as 






27T 




dx dx^c • 



(15) 
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The soliton has topological charge —1 and the antisoliton +1. A simple calculation 
shows that the right moving fermion creates two solitons 

(a;)|0) . (16) 

This implies that fermion number one corresponds to topological charge 2 and 
hence solitons have fractional charge. The elementary excitations in the spin sector 
are again a pair of gapless, chargeless spin spinons. 

Ill The Method 

Due to spin charge separation a general local operator 0{t, x) can be represented 
as a product of a charge and a spin piece O = OcOg. As a result correlation 
functions factorize as well 

(O|0t(l,rr) 0(0)10) = c(O|0t(i,x) 0c(O)|O)c .(O|0l(i,x) 0,(O)|O), , (17) 

where 1 0)^,0 are the vacua in the spin and charge sectors respectively. The corre- 
lation function in the spin sector is easily evaluated as we are dealing with a free 
theory. 

Correlation Functions in the Charge Sector 

Let us now discuss how to calculate two point functions in the SGM describing 
the charge sector of the theory. One first constructs a spectral representation and 
then utilizes integrability to determine the matrix elements of the operator under 
consideration between the ground state and excited states. This method is known 
as the “Form Factor Bootstrap Approach” [12, 13]. In order to utilize the spectral 
representation we need to specify a basis of eigenstates of the Hamiltonian (7). 
Such a basis is given by scattering states of solitons, antisolitons and breathers. 
In order to distinguish these we introduce labels ■ . Bn , s,s. As usual for 

particles with relativistic dispersion, it is useful to introduce a rapidity variable 6 



to parametrize energy and momentum 




Es{e) 


= M coshO , Ps{0) = {M/vc) sinhO , 


(18) 


Es{e) 


= M coshO , Ps{0) = {M/vc)smhO , 


(19) 


EbM 


= Mn cosh 9 , Psr, {9) = (Mn/vc) sinh 9 , 


(20) 



where the breather gaps are given by (12). A basis of scattering states of soli- 
tons, antisolitons and breathers can be constructed by means of the Zamolodchi- 
kov-Faddeev (ZF) algebra, which is the extension of the algebra of creation and 
annihilation operators for free fermions or bosons to the case of interacting par- 
ticles with factorizable scattering. The ZF algebra is based on the knowledge of 
the exact spectrum and scattering matrix of the model [14]. For the SGM the 
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ZF operators (and their hermitian conjugates) satisfy the following 
commutation relations” 


“generalized 


Z'H0i)Z'=(^2) = 


syjiOi - e2)z^'-{e2)z^'^{e,) , 

Cl ,C 2 


(21) 


zl[e,)zl{e2) = 


zl,je2)zlj9,)sii:i{9^-02) , 


(22) 



z^^(0i)zl(02) = z^,(02)sf^;^>(e2 - 0 i)z^H^i) + ( 27 r)s^e^m - ^ 2 ) . ( 23 ) 

Here are the known (factorizable) two-particle scattering matrices [14] 

and Sj = s, s, Bi, . . . , Using the ZF operators a Fock space of states can be 

constructed as follows. The vacuum is defined by 

ZsA^M=0. (24) 

Multiparticle states are then obtained by acting with strings of creation operators 
Zj (Û) on the vacuum 



IÛ„...0l)e„...e,=Zl(Ûn)...Zl(û,)IO). 

In this basis the resolution of the identity is given by 

— — roo 



(25) 



1 = | 0 >( 0 | + EE/ 

n=l U 



(27r)^n! 






and the following spectral representation of the two-point function of the operator 
O holds 



{0{x,t)0\Q,Q)) = 



00 . 

EE/ 

77 = 1 



d6i . . . dOyi 
(27r)^n! 



exp - Æ;j«)|(0|0(0,0)|6I„ . . 



(27) 



Here Pj and Ej are given by 

Pj = — — sinh 9j , Ej = M^. cosh 6j ^ (28) 

'^c 

and the formfactors of the operator O are denoted by 

= (O|e»(O,O)|0„...0i),„ ..,. . (29) 



The form factors can be calculated by solving a Riemann- Hilbert problem involving 
the exact scattering matrix of the SGM and are known for many operators. 
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Our conventions in (28) are such that Mg = Mg = M and Mb^ — Mn- We 
are mainly interested in the Fourier transforms of the retarded space and time 
dependent correlation functions. These take the following general form 



oo „ 

\cj,q) = ~27r^Y,J 



dOi . . . dOn 
(27r)^n! 






sinh 0j/vc) 6{q + sinh 0jlv ^) -i 

LÜ — cosh 9j + iri to + Yj cosh 6j + ii] Ï 



(30) 



IV Optical Conductivity 

The optical conductivity was calculated in Refs [17, 18, 19, 21]. In the field-theory 
limit, the electrical current operator is given by 

J(t,x) = yfÂ'dt4>c. (31) 

where ^' > 0 is a non-universal constant. The expression for the current operator 
is the same for the half-filled and the quarter filled bands. As seen from Eq. (31), 
the current operator does not couple to the spin sector. This shows that spinons 
do not contribute to the optical conductivity in the field theory limit. Hence, the 
calculation of the optical conductivity has been reduced to the evaluation of the 
retarded current-current correlation function in the charge sector. The real part 
of the optical conductivity {uj > 0) has the following spectral representation 



a{u) = 



27r" 

Laoiü 



n=l ei ^ ' 



S(^MuLsinhOk)S{LJ cosh6lfc) . (32) 



In Refs. [12, 13] integral representations for the form factors of the current operator 
in the SGM were derived. Using these results we can determine the first few terms 
of the expansion (32). Prom (32) it is easy to see for any given frequency u only 
a finite number of intermediate states will contribute: the delta function forces 
the sum of single-particle gaps Ylj to be less than lj. Expansions of the form 
(32) are usually found to exhibit a rapid convergence, which can be understood in 
terms of phase space arguments [15, 16]. Therefore we expect that summing the 
first few terms in (32) will give us good results over a large frequency range. 

Using the transformation property of the current operator under charge con- 
jugation one finds that many of the form factors in (30) actually vanish. In particu- 
lar, only the “odd” breathers Bi, H3 , . . . (assuming they exist, i.e., P is sufficiently 
small) couple to the current operator. The first few non- vanishing terms of the 
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spectral representation (32) are 
a(uj) 

= X] ^H2rx-l(^) + ^ss(^) + CTBiB2(<^) + • • • (33) 

n=l 

Here A = A'vdLao and <75^(0;), cr55(o;) and cfbiB 2 {^) ^^e the contributions of the 
odd breathers, the soliton-antisoliton continuum and the B 1 B 2 breather-breather 
continuum respectively. The latter of course exists only ii N >2. We find 

^S2n-i(<^) = T72 f2n-lS{u - M2n-l) (34) 

m—1 

fm = sin(m7T^) JJ tan^(7m^/2) 

n=l 

r-:n( 2 H sinh(t(l - Q/2) sinh^(mt^/2) \ 

Jo i sinh(i^/2)cosh(i/2) sinhi )' ^ ’ 

The soliton-antisoliton contribution is [18] 



_ 4x/g;2 _ 4M20(g; - 2M) 

^SS ^ ^2 [cos( 7 t/^) -h COSh(^/^)j 

/ dt sinh[t(l — ^)/2] [1 — cos(t^/7r) cosht] 

^\Jo ^ sinh(t$/2) cosh(t/2) sinht 

where 6 = 2arccosh(cj/2Af). The result for the B 1 B 2 breather-breather continuum 
is given in Ref. [21]. As a function of the parameter (3, the optical conductivity 
behaves as follows. 

• 1 / 2 : 

In this regime the optical spectrum consists of a single “band” corresponding 
to (multi) soliton-antisoliton states above a threshold A = 2M. The absorp- 
tion band increases smoothly above the threshold A in a universal square 
root fashion 

a{(jj) oc Vuj — A for cu A'^ . (37) 

In Fig. 1 we plot the leading contributions for the case /3^ = 0.9. Clearly, the 
four-particle contribution is negligible at low frequencies. 

• 1/2 >/3^> 1/3: 

Here the optical spectrum contains one band and one excitonic breather peak 
below the optical gap A = 2M at the energy ujbi = Mi. The optical weight 
is progressively transferred from the band to the breather as 0"^ decreases 
down to 1/3. The absorption band again increases in a square-root fashion 
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Figure 1 : Optical conductivity for = 0.9. Shown are the dominant contributions 
at low frequencies: the soliton-antisoliton part ct 2 (oj) and the two soliton - two 
antisoliton contribution < 74 (cj). 



(37) above the threshold for all values of /3^ except = 1/2, where the 
breather peak merges with the band. In this case a (to) shows a square- root 
divergence at the absorption threshold 

(j{üü) oc r.....: for (jj {l3^ = 1/2) . (38) 

Vu; — A 

The field theory results discussed here have been compared to Dynamical Den- 
sity Matrix Renormalization Group (DDMRG) (see e.g. Ref. [20] and references 
therein) computations of cr{uj) for extended Hubbard models in Refs [17, 21, 22] 
and good agreement has been found in the appropriate regime of parameters. We 
note that the DDMRG method can also deal with parameter regimes in the un- 
derlying lattice model, to which field theory does not apply. Let us discuss the 
above results from the point of view of an application to optical conductivity mea- 
surements in the Bechgaard salts [23]. There it is found that up to 99% of the 
total spectral weight is concentrated in a finite- frequency feature, which has been 
attributed to Mott physics of the type discussed here [23]. A comparison of (33) 
to the experimental data gives satisfactory agreement at high frequencies, but the 
detailed peak structure at low frequencies is not reproduced [18]. A likely source 
for these differences is the interchain tunneling. 



V Spectral Function 
A. Half-Filled Mott Insulator 

The zero-temperature spectral function of the half-filled Mott insulator has been 
studied in many previous works. There have been extensive numerical studies on 
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finite size t-J and Hubbard models e.g. Refs [24, 5]. The limit where the single- 
particle gap is much larger than the bandwidth was treated in Refs [25]. This 
regime is complementary to the case we address here. The weak-coupling limit 
we are interested in was studied in Refs [26, 27, 28], where a conjecture for the 
spectral function was put forward. Here we derive these results by means of an 
exact, systematic method. In what follows we will for simplicity fix = 1, i.e. deal 
with the Hubbard model only. 



1 Zero Temperature 

The single particle Green’s function is calculated by following the steps outlined 
above [29]. The creation and annihilation operators for right and left moving 
fermions factorize into spin and charge pieces upon bosonization (3). The spin 
part is easily calculated: In imaginary time we have 

(exp(-^|4.. + 0.|) + (39) 

The correlation function in the charge sector is calculated by means of the form 
factor bootstrap approach. Taking into account only processes involving one soliton 
we obtain 



exp 






0c 




(40) 



where Zq ^ 0.9218 [30]. The corrections to (40) involve intermediate states with 
three particles and are negligible at long distances/low energies. In Fourier space 
we obtain after analytical continuation to real frequencies 



G(^\uj,kF^-q) ^ 



-Za 



CÜ -h VcQ 






(m -h \/ -b v‘^q‘^ — —— — —(uj-\-Vcq) 

\ / Vq “h Vg 



(41) 



We note that the charge velocity Vc is larger than the spin velocity Vg . The spectral 
function is obtained from the imaginary part of the single particle Green’s function 
(41). In the case Vg = Vc = v it takes the simple form 



Ar{u, kp + q) 



— — ImG^^^ (cj, /cf + q) 

7T 



Zq m Q{\u\- + v'^q^) 
Tï\u — vq\ — m? — v‘^q‘^ 



In Fig. 2 we plot the spectral function in the case Vg = 0 . 81^0 and j3 = 1 
both in a density plot and in a series of constant q scans. There is a continuum of 




Vol. 4, 2003 Dynamical Response of Quasi ID Mott Insulators 



S599 




Figure 2: Spectral function for a half- filled Mott insulator with Vs = O.Suc and 
13 = 1. 



states above the Mott gap, which is smallest at /ci?. The most striking feature is 
the presence of two distinct “peaks” dispersing with velocities Vs and Vc respec- 
tively. Most of the spectral weight is concentrated in these features, which are a 
direct manifestation of Spin-Charge Separation. The lower (higher) energy fea- 
ture corresponds to the situation where all the momentum is carried by the spin 
(charge) sector. Concomitantly the high/low energy feature is referred to as anti- 
holon/spinon peak. Neither peak is sharp but has intrinsic width (more precisely, 
they correspond to square root divergences). 



2 Finite Temperature T M 

It is possible to extend the results for the spectral function to small temperatures 
T < M [31]. The method is a straightforward generalization of the one we employed 
for T = 0. As T < M the effects of temperature on the charge piece of the 
correlation function are small, but the spin piece can be affected quite strongly 
because the spinons are gapless. The spectral function in the vicinity oï kp can be 
represented as 



Ar(w, kp + g) ~ 



/ oo 

dz 

-OO 




c(z),q-s{z))+e ^ , 

(43) 



where A = 



j 7tA Z 2 

Y Vs (27t)3 ’ 



c{z) = A cosh 2 :, s( 2 () = (A/us)sinhz and 






Re 






.LJ + Vsq 1 \ 

* 47tT ’ 2 J 



gs(u>,q) 



S(ui - Vsq). 
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Figure 3: Spectral function for a half-filled Mott insulator with Vg = 0.5vc and 
= 1 at temperatures T — 0.03M and T = 0.04 respectively. 



In Fig. 3 we plot the spectral function for Vg = O.Si’c and /3 = 1 aX temperatures of 
0.03 and 0.04 times the single particle gap. A significant temperature dependence 
of the “charge” peak is apparent. It gets damped rather strongly at temperatures 
that are still small compared to the gap. This may make an unambiguous detection 
of SC-separation by ARPES more difficult as the experiments are done at elevated 
temperatures (room temperature for Sr 2 Cu 03 ) in order to avoid charging effects. 

B. Quarter-Filled CDW Insulator 

The single- particle Green’s function in the quarter-filled case can be determined 
by the same method [32]. The spin sector is again gapless and the spin part of 
the Green’s function is easily determined. The charge piece is again analyzed by 
means of the form factor bootstrap approach. The difference to the half-filled case 
is that the charge piece of the single-electron operator (14) now couples to at least 
two (anti)solitons. Neglecting contributions of four or more particles in the charge 
sector we obtain the following result for the single-particle Green’s function in the 
vicinity oi kp 



G^^\iO,kp + q) = -Z 




de \G{29)\^ uj + Vcq 
a/ 2¥ yc(0) \/c^{e) - «2 



(c(0) + \/c^(6) - s^) 



{uj + VçqŸ 
a 



(44) 



where ^ c{6) = 2Mcosh^, a = {vc-\-Vg)/{vc-Vg). In Fig. 4 we plot 

the spectral function Ar{uj, kp q) as a function of u for Vg = O.St’c and different 
values of q. Clearly the spectral function is rather featureless and there are no 
singularities. Furthermore, in contrast to the half-filled Mott insulator discussed 
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above, there are no dispersing features associated separately with Vc and Vs- The 
absence of any distinct features is clearly related to the fact that the electron has 
“fallen apart” into at least three pieces. 

Just above the threshold at = 4M^ one can approximate 

\G{20)\ — \Ci sinh^l, so that for Vc = Vg 



AR{uj,q) 



rv 1 


f— I'l 


1 47tM \üü — vq\ 


K2M V 



(45) 



Thus the spectral weight increases linearly with s — 2 M above the threshold. The 




Figure 4: Afi{uj,kF -h ç) as a function of u/M for Vg = O.St’c. The curves for 
different q are offset. 



tunneling density of states for Vc = Vg is 



p{uj) = 




I 



arccosh(u;/2M) 



dO |G(26l)p 
V2cosh6»’ 



(46) 



and displays a roughly linear increase after an initial (w — behaviour just 

above the threshold at a; = 2M. 

Let us now turn to a comparison with experiments. PrBa 2 Cu 307 (P123) 
is a quarter-filled quasi- ID cuprate, to which the theory presented here may have 
some relevance. The ARPES data for P123 (Fig. 3 of [6]) show a single, very broad, 
dispersing feature that is asymmetric around kp. If we interpret the underlying 
increase in intensity in the data as background, the signal has a form similar to 
Fig. 4. In order to assess whether the theory presented here is indeed relevant to 
PI 23, it would be interesting to extract a value Apj^ for the gap from the ARPES 
data and compare it to gaps seen in optical measurements and the thermal 

( i ) 

activation gap A^^ extracted e.g. from the dc conductivity. The theory presented 
here predicts 





= A 



(i) 

PE • 



(47) 
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This is in contrast to the case of the half-filled Mott insulator, where one has 

=2A^^ =2A^y . (48) 

VI Luttinger’s Theorem 

In all cases we have discussed, the Green’s functions have branch cuts but no poles. 
In particular there are no poles at zero frequency and hence no Fermi surface. 
Nevertheless Luttinger’s theorem holds as we will now show. Luttinger’s theorem 
reads [33] 

N 
V 

where the integration is over the interior of the region defined by either singular- 
ities or zeroes of the single-particle Green’s function. The former is the case for 
a Fermi liquid whereas the latter is the case at hand. The equality (49) is inter- 
esting, because it implies that the integral on the right hand side is independent 
of electron-electron interactions. In a Fermi liquid this means that the volume of 
the Fermi surface is unaffected by interactions. The Green’s functions we have 
discussed above all fulfill (49) by virtue of them having zeroes at the position of 
the non-interacting Fermi surface. 

This property follows from (i) the fact that the spin sector is gapless, which 
implies that {Ra{r,x)Ll{0)) = 0 and (ii) Lorentz invariance of the low-energy 
effective theory, which implies that 

('ï-,(r,x)4-t(0)) - exp{±ict>)Tliry, <i> = R,L. (50) 

Here r and 0 are polar coordinates and 7Z denotes the radial part of the correlation 
function. As we are dealing with an insulating state we have 7l{r) oc exp(— Ar) at 
large distances and hence f dr 7Z{r)r is finite. Thus 

Gh,l(0,0)= f d0exp(ibz0) 

J —TT 

For a metallic state the r integral would diverge and the Green’s function would 
have a singularity rather than a zero. 



J dr 7^(r)r = 0. (51) 






G(0,/c)>0 



d^k 

(27t)^’ 



(49) 



VII Interchain Tunneling 



Let us now consider a quasi one dimensional situation of Hubbard chains weakly 
coupled by a long-ranged interchain tunneling 



H 



+ h.c. + uY^ 



n^a 



n 



(52) 
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Figure 5: Elements of the diagrammatic perturbation theory in the interchain 
tunneling. 



+ 







Figure 6: Diagrammatic expansion for the single-particle Green’s function of cou- 
pled chains. 



We allow the interchain tunneling to be long-ranged in order to be able to carry 
out a controlled expansion in the case where the Fourier transform i±(k) of the 
interchain tunneling becomes of the same order as the ID Mott gap M (see the 
discussion below). For simplicity we take t_L long-ranged only in the direction 
perpendicular to the chains, although it is straightforward to generalize all formulas 
to the an interchain tunneling of the form 

E . (53) 



VII.l A. Expansion around uncoupled chains 

Following the analogous treatment for the case of coupled Luttinger liquids [34, 35] 
we take the interchain tunneling into account in a perturbative expansion. The 
building blocks of this expansion are the n-point functions of fermion operators 
for uncoupled chains, which are represented pictorially in Fig. 5. The full single- 
particle Green’s function is given in terms of a diagrammatic expansion, the first 
few terms of which are shown in Fig. 6 

Unlike for Luttinger liquids it is extremely difficult to determine four-point 
functions for ID Mott insulators. On the other hand, it is trivial to sum all dia- 
grams involving only two-point functions of uncoupled chains. This approximation 
is known as RPA and goes back a long way [36] . The small parameter making RPA 
a controlled approximation for any form of the interchain tunneling is the ratio of 
the interchain tunneling to either the temperature or the Mott gap. Within RPA 
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Figure 7: Spectral Function for fixed k. 



the single-particle Green’s function is given by 



G3d(^5 Q, k) 

t±(k) 



Gip{u^ q) 

1 - tp{k) GiD{(^,q) 

exp(zfc • [Ri - Rm]) . 

m 



(54) 



The RPA Green’s function Gsp^cu^q, k) has the interesting property that it has a 
pole, which corresponds to a bound state of an antiholon and a spinon with the 
quantum numbers of an electron. For small interchain tunneling this bound state 
still has a gap. Fig. 7 is a density plot of G 3 d(^x;, q,k) as a function of lu and q for 
a fixed value of k and hence a fixed value of t±{k). At energies above the Mott 
gap there is a continuum of states, which is similar in nature to the result for 
uncoupled chains. In the Mott gap the coherent electronic mode is visible. 



B. Formation of a Fermi surface 

As long as the “binding energy” of the electronic bound state is small, RPA is 
a controlled approximation for any form of the interchain tunneling [35]. How- 
ever, in the most interesting situation when the gap of the bound states becomes 
very small, RPA becomes uncontrolled; there is no longer any small expansion 
parameter for a generic t±{k). An exception is the case of a long-ranged interchain 
tunneling: here the support of ^_l(A;) becomes very small, so that any integration 
over the transverse momentum generates a small volume factor proportional to the 
inverse range of the hopping. Recalling that RPA takes into account all terms not 
involving any integrations over the transverse hopping (i.e. “loops”), we conclude 
that RPA is the leading term in a controlled loop expansion. 

Increasing t±{k) in RPA reduces the gap of the electronic bound state, until 
it eventually vanishes and electron and hole pockets are formed: we have crossed 
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Figure 8: Fermi surface predicted by the RPA for a 2D square lattice with nearest- 
neighbour interchain tunneling. 



over from weakly coupled ID Mott insulators to an anisotropic Fermi liquid. As 
an example, let us consider a 2D square lattice with interchain tunneling between 
nearest neighbour chain only. Here RPA is an uncontrolled approximation, but we 
still find it instructive to discuss its predictions. 

In this case electron pockets are formed in the vicinity of the points (±/cf, 0 ) 
and hole pockets form around (±/ci?, diTr). In the electron-hole symmetric case the 
volume of the electron pockets is precisely the same as the volume of the hole 
pockets. 

We note that the “Chain-DMFT” approach (which approximates the coupled 
chains system by a single chain in a self-consistent bath) gives an open Fermi 
surface [37]. 

VIII Conclusions 

We have discussed the dynamical response of two strongly correlated one dimen- 
sional insulators: the half-filled Mott insulator and the quarter-filled 4/ci?-CDW 
insulator. In the low-energy limit both of them exhibit spin-charge separation and 
it is possible to determine the dynamical response by means of exact methods. 
The optical conductivity is found to be insensitive to the spin degrees of freedom 
and also does not differentiate between the half-filled Mott and the quarter-filled 
Akp CDW insulator. On the other hand the single- particle Green’s function (as 
well as the dynamical density-density réponse function [38]) probe the physics of 
both spin and charge sectors and are quite different in the two cases. We also have 
investigated the effects of interchain tunneling t± in a system of infinitely many 
Mott-insulating chains. We have shown that any t± ^ 0 leads to the formation of a 
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gapped electronic bound state below the Mott gap. Increasing t± reduces the gap 
of this bound state until eventually a Fermi surface in the form of closed electron 
and hole pockets is generated. 
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Mesoscopic Fluctuations of Co-Tuuneling 
and Kondo Effect in Quantum Dots 

L.L Glazman 



Abstract. Conductance of a quantum dot in the Coulomb blockade regime is dis- 
cussed. At moderately low temperatures, the thermally-assisted transport of elec- 
trons through the dot gives way to elastic co-tunneling [1]. The amplitude of the 
elastic co-tunneling is sensitive to the specific structure of electron wave functions 
in a given sample. The conductance exhibits strong mesoscopic fluctuations [2]. 
Statistical properties of these fluctuations are reviewed in this lecture. At lower 
temperatures, conductance through a dot is altered by the Kondo effect, if the 
number of electrons on the dot is odd. We describe the universal features of the 
Kondo conductance [3, 4], and briefly discuss the manifestations of this effect in 
the limiting cases of weak and strong [5] dot-lead coupling. 



1 Introduction 

The effect of charging on the electron transport was conjectured in order to inter- 
pret the early data [6, 7] on the in-plane hopping conductivity of granular films. 
There, the activation energy extracted from the temperature dependence of con- 
ductivity was associated with the charging of individual grains composing the 
film. A clear demonstration of the role of charging, however, appeared later in the 
experiments of Giaever and Zeller [8] and Lambe and Jaklevic [9] on “vertical” 
tunneling through a layer of grains. 

In Ref. [8] tunneling through a layer of Sn particles coated by a thin insulator 
and sandwiched between two aluminum electrodes was studied. The authors [8] 
measured the I-V characteristics of the junction both with and without magnetic 
field applied. That allowed them to exclude superconductivity as a source of non- 
linearity, and to associate the observed finite-bias offset in the I-V characteristics 
with the effect of charging. They have realized, that at low temperatures electrons 
must tunnel through a grain one by one. A single additional electron localized on 
the grain in the course of tunneling, raises the potential of the grain. This prevents 
other electrons from hopping onto the grain before that electron have finished its 
path between the leads, and the grain is discharged. Such a “blockade” is effective 
only at low temperatures, when the characteristic energy Ec — e^/2C of a grain 
carrying one extra electron exceeds temperature, Ec T (hereinafter C is the 
total capacitance of the grain, and temperature T is measured in energy units). 

A comprehensive theory of the effect of charging on tunneling was developed 
in papers of Shekhter [10] and Kulik and Shekhter [11]. There the events of tun- 
neling to and from the grains were considered as sequential processes. In addition, 
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Cl C 

Figure 1: Sketches of the I — V characteristics for tunneling (a) through a single 
grain at C\ C 2 and R\ and (b) through a layer of grains. In the inset: the 

equivalent circuit for a single grain attached to the leads by two tunnel junctions. 



all the electron-electron interaction was replaced solely by the charging energy. In 
the course of electron tunneling through the grain, its charge is varied by ±e. In 
Ref. [11] rate equations controlling the probabilities of various discrete values of 
the grain charge were derived. These equations formed the basis for calculation 
of non-linear I-V characteristics. In the case of tunneling through a single grain, 
the I-V characteristic has a step-like structure with alternating intervals of low 
and high differential conductance, see Fig. la. Tunneling through many grains in 
parallel smears out the structure in the I-V characteristic. However, the average 
charging energy yields a characteristic offset of the high-voltage linear parts of the 
characteristic, see Fig. lb. 

Tunneling through a single grain may be studied, for instance, by means of 
the STM technique [12, 13]. At higher biases, V e/Ci, 2 , many states differing 
by the value of charge become available for electron tunneling. This leads to a 
progressive smearing of the higher steps in the I-V curve, even in the absence 
of the electron heating effect. On the basis of the rate equations for a strongly 
asymmetric setup, R 2 ^ Ri in the inset in Fig. 1, one can expect that about 
C 2 R 2 HC 1 -h C 2 )Ri steps are resolved [14]. 

The investigation of single electron tunneling through a metallic grain in a 
two-terminal device inevitably involves applying a relatively high bias eV ^ jC 
to the device, and therefore may easily result in a significant departure of the sys- 
tem from the thermal equilibrium. A device with a third terminal, gate, is free from 
that drawback. The gate is coupled to the grain only capacitively, and it allows 
to vary the electrostatic potential on the grain without perturbing its equilibrium 
state. By tuning the gate voltage of such a three-terminal device, commonly re- 
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Figure 2: An equivalent circuit for a single-electron transistor. The total capaci- 
tance of the grain in this circuit is represented by the sum of the gate and junctions 
capacitances, C = + +C 2 . A bias can be applied by disbalancing the voltages 

Vi and V 2 - 



ferred to as single electron transistor [15], one controls the Coulomb blockade of 
tunneling, without varying the (small) bias applied to the leads. Experimental im- 
plementation of such a device was reported first by Fulton and Dolan [16]. In this 
experiment, a periodic modulation of the device resistance with the variation of 
the gate potential was demonstrated. 

At some special gate voltages the Coulomb blockade is lifted. Indeed, the 
two states of the grain with the charges eN = en and eN = e(n + 1), where n is 
integer, have the same electrostatic energy 

Hc=Ec[N-Afy, ( 1 ) 

if the dimensionless gate voltage J\f = CgVgje is half-integer, z.e.. Coulomb block- 
ade is lifted at = Af* with Af* = n + Here Ec = e^/2C is the charging 
energy, C and Cg are the total capacitance of the grain and its capacitance to the 
gate, respectively. 

Electron transport through the grain is activationless at the degeneracy 
points Af — AA*. For the gate voltages around these points, and at temperatures 
T Ec^ the rate equations [11] yield [17] for the linear conductance 



G{N, T) 



1 Ec{Af-Af*)/T 
2Roo smh[Ec{Af-Af^)/TY 



( 2 ) 



Here Rqq = {G\ G 2 )/G[G 2 is the high-temperature (T ^ Ec) resistance of the 

device, Gi and G 2 being the conductances of the two tunnel junctions connecting 
the grain to the leads, see Fig. 2. According to Eq. (2), all the Coulomb blockade 
peaks are characterized by a single value of the low- temperature conductance, 
G{Af*) = l/2i^oo- In the Coulomb blockade valleys {Af ^ Af*), conductance falls 
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Figure 3: Sketches of the temperature dependence of the linear conductance 
G{J\f,T) in the Coulomb blockade peaks J\f = Af* and valleys {M / Af*). Ap- 
plication of the rate equations to a model characterized by continuous electron 
spectrum in the grain, yields a periodic dependence G{Af); all Coulomb blockade 
peaks have the same height. 



off exponentially with the decreasing temperature (Fig. 3); again, all the valleys 
behave exactly the same way. 

The derivation of Eq. (2) disregards entirely the effects of coherent propaga- 
tion of electrons across the grain, and effects of spectrum discreteness for electrons 
within the grain. This sets limitations for the application of the simple rate equa- 
tions theory [17]. The coherence would result in weak- localization corrections to 
the conductance through a grain, even if one dispenses with the spectrum dis- 
creteness. Such corrections are small in the case of multi-channel junctions, no 
matter how big is the coherence length (the number of channels in a junction 
can be estimated as its area in units Xp set by the electron Fermi wavelength). 
The discreteness of electron spectrum destroys the strict periodicity of the G{J\f) 
dependence. However, this discreteness is not important as long as the temper- 
ature exceeds the spacing SE between the quasiparticle energy levels in the dot. 
These two conditions are met usually in experiments with lithographically pre- 
pared metallic islands separated from leads by oxide tunnel barriers. Relatively 
large area of the barriers results in a large number of channels in the tunnel junc- 
tions; the Fermi wavelength in a metal is much smaller than the linear size of 
a sub-micron grain, so the condition ÔE T is met even for the lowest attain- 
able temperatures. An example of the G{Af, T) behavior well described by Eq. (2) 
can be found in the data of Ref. [18] for the high-resistance devices (sample 1 on 
Fig. 3 of Ref. [18]). Deviations from the rate equation theory occur at smaller re- 
sistance of the junctions (Ri ,2 ~ when the treatment of the grain charge as 

a classical discrete quantity becomes inadequate (data for sample 3 in Ref. [18]). 

Single-electron tunneling through metallic nanoparticles and quantum dots 
formed in semiconductor nanostructures, however, demonstrates significant de- 
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viations from the above-described behavior, even in the experiments with high- 
resistance junctions. The reason is that the level spacing for these objects is con- 
siderably larger, and the condition SE^ T is within the reach of an experiment. 
In addition, in semiconductor devices the number of channels in tunnel junctions 
connecting a quantum dot to the leads is small and controllable. Mesoscopic fluctu- 
ations of the valley conductance become stronger for smaller number of channels. 
We start the discussion of single-electron tunneling through quantum dots from 
the estimates of the relevant energy scales. 



2 Energy Scales Involved in the Single-Electron Tunneling Effects 



As it was discussed above, the Coulomb blockade of tunneling through a conducting 
grain becomes effective at temperatures T Ec> In the Coulomb blockade regime, 
the linear conductance is finite only at the gate voltages allowing for degeneracy 
of the ground state with regard to addition of a single electron. The degeneracy 
condition, in a general form, depends not only on the charging energy, but also on 
the energy of spatial quantization of an electron confined to the dot. In a disordered 
grain, or irregularly-shaped quantum dot, we expect no geometrical symmetries, 
and thus assume the single-particle electron spectrum to be non-degenerate. The 
average level spacing in such a spectrum is defined by the density of states Ud in 
the material, and by the volume of the grain: 

ÔE - (3) 



Here L is the characteristic linear size of the grain, and d is the dimensionality of 
the system (d = 2 for a quantum dot formed in a two-dimensional electron gas at 
the interface of a semiconductor heterostructure, and d = 3 for a metallic nanopar- 
ticle; d = 1 for a one-dimensional conductor, like a segment of carbon nanotube). 
For a mesoscopic (A^ L) conductor, the level spacing is small compared to the 
charging energy. Indeed, using the estimate Ec ~ we find: 



ÔE kTivf ( ^ ^ f 



(4) 



where vf is the Fermi velocity, k is the dielectric constant, and Vg is the conven- 
tional gas parameter characterizing the electron-electron interaction in a non-ideal 
Fermi gas. Except an exotic case of an extremely weak interaction, having a large 
number of electrons in a quantum dot or grain. Ne ~ {L/XfY ^ 1? guarantees 
the smallness of the ratio 6E/Ec in dimensions d = 2 and 3. For the smallest 
quantum dots formed in a GaAs heterostructure, SE/Ec ~ 0.1; for a metallic 
nanoparticle with L ~ 5nm this ratio is about 0.01, see, e.g., Refs. [19, 20]. We 
should mention also that in the d — 1 case, the estimate (4) is of little help, as 
even a weak electron-electron interaction results in a formation of a Luttinger liq- 
uid [21], significantly affecting the nomenclature of the elementary excitations. In 
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Figure 4: A sudden shift V of the potential on a dot results in a transient current 
/, while the quasi-discrete levels risen above the Fermi level depopulate. Consid- 
eration of this current allows one to relate the junction conductance to the levels 
spacing ôE and width F. 



the ca.se of a single-mode finite-length Luttinger liquid, the elementary excitations 
can be viewed as Id plasmon waves in a confined geometry, so the corresponding 
level spacing is ÔE ^ Ec^ 

Tunneling between the grain and leads results in broadening of the discrete 
levels. The characteristic width of the levels F can be related to the level spacing 
ÔE and to the high-temperature conductance of the junctions connecting grain 
to the leads. Indeed, consider a grain attached to one lead, and suddenly biased 
by voltage V, see Fig. 4. Such a bias will force eVjôE occupied levels in the 
grain to cross the Fermi level. An electron escapes from an occupied level over 
time Tesc ~ h/r. In the absence of Coulomb blockade, the escapes from different 
levels are uncorrelated with each other, and the current of electrons leaving the 
grain right after the voltage pulse, is 7 ~ {e/resc){eV/SE) ~ {e^ /h){r/SE)V . This 
estimate allows one to relate the ratio T/ÔE to the junction conductance defined as 
G = I /V. Proper accounting for the numerical factors yields the following relation 
between the partial width F^ of the level and the linear conductance Gi of the 
junction number i: 

(r.) = ^SE. (5) 

Note that for each particular level n in the dot, widths F^ refiect the magnitudes 
of the electron eigenfunctions vicinities of the respective junc- 

tions; (...) represents averaging over many discrete eigenstates in the grain. Well- 
developed Coulomb blockade occurs at small junction conductances, G^ <C e‘^/h. 
Therefore, in the main part of this lecture we will assume the following hierarchy 
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of the energy scales: 



r < (5E < Ec, 



( 6 ) 



where T = We start with the description of a theory of an isolated quantum 

dot (F = 0). 



3 The Constant Interaction Model and its Justification 



A simple description of the confined electron system in a dot is possible if a number 
of conditions is met: (z) the electron-electron interaction within the dot should be 
not too strong, ^ 1, so that the Fermi liquid model is applicable; (zz) there 
should be no degeneracies in the spectrum of the confined quasiparticles with 
energies near the Fermi level (this is satisfied, in general, for a chaotic electron 
motion); {iii) the conductance within the dot should correspond to the metallic 
regime {i.e., the electron mean free path must exceed considerably the Fermi wave 
length) . 



The dimensionless conductance g oîa, dot is well-defined, if due to the disorder 
or irregular shape of the dot the motion of an electron within the dot is chaotic. In 
a dot of a linear size L the dimensionless conductance g is related to the Thouless 
energy [22] Er and level spacing [see Eq. (3)] as g ^ EtISE. Here the Thouless 
energy is 



hD 



or Et — 



hvT 



(7) 



for the cases of elastic mean free path I shorter and longer than L, respectively; 
D is the diffusion constant in a disordered dot. If L ^ ^ in a 2d dot, then its 
dimensionless conductance is g ^ l/Xp- For a “ballistic” quantum dot H/Et equals 
the time it takes for an electron to traverse the dot; conductance is obviously large 
in this case, g L/Xp. 

If the conditions (z)-(zzz) are met, then the Random Matrix Theory Hamilto- 
nian [23] of non-interacting quasiparticles is a good starting point for describing 

the dot, 

= (8) 

a, 7 



Elements Ha'y of tho Hermitian matrix in Eq. (8) belong to a Gaussian ensemble 
of random real (GOE) or complex (GUE) variables. The matrix elements do not 
depend on spin, and therefore each eigenvalue of the single-particle Hamilto- 
nian represents a spin-degenerate orbital level. The spacings |^n+i — ^n\ obey the 
Wigner-Dyson distribution [24]; the average value of |^n+i — Çn| is SE. 

The two-particle interaction has the form 



Hint — ^ ' 05 , 0-1 5 



(9) 
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with the matrix elements Ha/d'yô depending on the electron-electron interaction 
potential V{r\ — r 2 ) and on the chosen basis of orbital states (j)^, 

'HajS'rô = j dfidr2V{fi - 'r2)<^a(n)<^/3(^2)^* (10) 

It turns out [25, 26, 27, 28, 29] that the majority of these matrix elements are 
small if p ^ 1, and only relatively few “most diagonal” elements remain finite in 
the limit g ^ oo. In other words, the Hamiltonian (10) can be separated in two 
pieces: 

= ( 11 ) 

The first term here is universal: it does not depend on the geometry of the dot, or 
on the realization of the disorder. The second term consists of several parts, each 
proportional to some power of 1/^; so is small and sample-specific. 

The form of the universal part of the interaction Hamiltonian can be estab- 
lished from the symmetry requirement [30]. Indeed, in view of the invariance of 
the distribution function of the Random Matrix Hamiltonian with respect to an 
arbitrary orthogonal (for GOE) or unitary (in the case of GUE) transformation, 
the term should consist of operators invariant under such transformations. 
There are three such operators, bilinear in 'tpa,a and 

ct,a a ct 

Operator N is the total number of electrons, S is the total spin of the dot; the 
“superconducting pair” operator T is needed for the description of the Cooper 
instability in grains of superconducting materials. Because of the pair-wise nature 
of interaction between the particles (9), the universal Hamiltonian is quadratic in 

terms of operators T, 5, and N: 

2 *^2 

hI°1=Ec{n-M) +Js{s) (13) 

Note that the first term in Eq. (13) is just the charging energy (1). The second term 
represents the exchange energy of a dot. As long as the dot is isolated from the 
leads, both its charge and spin are conserving quantities, because the corresponding 
operators commute with the non-interacting part (8) of the Hamiltonian. On the 
contrary, operator T does not commute with Hf, so an isolated dot in a ground 
state cannot have a non-zero average superconducting order parameter. 

The exchange integral Js is small in the case of weak electron-electron in- 
teraction: Js ^ TgSE (there is an additional ln(l/rs) factor in this estimate in 
the case of a 2d dot). Smallness of the ratio Js/^E guarantees the absence of a 
macroscopic (proportional to the volume of the dot) spin in the ground state, in 
accordance with the well-known Stoner criterion for the itinerant magnetism [31]. 
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If all the orbital levels would be equidistant, then the spin of an even-electron 
state should be zero, while an odd-electron state should have spin 1/2. However, 
the level spacings |^n+i — $n| are random. If the spacing between some orbital levels 
is accidentally small, the dot may acquire a spin [32] exceeding 1/2. In particular, 
a dot with an even number of electrons may have spin 1 in the ground state [33]. 
At small rg, higher spins may occur only if several levels come very close to each 
other, which is quite a rare event [34]. 

The third term in the Hamiltonian (13) is renormalized to zero in the case 
of repulsive interaction, i.e., Jc/ôE ^ 0 in a large {ÔE 0) dot. For an at- 
tractive interaction, Haa-f'y < 0, the renormalization enhances Ac, signaling the 
superconducting instability. In a large grain with SE significantly smaller than 
the superconducting gap A in the bulk material, the introduced energy constant 
Jc —ÔE. The variation of Jc with the decreasing size of the grain (ie., with the 
increasing ratio ÔE/A) was studied in Refs. [35, 36, 37]. 

We will not discuss superconducting grains here, and set Jc — 0 in the re- 
mainder of this paper. If one disregards also the rare configurations in which two 
orbital levels come very close to each other, then the higher spin states can be 
also thrown away. The result of these simplifications is the so-called Constant 
Interaction model, 

Hci = Ec (^N - J\f^ + iV = ^ aJ^^Uncr , (14) 

na no- 

which is widely used in the analysis of the experimental data. Note that the single- 
particle level spacings |fn+i ~ ^ ^E here are small compared to the charging 

energy, and each state is spin-degenerate. The eigenfunctions are random, 

and, in the leading order in 1 / do not correlate with each other and satisfy the 
Porter-Thomas distribution [24]. 

The non-universal correction to the Hamiltonian (13) can be split in two 
terms, The leading term here, comes from 

the joint effect of the interaction and of the electron confinement in dot [38] , this 
correction is proportional to 1/ The corresponding term in depends also 
on the gate voltage A7. In the simplest case of a distant gate, this term can be cast 
into the form [30] 

F(i/V 5) = (iV - V) y] (15) 

a, 7 

The average value of the random matrix elements Xot (3 is zero, while their variance 
is given by 

= (16) 

Here ~ 1 is a numerical coefficient which depends on the details of the potential 
confining the electrons [30, 38]. As one can see, the correction (15) causes some 
shifts of the single-particle energy levels described by Hamiltonian (14) of the 




S618 



L.I. Glazman 



Ann. Henri Poincaré 



Constant Interaction model. This shifts however, at large conductance of the dot 
{g 1) are small compared with the level spacing ÔE, which gives additional 
support for the Constant Interaction model. 

Next in the magnitude mesoscopic term in the Hamiltonian has the 

standard four-fermion structure, 

= IY. (17) 

a/3j0 

Here the matrix elements have non-zero average values, 

= &2y {SasS 0 ^ + S^^S^s) ■ (18) 

The amplitude of the mesoscopic fluctuations 6 Ha( 3 ^s of these matrix elements is 
of the order of their average [25, 26, 27, 28, 29], 




Here the numerical constants 62 ~ 1 and 63 ~ 1 depend on the details of the 
dynamics of electron motion within the dot [25]. This part of the Hamiltonian 
determines the inelastic electron relaxation within the dot. 

Discussing electron transport through a quantum dot in the following sec- 
tions, we will concentrate on the “conventional” case, described by the Constant 
Interaction model (14). 

4 Activationless Transport through a Blockaded Quantum Dot 

According to the rate equations theory [11, 17], at low temperatures, T Ec, 
conduction through the dot in the Coulomb blockade valleys is exponentially sup- 
pressed. This suppression occurs because the process of electron transport through 
the dot involves a real state in which the charge of the dot differs by the one- 
electron charge e from the thermodynamically most probable value. The ther- 
modynamic probability of such a fluctuation is ~ exp[—Ec\Àf — A^^l/T], which 
explains the conductance suppression, see Eq. (2). Going beyond the lowest-order 
perturbation theory in conductances Gi and G 2 allows one to consider processes 
in which the quantum states of the dot carrying a “wrong” charge participate 
only as virtual states in the tunneling process. Such higher-order contribution to 
the tunneling conductance was envisioned in the insightful paper of Giaever and 
Zeller [8]. The first quantitative theory of this effect, however, was developed much 
later [1, 39]. The leading contributions to the activationless transport, according 
to Refs. [1, 39], are provided by the processes of inelastic and elastic co-tunneling. 

Unlike the sequential tunneling, in the co-tunneling mechanism, the events 
of electron tunneling from one of the leads into the dot, and tunneling out from 
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Figure 5: Inelastic co-tunneling, (a): The virtual state is formed after step 1 of the 
process is accomplished, and the dot acquires an extra electron. In step 2, some 
other electron tunnels out. (The reverted sequence of processes also contributes to 
the tunneling amplitude.) (b): In the final state, a charge e is transferred from the 
left lead to the right one, and an electron- hole excitation is left in the quantum 
dot. 



the dot into the other lead occur as a single quantum process. There exists also a 
complementary process of co-tunneling, in which the order of the tunneling events 
is reversed. The state of the dot with the “wrong” charge (one extra electron or 
hole) appears only as an intermediate (virtual) state in such processes. 



4.1 Inelastic Co- Tunneling 

In the inelastic co-tunneling mechanism, the electron which enters the dot occupies 
one of the empty levels of spatial quantization, and the electron that leaves the 
dot, vacates one of the other levels of spatial quantization, see Fig. 5. Therefore, 
the process of inelastic co-tunneling results in the transfer of charge e between the 
leads and by a simultaneous creation of an electron-hole pair in the dot. 

The amplitude of the co-tunneling process may be calculated by means of per- 
turbation theory in the tunnel Hamiltonian Hr describing the electron transport 
between the dot and leads: 

Ht = ^ ^ ^ ^ + tp^aJ^Up) . (20) 

k,n p,ri 



Here and are the creation operators describing electrons in the leads, and tkn, 
tpn are the tunneling matrix elements connecting state n in the dot with the states 
k and p in the two opposite leads. The typical values of these matrix elements can 
be related to the junctions conductances. 



(\tkn\^) 



(ri) 

270/1 



hGi 

2776 ^ 1/1 



ÔE, 






M. 

2tti'2 



nCi 

2TTe^V2 



5E, 



(21) 



where 1 / 1^2 are the densities of states in the leads. 
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For simplicity, we will illustrate here the estimate of the inelastic co-tunneling 
conductance deep in the Coulomb blockade valley, i.e., at almost integer A/*. Sup- 
pose the energy of the incoming electron e, measured from the Fermi level, is 
confined by condition: 

e < Ec. (22) 

Then the energy deficit of the virtual state involved in the co-tunneling process is 
close to Ec- The amplitude A[n of the inelastic transition is 

An = (23) 



In such a transition, the initial state has an extra electron in the single-particle 
state A:, and the final state has an extra electron in the opposite lead (state p) 
and an electron-hole pair in the dot (discrete state m is occupied, and state I 
is empty). The number of final states available at given initial energy £ can be 
estimated from the phase space argument, familiar from the calculation of the 
lifetime of a quasiparticle in the Fermi liquid [40]. If the initial electron energy 
£ ^ ÔE, then the number of final states available is ~ £^ j^E. Using this estimate 
and Eqs. (21), (23), we can find now, up to a numerical factor, the conductance 
for the inelastic process: 



Gin — 





47rGiG2 f tV 

3 eyn \Ec) ■ 



(24) 



(The numerical constants here are fixed so that the final result coincides with 
the result of rigorous calculation [1]). A comparison of Gin with the result of 
the rate equations theory (2) shows that the inelastic co-tunneling takes over the 
thermally-activated hopping at moderately low temperatures 



T<T,^ = Ec 



In 



jU 

G1+G2 



n-l 



(25) 



The smallest energy of the electron-hole pair is of the order of 5E. At temperatures 
below that threshold the contribution of the inelastic co-tunneling mechanism to 
the conductance becomes exponentially small. It turns out, however, that even at 
much higher temperature this mechanism becomes less effective than the elastic 
co-tunneling. 



4.2 Elastic Co- Tunneling. Mesoscopic Fluctuations of the Conductance in 
the Coulomb Blockade Valleys 

In the process of elastic co-tunneling, an electron tunnels in and out of the dot 
to/from the same level of spatial quantization. Therefore, no electron-hole pairs 
are created in such a process: after the charge is transferred between the leads, the 
quantum dot remains in its initial state. 
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Estimating the elastic co-tunneling contribution to the conductance we will 
consider the edge of a Coulomb-blockade valley, 

< V - V* < 1/2. (26) 

This will allow us to account only for the hole-like contributions to the tunneling 
amplitude, see Fig. 6. (The general case is qualitatively similar to this simplified 
one.) Each of these contributions utilizes one of the discrete levels as a virtual 
state for tunneling: 



A 



n 



tknt 



♦ 

pn 



2Ec{Af-Af^)^\U' 



Cn<0. 



(27) 



To derive the energy denominators here, we have utilized the Constant Interaction 
model (14); the quasiparticle energies are measured from the Fermi level. The 
total tunneling amplitude is the sum of the partial amplitudes An. The elastic 
co-tunneling conductance can be calculated then as 




n 



Im 



(28) 



If the junctions to the dot are point contacts, then Gq\ exhibits strong mesoscopic 
fluctuations. Indeed, tunneling matrix elements entering Eq. (27) depend on the 
values of the electron wave functions at the points of contacts, tkn oc 
tpn OC (Pn(f^ 2 )- As it was discussed in Section 3, the electron eigenfunctions in the 
dot are random and uncorrelated. Therefore, the partial amplitudes 



An — ^ \/ G1G2 



ÔE 



<Pn{ri)(^n{r2) 



2Ec{M-Af^) + \U i\cpn\^) 



(29) 



are random and uncorrelated as well, and {An) = 0. 

Now it is clear that only diagonal terms {I = m) in the double-sum of Eq. (28) 
contribute to the average conductance (Gei). The sum of diagonal terms (|yInP) is 
converging, and the characteristic number of terms contributing to it, ~ Ec\M — 
J\f'"\/8E^ is large under the conditions (26). The resulting average contribution to 
the conductance is [1] 



(v^ + V*-V+l) • 

(Here both the hole-like and electron-like contributions are taken into account.) 

Comparing Eq. (30) with Eq. (24), we see that near the bottom of the valleys 
the elastic co-tunneling dominates the electron transport already at temperatures 



T<T,, = ^/Ec5E, 



(31) 
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Figure 6: Elastic co-tunneling. Under the condition (26), the hole-like processes 
dominate: first an electron is extracted from the dot [transition (1)], and then the 
resulting hole is filled by an electron from the other lead [transition (2)]. The 
special role of higher-order tunneling through the top-most filled level in the case 
of an odd number of electrons on the dot will be discussed in Section 5. 



which may exceed significantly the level spacing. 

All the off-diagonal terms (/ ^ m) in Eq. (28) contribute to the mesoscopic 
fluctuations of the conductance. Given the random Gaussian behavior of the am- 
plitudes (29), it is obvious that the variance of the conductance is of the 

order of (Gei)^. The exact relation [2] between these quantities 

{5Gl) = -^{G,,Ÿ (32) 

depends on the existence of the time reversal symmetry in the system. In zero 
magnetic field such symmetry is present, and the wave functions in the dot are 
described by the orthogonal statistical ensemble (GOE), (3=1. With the increase 
of the magnetic field, all states contributing to the elastic co-tunneling eventually 
cross over into the unitary ensemble (GUE), and j3 = 2. 

The structure of the electron wave functions in the dot, and the sample- 
specific value of SGei is affected by a relatively weak magnetic field B, An increase 
of the field from B to B 3 - -Bcorr results in scrambling of the wave functions; the 
corresponding random values 0Gei{B) and 0Ge\{B + Bcorr) become uncorrelated 
at a sufficiently large Bcorr- The characteristic field increment Bcorr needed for 
suppression of correlations, can be rigorously defined after the correlation function 
{ÔGe\{Bi)ÔGei{B 2 )) is calculated [2]. Here we will give only an estimate of Bcorr 
for a disordered quantum dot, appealing to the semi-classical picture of electron 
motion. 

The characteristic time r the dot spends in the virtual state during the pro- 
cess of elastic co- tunneling is determined by the energy deficit of such a state, 
T ~ h/[Ec\JV — A/"* I]. During this time, the trajectory of the tunneling electron 
would cover an area ~ Dt^ if there would be no boundaries of the dot (here D is the 
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electron diffusion constant in the dot). Because of the finite linear size of the dot 
L, the trajectory winds inside it, approximately rj = Dr/L? times. The winding 
direction is random, therefore the effective area 5eff under the electron trajectory 
is determined by the fluctuation of 77 ; the typical value of this area is 5eff ~ 
Introducing here Thouless energy (7), we find 

The electron wave functions which are important for the elastic co- tunneling vary 
substantially if the magnetic flux through the typical trajectory is increased by 
one quantum, BcowSeH ^ ^o- This relation yields: 



^corr(A^) 



$0 Iec\M-X*\ 

L^\j Et 



(33) 



Note that the correlation field depends on the gate voltage. It is increasing while 
the gate voltage is approaching the bottom of the valley. This can be understood 
as the result of the decrease of the time r allowed for the virtual localization of an 
electron in the dot. The shorter this time, the stronger field should be applied in 
order to affect the phases of the partial tunneling amplitudes (29). 

The fact that the number of discrete single-particle levels participating in 
the co-tunneling process is large, affects also the correlation of the conductance 
fluctuations in different valleys. Indeed, it is necessary to shift the gate voltage 
A/" by ~ Ecj^E in order to replace all the discrete eigenstates participating (as 
virtual states) in the elastic co-tunneling process. So one would expect [42] that 
the conductance fluctuations are correlated over about Ec/ÔE valleys. 

Equations (30) and (33) explain, why conductance G{J\f) and correlation 
magnetic field Bcorr(A/*) were observed [41] to vary with opposite phases when the 
gate voltage was varied. Later, the calculated [2] correlation function {SGe\{Bi) 
SGe\{B 2 )) was used to extract the values of the correlation field from the data, 
and a quantitative agreement with Eq. (33) was found [43]. At the same time, the 
correlation of conductance fluctuations in different valleys was found to fall off 
substantially faster than the theoretical prediction. 



5 Kondo Conductance of a Blockaded Quantum Dot 

Among the Ec\Af — àT'I/SE virtual states participating in the elastic co-tunneling 
through a blockaded dot, the top- most occupied discrete level plays a special role. 
If the number of electrons in the dot is odd, this level is filled by a single electron 
and therefore is spin-degenerate. The amplitude of an electron transfer through 
such a level, calculated in the fourth order in tunnel matrix elements tkm tpn 
logarithmically diverges at low temperatures. This divergence signals the Kondo 
singularity [44] in the transmission amplitude. 

If the junctions conductances are small, Gi ^2 ^ then the other levels 

of the dot, which are doubly- filled or empty, are unimportant in the discussion of 
Kondo effect. The model of the dot attached to two leads then can be truncated to 
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the Anderson single-level impurity model with the electron continuum consisting 
of two bands: 

Ha = + a2qaO-2qa) + + Un-iü^ (34) 

g,cr (7 

q,a 



Here and are the electron creation operators in the leads 1 and 

2, and on the upper occupied discrete level in the dot, respectively; and £q == 
2Ec{J\f* - M) are the corresponding energies in the electron continuum and on 
the dot; U ~ Ec- For brevity, the tunneling matrix elements ti and t 2 connecting 
the discrete state in the dot with the states in the leads are taken here to be 
g'-independent; for the same reason, we consider here the “depth” of the localized 
state to be confined by the conditions: ÔE C — Ec^ At first sight, the 
Anderson model with two bands (34) may be associated with a two-channel Kondo 
model [46]. However, it is easy to show that in fact such a two-channel model is 
degenerate, and can be reduced to the conventional single-channel one. Indeed, by 
a unitary transformation 

{ ft," { : } = Türrâ» { U ' 

Hamiltonian (34) can be converted [4] to the conventional one-band Anderson 
impurity model [45]. The localized state and band “a” form the usual Anderson 
impurity model, which is characterized by three parameters: U, Sq, and T = ri+F 2 . 
Band “/3” is entirely decoupled from the impurity. 

The unitary transformation (35) establishes the relation between the tunnel- 
ing conductance Gk associated with the spin- degenerate level (“Kondo conduc- 
tance”) in a quantum dot, and the known results for the t-matrix Ts{lo) of the 
conventional Kondo problem in the sd exchange model: 

G = Go jcLü{-df/dw) t [-TTi/Im = Go/ . ( 35 ) 



Here 



2^2 

Go = — sin^( 26 lo) = 



2 e 2 



2tit2 ^ 

\t\\ + \tl\ 



Tk is the characteristic temperature of the problem (Kondo temperature), and 
f{x) is some universal function (it has been evaluated numerically and plotted, 
e.g.^ in Ref. [48]). A remarkable property of scattering on a Kondo impurity, is 
that the corresponding cross-section approaches the unitary limit at low energies, 
/(O) = 1. The low-temperature correction to the unitary limit is proportional 
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Figure 7: (a) Linear conductance at T Tk and T^Tk^ Due to the Kondo effect, 
function G{Àf) develops plateaus in place of the “odd” valleys of the Coulomb 
blockade, (b) Sketch of the temperature dependence of the linear conductance, 
G{T), in the “odd” valley. Conductance decreases with the decrease of temperature 
from T Ec down to T ~ Te\. At very low temperatures, T ^ Tk, conductance 
grows again, reaching the unitary limit G = in the case of equal partial 

widths Fi and P 2 of the singly-occupied level, which gives rise to the Kondo effect. 




to and described by Nozieres’ Fermi- liquid theory [49]. The conventional [40] 
representation of the low-temperature expansion of f{x), 

yields one of the ways of defining the Kondo temperature. Being expressed [47] in 
terms of the parameters of the Anderson impurity model, Tk is given, up to some 
unimportant pre-exponential factor, by the following relation: 

Tk - (C/r)i/2 exp{7r£o(eo + U)/2TU). (38) 

Equations (36) and (37) tell us that upon sufficiently deep cooling, the gate 
voltage dependence of the conductance through a quantum dot should exhibit a 
drastic change. Instead of the “odd” valleys, which correspond to the intervals of 
gate voltage 

\N - (2n + 1)1 < ^, n = integer, (39) 

plateaus in the function G{N) develop, see Fig. 7a. In other words, the tempera- 
ture dependence of the conductance in the “odd” valleys should be very different 
from the one in “even” valleys. The conductance decreases monotonously with the 
decrease of temperature down to T Tei, and then saturates at the value of Ge\ 
in the even valleys. On the contrary, if the gate voltage is tuned to one of the 
intervals (39), the G{T) dependence is non- monotonous. After the initial drop oc- 
curs with lowering the temperature in the interval Tei ^ T ^ Ec, the conductance 
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starts to increase again at Tk, see Fig. 7b. Its T = 0 saturation value depends 
on the ratio of the partial level widths Fi and F 2 of a particular discrete level; 
these widths are random quantities described by Porter-Thomas distribution, see 
Sections 2 and 3. 

The experimental search for a tunable Kondo effect brought positive results 
[19] only recently. In retrospect it is clear, why such experiments were hard to 
perform. Let us express the Kondo temperature (38) in terms of the quantum dot 
parameters. Replacing the partial widths by their average values, we find: 



lh{Gi + G2)SE r ^ 

^ -2tt- 

Ec f 



The negative exponent in the above formula contains a product of two large pa- 
rameters, Ec/SE and e^/h(G\ -h G 2 ), leading to a strong suppression of Tk- For a 
quantum dot device, it is also hard to see the conventional signature of the Kondo 
effect, which is the logarithmic temperature dependence of the conductance in the 
perturbative regime [50], at temperatures significantly exceeding Tk> The proper 
expansion of the function f{T/TK) in Eq. (36) yields the temperature-dependent 
correction 









(41) 



As one can see from Eq. (41), the Kondo correction to the conductance remains 
particularly small compared to G^\ everywhere in the temperature region T ^ Tk • 

To bring Tk within the reach of a modern low- temperature experiment, one 
may try smaller quantum dots in order to decrease Ed^E\ this route obviously 
has technological limitations. Another, complementary option is to increase the 
junction conductances, so that G \^2 come close to 2e^ jh^ which is the maximal 
conductance of a single- mode quantum point contact. Junctions in the experi- 
ment [19] were tuned to G (0.3 — 0.5)e^/7r/i. A clear evidence for the Kondo 
effect was found at the gate voltages away from the very bottom of the odd- number 
valley, where A/"* — M is relatively small. Only in this domain of gate voltages the 
anomalous increase of conductance G{T) with lowering the temperature T was 
clearly observed. (The unitary limit and saturation of G, indicating that T Tk, 
were not reached even there.) At J\f = 2n-h 1, where the Kondo temperature is the 
smallest, the anomalous temperature dependence of the conductance was hardly 
seen. 

To increase the Kondo temperature, it is useful to make the junctions con- 
ductances larger. However, if G \^2 come close to e^firh, the discreteness of the 
number of electrons on the dot is almost completely washed out. This raises the 
question about the nature of the Kondo effect in the absence of charge quanti- 
zation. The way charge quantization is destroyed at G \^2 ^ e^fnh depends on 
the detailed properties of the junctions. For the experimentally relevant case of 
a dot connected to leads by single-mode quantum point contacts, this problem 
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was addressed in Ref. [51]. Spin quantization and Kondo effect for an almost open 
quantum dot were considered in Ref. [5]. Here we will reproduce the result for Tk 
only in the case of a strongly asymmetric setup, assuming one of the junctions is 
in the weak tunneling regime, G\ ^ IttU, and the other one is in the regime of 
weak reflection, G 2 = (e^/7r/i)(l — |r 2 p) with |r 2 p <C 1. Kondo effect develops in 
the “odd” valleys (39) at 



To{Af) = (4e^/7r)£^ck2p cos^ ttAA. 

Here C = 0.5772 is the Euler constant. Note, that in the case of weak reflec- 

tion the exponent in the Kondo temperature is ~ {Ec /SE)\r 21"^ , and the large 
ratio Ec/SE can be partially compensated by the smallness of |rp. So, the spin 
of a quantum dot may remain quantized even if charge quantization is destroyed, 
and the average charge e{N) is not integer. This spin-charge separation is possible 
because charge and spin excitations of the dot are controlled by two very different 
energies: Ec and SE, respectively. Kondo effect is distinguishable on the back- 
ground of the elastic co-tunneling, as long as |r 2 p ^ ÔEj Ec^ and, correspondingly, 
Tk Note, that similarly to the case of weak tunneling, the Kondo tempera- 

ture (42), and the conductance at T <C Tk exhibits strong mesoscopic fluctuations 
which can be described with the help of the Random Matrix Theory. 

6 Conclusion 

We have discussed various regimes of electron transport through a blockaded dot 
weakly coupled to the leads, see Eq. (6). Transport is controlled by the sequential 
tunneling of electrons above temperature Tin, see Eq. (25). In the temperature 
interval T^ < T < Tn the main contribution to the conductance comes from the 
inelastic co-tunneling. Everywhere above the temperature Tei, see Eq. (31), the 
mesoscopic fluctuations of the conductance SG are small compared to the average 
conductance (G). Below that temperature, the electrons passing through the dot 
preserve the coherence; at T < Tei mesoscopic fluctuations are strong, ÔG ~ (G). 
If the dot carries a spin (z.e., if the number of electrons on the dot is odd), then 
Kondo effect should develop at sufficiently low temperatures. This results in a non- 
monotonous temperature dependence of the conductance in the “odd” valleys of 
the Coulomb blockade, see Fig. 7. The evolution of the conductance mechanisms 
with temperature is summarized in Fig. 8. 

In the weak tunneling regime, see Eq. (6), we are able to apply the perturba- 
tion theory in the junction conductances. Such an approach cannot be used in a 
consideration of an interesting and important for experiments regime of strong dot- 
lead coupling {Gi ~ I nh). To treat this regime, one needs to develop methods 
allowing to simultaneously account for the Coulomb interaction (1) and tunneling 
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Figure 8: Evolution of the linear conductance mechanisms with temperature. The 
region where the inelastic co-tunneling dominates the electron transport, is con- 
fined by the temperatures Tin, see Eq. (25), and Tgi, see Eq. (31). At temperatures 
below Tel, mesoscopic conductance fluctuations 5G are of the order of the average 
value of the conductance. The Kondo temperature Tk is given by Eq. (38). 



(20) in a non-perturbative manner. Despite this difficulty, there is quite complete 
understanding by now of the most relevant case of a quantum dot connected to 
the leads by single- channel junctions. The special limit of the partially-open dot 
case, described by the conditions G\ and 1^% — G 2 ^ e^/Tr/i, allows 

for a straightforward generalization of the scheme of Fig. 8. In that limit, no room 
is left for the thermally- activated transport: Tin The inelastic co-tunneling 

mechanism controls the electron transport at temperatures Tei ^ T ^ Ec\ proper 
results for the conductance G{T) can be found in Ref. [52]. At Tk ^ T ^ Tei, 
elastic co-tunneling is the leading mechanism of conduction, 6G ^ (G); the de- 
tailed results for the conductance and its mesoscopic fluctuations can be found in 
Ref. [53]. At lower temperatures, the Kondo effect may develop. The condition for 
the Kondo effect to be distinguishable from the mesoscopic conductance fluctua- 
tions, is Tk ~ àE. The Kondo effect is washed out if the conductance G 2 comes 
too close to /'kTi. 
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Tunneling Between Parallel Quantum Wires 

Bertrand I. Halperin 



Extended Abstract 

Recent developments in fabrication techniques and experimental design have al- 
lowed the observation of tunneling conductance between parallel quantum wires 
of very high quality, in GaAs heterostructures, using the cleaved edge overgrowth 
technique [1]. To a first approximation momentum is conserved during the tunnel- 
ing process, except for a momentum boost, HQ — Bedfc, produced by an applied 
magnetic field B perpendicular to the plane containing the two wires, where d is 
the separation between the wires. The differential tunnel conductance G = dljdV 
is proportional to the spectral density for creating a hole in one wire and and 
electron in the other, with total momentum Q and energy E = eV^ where V is the 
voltage difference between the two wires. 

For two infinite wires, with non-interacting electrons, a gray-scale plot of G 
would show positive and negative ^-function peaks along curves in the V — B 
plane which are directly related to the one-electron dispersion curves in the two 
wires. If electron-electron interactions are included according to the Luttinger- 
Liquid theory for a pair of interacting one-dimensional wires, the J- functions are 
replaced, at small but finite V, by more complicated structures singularities along 
lines of different slopes that reflect the existence of separate velocities for spin 
and charge propagation, instead of the simple Fermi-velocity, in each of the two 
wires. Detailed predictions for the form of G have previously been given, for a 
pair of infinite wires, by Carpentier, Peca and Balents [2], and by Ziilicke and 
Governale [3]. Experimental results in an appropriate regime were found to be 
consistent with the theoretical predictions, giving an indication of the separation 
of spin and charge velocities, but the experimental resolution did not permit an 
accurate measurement of the these velocities. 

In recent work, together with Yaroslav Tserkovnyak, Ophir Auslaender, and 
Amir Yacoby, we analyzed in greater detail the experimental results in the range 
of small voltages and small magnetic fields, which in the sample studied were 
controlled by tunneling between one-dimensional bands in the two wires whose 
Fermi- momenta differed by only a few percent. [4,5] For non-interacting electrons, 
one can then neglect the difference in the Fermi- velocities vp between the two 
wires, and for infinite wires one would then expect to observe the ^-function con- 
tributions to G along two lines, with slopes ±l/vp in the E—Q plane. However, the 
measurements show striking additional structure, which reflects the finite length 
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of the region of tunneling, determined by the length L of the shorter of the two 
wires. In particular, in the gray-scale plot of G there are a series of light and 
dark bands parallel to the lines expected for an infinite sample, refiecting oscil- 
lations in G with an approximate period SB in the magnetic-field direction that 
varies inversely with the length L for different samples. The period corresponds 
roughly to SQ = 27tIL, as might be expected from a diffraction pattern. Unlike a 
conventional diffraction pattern, however, the observed patterns were one- sided, 
appearing only on the positive V side of the infinite-length lines (where positive V 
favors electron-tunneling from the longer of the two wires into the shorter wire). 
We have been able to explain this effect by considering that the confinement at 
the ends of the shorter wire is not sharp, but is gradual, due to the relatively 
large distance of the wire from the confining electrostatic gates. Thus the electron 
wave functions in the shorter wire are not sine waves but are WKB wave functions 
containing about 100 nodes, that are spaced further apart near the ends than in 
the center of the line segment. By making a reasonable assumption for the shape 
of the confining potential, we have been able to get good agreement for both the 
varying spacing between maxima of the oscillatory pattern and for the fall off of 
intensity with distance from the beginning of the pattern. 

Another feature of the observed diffraction pattern is a series of bands par- 
allel to the B-axis, where the modulation of G is greatly reduced. We can explain 
this a Moire pattern from superposition of two sets of parallel lines with differ- 
ent slopes, corresponding to two different velocities, which we identify with the 
spin velocity and the charge velocity of the wires. (Only the antisymmetric charge 
mode, with charge fiuctuations of opposite sign in the two wires should be excited 
in the tunneling experiment). The spacing ÔV between the bands of suppression 
allows one to determine that the charge velocity is larger than the spin velocity by 
a factor of 1.3, which is consistent with other estimates for this system. Detailed 
calculations, in which we have generalized the Luttinger liquid theory to a situ- 
ation of two coupled wires, with a varying density along the length of one wire, 
give good agreement with the observations. We believe that these are the most 
precise measurements to date of separate spin and charge velocities in a tunneling 
mecLSurement of one-dimensional wires. 

In summary, the experiments, under interpretation, give detailed information 
about the confinement at the ends of the wire, as well as a measurement of the 
charge and spin velocities in the coupled system. 

This work has been supported by the National Science Foundation through 
grants DMR-02-33773, and by the US-Israel BSF. 
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Field theories of paramagnetic Mott insulators 



Subir Sachdev 



Abstract. This is a summary of a central argument in recent review articles by the 
author (Physica A 313, 252 (2002), Annals of Physics 303, 226 (2003), and Rev. 
Mod. Phys, July 2003). An effective field theory is derived for the low energy spin 
singlet excitations in a paramagnetic Mott insulator with collinear spin correlations. 



1 Introduction 

In a recent article [1] (intended for an audience of experimentalists), the author has 
reviewed arguments that many aspects of the physics of the cuprate superconduc- 
tors can be understood by using their proximity to paramagnetic Mott insulators. 
Further, a distinction was made between Mott insulators with collinear and non- 
collinear spin correlations, and it was argued that current experimental evidence 
suggests that we need only consider the collinear class. A phenomenological de- 
scription of the ground states and excitations of these classes of Mott insulators 
was provided, along with a discussion of their experimental implications. A more 
technical discussion (intended for theorists) of such insulators, along with a de- 
scription of the effective field theories which describe their low energy properties 
appears in Ref. [2, 3]. Here, we briefly recall the derivation and properties of the 
effective field theory of Mott insulators with collinear spin correlations, which is 
expressed in terms of a compact U(l) gauge field. The non-collinear class leads 
naturally to a Z 2 gauge theory, but we will not consider it here. The reader is 
referred to these previous reviews [1, 2, 3] for complete citations to the literature. 



2 Compact U(l) gauge theory of Mott insulators 

We focus on Mott insulators on a d dimensional bipartite lattice of sites j. The 
spin operator Sj on site j at imaginary time r can be written as 



Sj(t) = rjjSn{rj,r)] (1) 

here rjj = ±1 on the two sublattices, rj is the spatial co-ordinate of site j, n is a 
unit length vector in spin space, and S is the (integer or half-odd-integer) angu- 
lar momentum of each spin. The antiferromagnetic exchange interaction between 
near neighbor spins implies that n(rj,r) will be a slowly varying function of its 
spacetime arguments. A standard analysis of the coherent state path integral of 
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over the Sj spins shows that the low energy quantum fluctuations are described 
by the following partition function 



Z = y*Pn(r, T)J(n^(r, r) — 1) exp —iS E Vj j drAAn{r,.r}) 
- ^ / d'^rdr {(drnf + c^(Vrii)^) 



(2) 



where c is the spin- wave velocity, and p is a coupling constant which controls the 
strength of the quantum fluctuations. Excluding the first Berry phase term, this is 
the action of the 0(3) non-linear sigma model in d-f 1 spacetime dimensions. Here 
we are primarily interested in the consequences of the Berry phases: ^r(n(r))dr is 
defined to be the oriented area of the spherical triangle defined by n(r), n(r + dr), 
and an arbitrary reference point no (which is usually chosen to be the north pole). 

The theory (2) can be considered to be the “minimal model” of antiferro- 
magnets. In dimensions d > 1 it has at least two phases: at small g there is 
the conventional magnetically ordered “Neel” phase with (n) 0, while at large 

g there is a “quantum disordered” paramagnetic phase which preserves spin ro- 
tation invariance with (n) = 0. We are especially interested here in the nature 
of this paramagnetic state. In this section, we will manipulate Z in this large g 
regime, and derive an alternative formulation which allows easier computation of 
the integral over the Berry phases. 

The key to an analysis of the large g regime is a better understanding of 
the nature of At- We will see that At behaves in many respects like the time- 
component of a compact U(l) gauge field, and indeed, this accounts for the sug- 
gestive notation. All physical results should be independent of the choice of the 
reference point no, and it is easy to see by drawing triangles on the surface of a 
sphere that changes in no amount to gauge transformations of At- If we change 
no to nQ, then the resulting A'^ is related to At by 

A', = At- dTct>{r) (3) 

where 0(r) measures the oriented area of the spherical triangle defined by n(r), 
no, and Uq. Furthermore, as we will discuss more completely below, the area of any 
spherical triangle is uncertain modulo 47 t, and this accounts for the ‘compactness’ 
of the U(l) gauge theory. 

We proceed with our analysis of Z. First, we discretize the gradient terms of 
the 0(3) sigma model. We will limit our considerations here to antiferromagnets 
on d dimensional cubic lattices, but similar considerations apply to other bipartite 
lattices. We also discretize the imaginary time direction, and (by a slight abuse of 
notation) use the same index j to refer to the sites of a d + 1 dimensional cubic 
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lattice in spacetime. On such a lattice we can rewrite (2) as 

^ / n - 1) exp X! , (4) 

where the sum over jj, extends over the d + 1 spacetime directions. We have also 
dropped unimportant factors of the lattice spacing and the spin-wave velocity in 

(4). 

As noted above, we are especially interested here in the large g regime where 
there are strong fluctuations of the n^. There are strong cancellations from the 
Berry phases between different spin configurations in this regime, and so the second 
term in Z has to be treated with great care. We will do this by promoting the field 
Aj^ to an independent degree of freedom, while integrating out the n^. Notice 
that we have now introduced all d + 1 components of the compact U(l) gauge 
field with the index //, while only the g = r component appears explicitly in (4). 
The remaining components appear naturally as suitable degrees of freedom when 
we integrate the out. Formally, the integration over the uj can be done by 
introducing new ‘dummy’ variables and rewriting (4) by introducing factors 
of unity on each link; this leads to 




In the first expression, if the integral over the is performed first, we trivially 
return to (4); however, in the second expression we perform the integral over the 
nj variables first, at the cost of introducing an unknown effective action Sa for 
the Ajf^. In principle, evaluation of Sa may be performed order- by-order in a 
“high temperature” expansion in 1/g: we match correlators of the Aj^ flux with 
those of the Ajfi flux evaluated in the integral over the rij with positive weights 
determined only by the l/g term in (4). Rather than undertaking this laborious 
calculation, we can guess essential features of the effective action Sa from some 
general constraints. First, correlations in the decay exponentially rapidly for 
large g (with a correlation length ^ l/ln(^)), and so Sa should be local. Second, 
it should be invariant under the lattice form of the gauge transformation (3) 

~ ( 6 ) 
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associated with the change in the reference point on the unit sphere from no to 
Hq, with (j)j equal to the area of the spherical triangle formed by n^ , no and Uq. 
Finally the area of any triangle on the sphere is uncertain modulo 47 t and so the 
effective action should be invariant under 

Aj^ — > Aj^ + 2tt. (7) 

The simplest local action which is invariant under (6) and (7) is that of compact 
U(l) quantum electrodynamics and so we have 

Z = i W dAjij, exp I ^ ^ cos - i2S ^ ) , (8) 

m ° j / 

for large p; comparison with the large g expansion shows that the coupling ^ g‘^ . 
In (8), is the discrete lattice derivative along the p direction, and the sum over 
□ extends over all plaquettes of the d+1 dimensional cubic lattice - both notations 
are standard in the lattice gauge theory literature. 

The first term in the action (8) is, of course, the standard ‘Maxwell’ term of a 
compact U(l) gauge field. In this language, the Berry phase has the interpretation 
of a / J^Ajj^ coupling to a fixed matter field with ‘current’ = 2S6^r^ This 
corresponds to static matter with charges ±2S on the two sublattices. It is this 
matter field which will crucially control the nature of the ground state. 

The remaining analysis of Z depends upon the spatial dimensionality d. In 
d = 1, a dual model of (8) is solvable, and the results are in complete accord with 
those obtained earlier by Bethe ansatz and bosonization analyses of spin chains. 
We will consider the d = 2 case in the section below. There has been relatively little 
discussion of the d = 3 case (which exhibits both confining and deconfining phases 
of the gauge theory), and this remains an important avenue for future research. 

3 Duality mapping in d = 2 

As is standard in duality mappings, we first rewrite the partition function in 2 + 1 
spacetime dimensions by replacing the cosine interaction in (8) by a Villain sum 
over periodic Gaussians: 

^ = XI / n ^ X - i2S ^ r]jAjr 

(9) 

where e^y\ is the total antisymmetric tensor in three dimensions, and the are 
integers on the links of the dual cubic lattice, which pierce the plaquettes of the 
direct lattice. Throughout this subsection we will use the index j to refer to sites 
of this dual lattice, while j refers to the direct lattice on sites on which the spins 
are located. 
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• • • • • 



Figure 1: Specification of the non-zero values of the fixed field The circles are 
the sites of the direct lattice, while the crosses are the sites of the dual lattice, J; 
the latter are also offset by half a lattice spacing in the direction out of the paper 
(the fi = T direction). The are all zero for while the only non-zero 

values of a^y are shown above. Notice that the flux obeys (11). 



We will now perform a series of exact manipulations on (9) which will lead to 
a dual interface model [4, 5]. This dual model has only positive weights - this fact, 
of course, makes it much more amenable to a standard statistical analysis. This 
first step in the duality transformation is to rewrite (9) by the Poisson summation 
formula: 

V □ 

= ^ exp (-^ ) , (10) 

V j □ / 

where (like qj^) is an integer- valued vector field on the links of the dual lattice 
(here, and below, we drop overall normalization factors in front of the partition 
function). Next, we write the Berry phase in a form more amenable to duality 
transformations. Choose a ‘background’ flux which satisfies 

(H) 

where j is the direct lattice site in the center of the plaquette defined by the curl 
on the left-hand-side. Any integer- valued solution of (11) is an acceptable choice 
for and a convenient choice is shown in Fig 1. Using (11) to rewrite the Berry 
phase in (9), applying (10), and shifting by the integer 2Sa^^, we obtain a new 
exact representation of Z in (9): 

Z = f J^dAj^exp ~ ~ ^ j • (12) 
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Figure 2: Specification of the non-zero values of the fixed fields (a) PCj and (b) 
introduced in (14). The notational conventions are as in Fig 1. Only the fi = r 
components of are non-zero, and these are shown in (b). 



The integral over the Aj^ can be performed independently on each link, and its 
only consequence is the imposition of the constraint = 0. We solve this 

constraint by writing as the gradient of a integer- valued ‘height’ hj on the sites 
of the dual lattice, and so obtain 



z = y] exp 





(13) 



This is the promised 2-f-l dimensional interface, or height, model in almost its final 
form. 



The physical properties of (13) become clearer by converting the “frustration” 
in (13) into offsets for the allowed height values. This is done by decomposing 
into curl and divergence free parts and writing it in terms of new fixed fields, 
Xn and W/z as follows: 



— ^ji^j T ^nijxAi^yjX’ 



(14) 



The values of these new fields are shown in Fig 2. Inserting (14) into (13), we can 
now write the height model in its simplest form [4] 



Zft = y exp [ -y . (15) 

{Hj} V j ) 

where 

Hj = h,-2SXj (16) 

is the new height variable we shall work with. Notice that the 3^^^ have dropped 
out, while the Xj act only as fractional offsets (for S not an even integer) to the 
integer heights. From (16) we see that for half-odd-integer S the height is restricted 
to be an integer on one of the four sublattices, an integer plus 1/4 on the second, an 
integer plus 1/2 on the third, and an integer plus 3/4 on the fourth; the fractional 
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parts of these heights are as shown in Fig 2a; the steps between neighboring heights 
are always an integer plus 1/4, or an integer plus 3/4. For S an odd integer, the 
heights are integers on one square sublattice, and half-odd-integers on the second 
sublattice. Finally for even integer S the offset has no effect and the height is an 
integer on all sites. We discuss these classes of S values in turn in the following 
subsections. 

3.1 S even integer 

In this case the offsets 2S^j are all integers, and (15) is just an ordinary three di- 
mensional height model which has been much studied in the literature. Unlike the 
two-dimensional case, three-dimensional height models generically have no rough- 
ening transition, and the interface is always smooth. With all heights integers, 
the smooth phase breaks no lattice symmetries. So square lattice antiferromagnets 
with S even integer can have a paramagnetic ground state with a spin gap and 
no broken symmetries. This is in accord with the exact ground state for a 5 = 2 
antiferromagnet on the square lattice found by Affleck et a/., the AKLT state [6]. 

3.2 S half-odd-integer 

Now the heights of the interface model can take four possible values, which are 
integers plus the offsets on the four square sublattices shown in Fig 2a. As in 
Section 3.1, the interface is always smooth i.e. any state of (15) has a fixed average 
interface height and any well-defined value for this average height breaks 

the uniform shift symmetry of the height model under which Hj Hj ± 1. After 
accounting for the height offsets, we will see below that any smooth interface must 
also break a lattice symmetry with the development of bond order, this allows a 
number of distinct spin gap ground states of the lattice antiferromagnet. 

It is useful, first, to obtain a simple physical interpretation of the interface 
model in the language of the S = 1/2 antiferromagnet [7]. From Fig 2a it is clear 
that nearest neighbor heights can differ either by 1/4 or 3/4 (modulo integers). 
To minimize the action in (15), we should choose the interface with the largest 
possible number of steps of ±1/4. However, the interface is frustrated, and it is 
not possible to make all steps ±1/4 and at least a quarter of the steps must 
be ±3/4. Indeed, there is a precise one-to-one mapping between interfaces with 
the minimal number of ±3/4 steps (we regard interfaces differing by a uniform 
integer shift in all heights as equivalent) and the dimer coverings of the square 
lattice: the proof of this claim is illustrated in Fig 3. We identify each dimer 
with a singlet valence bond between the spins (the ellipses in Fig 2), and so each 
interface corresponds to a quantum state with each spin locked in the a singlet 
valence bond with a particular nearest neighbor. Fluctuations of the interface 
in imaginary time between such configurations correspond to quantum tunneling 
events between such dimer states, and an effective Hamiltonian for this is provided 
by the quantum dimer model [8]. 
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Figure 3: Mapping between the quantum dimer model and the interface model Z 
in (15). Each dimer on the direct lattice is associated with a step in height of ±3/4 
on the link of the dual lattice that crosses it. All other height steps are ±1/4. Each 
dimer represents a singlet valence between the sites, as in Fig 2. 




□:n 

I I I I 
I I I I 
I I I I 

□:n 



(b) 



Figure 4: Sketch of the two simplest possible states with bond order for S' = 1/2 on 
the square lattice: (a) the columnar spin-Peierls states, and (b) plaquette state. The 
different values of the {S^ • S^ ) on the links are encoded by the different line styles. 
Both states are 4- fold degenerate; an 8-fold degenerate state, with superposition 
of the above orders, also appears as a possible ground state of the generalized 
interface model. 



The nature of the possible smooth phases of the interface model are easy 
to determine from the above picture and by standard techniques from statistical 
theory [4, 7]. Interfaces with average height = 1/8, 3/8, 5/8, 7/8 (modulo 
integers) correspond to the four-fold degenerate bond-ordered states in Fig 4a, 
while those with (7/j) = 0, 1/4, 1/2, 3/4 (modulo integers) correspond to the four- 
fold degenerate plaquette bond-ordered states in Fig 4b. All other values of (iTj) 
are associated with eight-fold degenerate bond-ordered states with a superposition 
of the orders in Fig 4a and b. 

Support for the class of bond-ordered states described above has appeared in 
a number of numerical studies of S — 1/2 antiferromagnets in d = 2 which have 
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succeeded in moving from the small g Néel phase to the large g paramagnet. These 
include studies on the honeycomb lattice [9] (duality mapping in Ref. [4]), on the 
planar pyrochlore lattice [10] (duality mapping for a lattice with this symmetry 
is in Ref. [11], with a prediction for the bond order observed), on square lattice 
models with ring-exchange and easy-plane spin symmetry [12], and square lattice 
models with SU(A^) symmetry [13]. 

3.3 S odd integer 

This case is similar to that S half-odd-integer, and we will not consider it in detail. 
The Berry phases again induce bond order in the spin gap state, but this order 
need only lead to a two- fold degeneracy. 

4 Conclusions 

The primary topic discussed in this paper has been the effective field theory of 
paramagnetic Mott insulators with collinear spin correlations. This field theory is 
the compact U(l) gauge theory in (8), and applies in all spatial dimensions. We 
also reviewed duality mappings of (8) which are special to d = 2 spatial dimensions, 
and mapped the theory onto the interface model (15). Finally, we reiterate that 
paramagnetic Mott insulators with non-collinear spin correlations are described 
by a Z 2 gauge theory which has not been presented here. 
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Arrest and Flow of Colloidal Glasses 

M.E. Cates 



Abstract. I review recent progress in understanding the arrest and flow behaviour 
of colloidal glasses, based on mode coupling theory (MCT) and related approaches. 
MCT has had notable recent successes in predicting the re-entrant arrest behaviour 
of colloids with short range attractions. Developments based upon it offer impor- 
tant steps towards calculating, from rational foundations in statistical mechanics, 
nonlinear flow parameters such as the yield stress of a colloidal glass. An important 
open question is why MCT works so well. 



1 Introduction 
1.1 Soft Matter 

This paper addresses issues of arrest and flow in soft condensed matter; see [1] 
for a useful ensemble of background reading and [2] for a good, experimentally 
motivated overview. We consider a system of N particles in the size range of 
nanometres (e.g. globular proteins) to microns (traditional colloids), suspended 
in a solvent, with total volume V. To a good enough approximation, at least for 
equilibrium properties, the solvent degrees of freedom can be integrated out to 
give an effective pairwise Hamiltonian H = obvious notation). 

Equilibrium statistical mechanics, when applicable, is governed by the partition 
function Z = J exp{—PH)V[ri]; quantum effects play no role [3]. 

In many colloidal materials the effective interaction ti(r) comprises a hard 
core repulsion, operative at separation 2a with a the particle radius, combined 
with an attraction at larger distance. For simplicity one can imagine a square 
well potential of depth e and range <fa, with .^ < 1. Unlike atomic systems, to 
which colloidal ones are otherwise quite analogous, these parameters can be varied 
easily in experiment, essentially by varying the solvent conditions [3]. For example, 
adding polymers to a colloidal system will mediate an entropie attraction between 
spheres whose range is comparable to the size of the polymer coils and whose depth 
is controlled by their concentration. For globular proteins the same tricks can be 
played with salt concentration and pH. 

The resulting equilibrium pheuse diagrams are well known, and depend on the 
range parameter ^ and the attraction energy or temperature through the parameter 
/3e. For ^^0.2, at small f3e there is a phase separation from a colloidal fluid to a 
colloidal crystal. At higher f3e, a liquid phase intervenes; the fluid undergoes a gas- 
liquid separation at intermediate densities although the crystal is stable at higher 
ones. So far, this is just like the phase diagram of argon or a similarly classical 
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atomic substance. (But of course solvent fills the space between the colloids, so 
the gas is not a real gas.) However, for smaller ^ the liquid phase is missing; one 
has only one transition, from fluid to crystal. In fact, though, the liquid phase is 
still lurking beneath: it is metastable. 

1.2 Arrest in Colloidal Fluids 

Colloidal fluids can be studied relatively easily by light scattering [4] . This allows 
one to measure the dynamic structure factor S{q, t) = (p(q, t')p(— q, t' -\-t))/N and 
also the static one, S{q) = S{q,0). Here p(r, t)) = ~ ~ this is 

the real space particle density (with the mean value subtracted), and p(q, t) is its 
Fourier transform. For particles with hard-core repulsions, S{q) exhibits a peak 
at a value q* with q*a = 0(1). The dynamic structure factor S(q,t), at any q, 
decays monotonically from S(q) as ^ increases. In an ergodic colloidal fluid, S(q,t) 
decays to zero eventually; all particles can move, and the density fluctuations have 
a finite correlation time. In an arrested state, which is nonergodic, this is not true. 
Instead the limit S(q^oo)/S(q) = f(q) defines the nonergodicity parameter. (Note 
that this corresponds to the Edwards- Anderson order parameter in spin glasses.) 
The presence of nonzero f(q) signifies frozen- in density fluctuations. Although f(q) 
is strongly wavevector dependent, it is common to quote only f(q*) [5]. 

Colloidal fluids are found to undergo nonergodicity transitions into two dif- 
ferent broad classes of arrested nonequilibrium state. One is the colloidal glass, in 
which arrest is caused by the imprisonment of each particle in a cage of neigh- 
bours. This occurs even for e = 0 (i.e. hard spheres) at volume fractions above 
about (j) = 4:7ra^N/3V ^ 0.58. Such a system would, in equilibrium, be a crystal; 
but equilibrium can be delayed indefinitely once a glass forms (particularly if there 
is a slight spread in particle size a, which helps suppress nucléation). The noner- 
godicity parameter for the colloidal glass obeys f(q*) ~ 0.7. The second arrested 
state is called the colloidal gel. Unlike the repulsive glass, the arrest here is driven 
by attractive interactions, resulting in a bonded network structure. Such gels can 
be unambiguously found, for short range attractions, whenever l3e^5 — 10. Hence 
it is not necessary that the local bonds are individually irreversible (this happens, 
effectively, at [3e ^ 15—20); and when they are not, the arrest is collective, not local. 
It is found experimentally that for colloidal gels, f(q*) ^0.9, which is distinctly 
different from the colloidal glass. The arrest line for gel formation slices across 
the equilibrium phase diagram (e.g., plotted on the (<p,Pe) plane), and, depending 
on parts of it lie within two phase regions. This, alongside any metastable gas- 
liquid phase boundary that is present, can lead to a lot of interesting kinetics [6, 7], 
in which various combinations of phase separation and gelation lead to complex 
microstructures and time evolutions. 
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1.3 A Brief Primer on Mode Coupling Theory (MCT) 

This is not the place to explain MCT in detail. The most powerful form of the 
theory [8], which is favoured by most of the true experts, remains somewhat ob- 
scure to many others. However, in a stripped down version (see e.g. [9, 10]) the 
theory can be viewed as a fairly standard one-loop selfconsistent approach to an 
appropriate dynamical field theory. 

We take j3 = \ and start from the Langevin equations -f for 

independent particles of unit diffusivity {Dq = 1) subjected to external forces F^. 
The noise force then obeys {fzfj) = By standard manipulations one proceeds 
to a Smoluchowski equation for the A'-particle distribution function 

with evolution operator is Q = Vi.(Vi — F^). Now take the forces Fj to derive 

(via Fi = —S/iH) from an interaction Hamiltonian 

H = -^ldhd^r'p{r)p{r')c{\r-r'\) (1) 

where Nc{q) = V[l — S{q)~^]. This is a harmonic expansion in density fiuctuations; 
c{q) is called the direct correlation function, and its form is fixed by requiring that 
S{q) be recovered in equilibrium. Neglected are solvent mediated dynamic forces 
(hydrodynamic couplings); these mean that in principle the Langevin equations 
for the particles should have correlated noise. Also neglected are anharmonic terms 
in H; to regain the correct higher order density correlators (beyond the two point 
correlator S{q)) in equilibrium, these terms would have to be put back. 

These assumptions give a Langevin equation for the density p(r): 

p = VV + V{pV6H/0p) + V.h (2) 

where h is a suitable noise (actually with a nontrivial density dependence [11]). 
This equation is nonlinear, even with the harmonic choice of H. However, from it 
one can derive a hierarchy of equations of motion for correlators such as S{q^t)^ 
more conveniently expressed via ^(g,t) = S{q,t)/S{q). Factoring arbitrarily the 
four-point correlators that arise in this hierarchy into products of two $’s, one 
obtains a closed equation of motion for the two point correlator as 

^{q,t)+T{q) <^{q,t)+ [ m{q,t - t')^(q,t) =0 (3) 

Jo 

where T(g) = q^ / S{q) is an initial decay rate, and the memory function obeys 

m(q, t)=Y, K,,k^(k, t)#(k - q, t) (4) 

k 

with the vertex 
Vq,k = 



^^^5(g)5(fc)5(|k - q|)[q.kc(fc) + q.(k - q)c(|k - ql)]^ (5) 
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1.4 Dynamic Bifurcation 

The MCT equations exhibit a bifurcation that corresponds to a sudden arrest 
transition, upon smooth variation of either the density 0 or other parameters 
controlling the kernel in the harmonic hamiltonian H. This is best seen in the 
nonergodicity parameters f{q), which suddenly jump (for all q at once) from zero 
to nonzero values. Near this (on the ergodic side, which is always the direction 
MCT approaches from), 4>(g, t) develops interesting behaviour. Viewed as a func- 
tion of time, it decays onto a plateau of height f{q), stays there for a long time, and 
then finally decays again at very late times. The two decays are called jS and a re- 
spectively. Upon crossing the bifurcation, the a relaxation time diverges smoothly 
with the parameters; upon crossing the locus of this divergence, f[q) = S'(^,oo) 
therefore jumps discontinuously from zero to a value that is finite for all q. 

2 Critiques and Defence of MCT 

This mathematical structure means that S{q) fixes all of the dynamics (up to a 
scale factor which was the bare diffusion constant of one particle, here set to unity). 
As a result, the theory has significant predictive power: MCT gives, in terms of 
S'(g), the critical parameter values where arrest occurs; the power law exponents 
governing the a and j3 decays; the divergence of the a relaxation time; and f{q). 
This makes the theory falsifiable. Indeed, it has successfully been falsified. For 
example, there is no doubt that MCT gives the wrong density (j)g for the glass 
transition in hard spheres. On the other hand, the predictions (including that for 
(/)g) do all agree with experiments at about the 10 percent level, and often better. 

Attitudes among theorists to MCT for colloids therefore vary considerably. 
Some argue that the approximations made are uncontrolled (true) and that, even 
though there are no explicit adjustable parameters in the theory, its approxima- 
tions have been implicitly tuned to suit the problem at hand (normally taken to 
be the glass transition in hard spheres). These might be fair accusations in part, 
but if so they can also be leveled at much of equilibrium liquid state theory, where, 
for decades, ad-hoc closures (Percus-Yevick, hypernetted chain, etc.) have com- 
peted for survival by an essentially Darwinian process. Other theorists point to 
the already-falsified status of MCT, drawing attention to the misplaced (j)g or, 
more interestingly, to other physical situations where exactly the same kind of 
approximation leads to totally wrong predictions (e.g. spurious arrest in systems 
that are known to evolve smoothly towards a unique equilibrium state). 

Against all this must be weighed the continuous stream of experimental re- 
sults, many of them subtle but others not, that confirm the MCT predictions in 
surprising detail especially if one makes one or two ad-hoc ‘corrections’ to adjust 
parameters like (j)g [12]. (There are also many simulation results which increasingly 
confirm the same picture [5].) 

A striking recent success of MCT concerns systems with attractive interac- 
tions as well as hard-core repulsions, in a region of parameter space where not 
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one but two arrest transitions are nearby (these are the transitions to a colloidal 
glass and to a colloidal gel). First, MCT unambiguously predicts [13] that adding 
a weak, short range attraction to the hard sphere system should melt the glass. 
The mechanism appears to be a weakening of the cage of particles in which a given 
particle is trapped. (This is caused by members of that cage moving closer together 
under the attractive forces; gaps in the cage, allowing motion, are then more likely 
to appear.) Second, MCT predicts that adding more of the same attraction should 
mediate a second arrest, this time into a gel (f{q*) ^ 0.95). Third, MCT predicts 
that as parameters are varied, a higher order bifurcation point should be seen, 
when the re-entrant arrest line (implied by the above picture) crosses from being a 
smooth curve to being a cuspy one [13]. Although not every detail of this scenario 
is yet confirmed, there is clear experimental evidence of the predicted re-entrant 
behaviour connecting the glass and the gel arrest lines [14] (see Figure 1) and 
clear evidence of the proximity of the higher order bifurcation, which shows up as 
a characteristic logarithmic decay for 4>(ç, t) [15]. The latter is also seen clearly in 
recent simulations [16]; see Figure 2. 

It is worth emphasizing that these results were predicted by MCT before 
the experiments (or simulations) were actually begun. The success of MCT at 
unifying the glass and gel arrest transitions in attractive colloids does much to 
dispel fears of implicit parameter tuning during the earlier evolution of the theory. 
It reconfirms the generic success of the MCT approach, at around the 10 percent 
level, in describing interacting colloids. 

So, perhaps the time has come to stop criticizing MCT for colloids on the 
grounds that it ‘cannot be right’. Indeed it cannot: the problem is really to under- 
stand how it can do so well. To paraphrase Churchill [17]: Mode coupling theory 
is the worst theory of colloidal glasses - apart from all the others that have been 
tried from time to time. 



3 Shear Thinning 

In Refs. [18], M. Fuchs and the author developed a theory, along MCT lines, of col- 
loidal suspensions under flow. The work was intended mainly to address the case 
of repulsion-driven glasses, and to study the effect of imposed shear flow either on 
a glass, or on a fluid phase very near the glass transition. In either case, simpli- 
fications might be expected because the bare diffusion time tq = a^/Z^o is small 
compared to the ‘renormalized’ one r = u^/D, which in fact diverges (essentially 
as the a relaxation time) as the glass transition is approached. If the imposed 
shear rate (which we assume steady) is 7 , then for yro 1 < 7 t, one can hope 
that the details of the local dynamics are inessential and that universal features 
related to glass formation, should dominate. Note, however, that by continuing 
to use a quadratic H (Eq.l), we will assume that, even under shear, the system 
remains ‘close to equilibrium’ in the sense that the density fluctuations that build 
up remain small enough for a harmonic approximation to be useful. 
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Figure 1: Experimental test of MCT for colloids with short range attractions. The 
dashed line separates ergodic samples (open symbols) from nonergodic samples, 
identified as gels (squares), glasses (circles) and intermediate (+). The control 
parameter on the vertical axis is the concentration of added polymer; horizontal 
is the volume fraction. The solid line is the MCT prediction for the nonergodicity 
transition (shifted to give the correct (pg in the absence of polymer). Courtesy K. 
Pham; see Poon et. al. [14] for individual discussions of the samples marked A-G. 



The basic route followed in Ref. [18] is quite similar to that already laid out 
above for standard MCT. However, we assume that an imposed shear fiow is 
present; obviously this changes the equations of motion. A key simplification is to 
neglect velocity fiuctuations so that the imposed shear flow is locally identical to 
the macroscopic one; this cannot be completely correct, but allows progress to be 
made. For related earlier work see Refs. [19, 20]. 

We again take j3 = Dq = 1, and start from the Langevin equations = 
u + Fi +fi for independent particles of unit diffusivity subjected to external forces 
Fi and, now, an imposed flow velocity u(ri). We take this to be a simple shear 
flow with u(r) = ÿyx. The Smoluchowski equation 4^ = is unchanged but the 
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Figure 2: Simulation test of MCT for Brownian spheres with short range attrac- 
tions. The correlator t) is shown for various q, in a system thought to lie close 
to the higher order singularity at the glass-gel corner. Dotted lines are fits to the 
logarithmic time dependence predicted by MCT in this region. (From [16].) 



evolution operator is now O = — u(r^)). We again take the forces 

Fi to derive from Eq.l, and with the same assumptions as before gain a Langevin 
equation for the density p{r): 

p+u.Vp = V^p + V{pVSH/6p) + V.h (6) 

So far, the adaptation to the equations to deal with imposed shearing is 
fairly trivial. The next stages are not. We assume an initial equilibrium state with 
= 0) oc Z, and switch on shearing at t = O-h. We define an advected correlator 

= {p{ci,0)p{-ci{t),t)) / S(q)N (7) 

where q(t) = q + q.Kt with K the velocity gradient tensor, Kij — 'jôixSjy. This 
definition of the correlator subtracts out the trivial part of the advection, which is 
merely to transport density fluctuations from place to place. The nontrivial part 
comes from the effect of this transport on their time evolution; the main effect 
(see e.g. [20]) is to kill off fluctuations by moving their wavenumbers away from 
q* where restoring forces are weakest (hence the peak there in S{q)). Hence the 
fluctuations feel a stronger restoring force coming from H, and decay away more 
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strongly. This feeds back, through the nonlinear term, onto the other fluctuations, 
including ones transverse to the flow and its gradient (i.e., with q along z) for 
which the trivial advection is absent. 

There follow a series of MCT-like manipulations which differ from those of 
the standard approach because they explicitly deal with the switchon of the flow 
at t = 0+ and integrate through the transient response to obtain the steady state 
correlators, under shear, as t oo. There is no integration through transients 
in standard MCT; on works directly with steady-state quantities. (In practice 
also, the following results were obtained in Refs. [18] using a projection operator 
formalism which differs in detail from the version of MCT outlined above.) Despite 
all this, the structure of the resulting equations is remarkably similar to Eqs. 3,4: 



4>(q,i) +r(q,i) 



Jo 



= 0 



(8) 



with r(g) replaced by a time dependent, anisotropic quantity: 

r(q, t)S(q) =q^ + qxqyjt + (qxqyjt + ql'y^f)S{q) - qxqyS{q)/q (9) 



The memory kernel is no longer a function of the time interval t — t' but depends 
on both arguments separately 



m(q, t,t') = '^V{q,'k,t,t')^(k, t)$(k - q, t) (10) 

k 



through a time-dependent vertex V, too long to write down here [18]. 

Using a nonequilibrium Kubo-type relationship, one can also obtain an ex- 
pression for the steady state viscosity rj — cr{'y)/'y where is the shear stress 
as a function of shear rate. The viscosity is expressed as an integral of the form 



rj = 




k 



( 11 ) 



where the function F may be found in Ref. [18]. 



3.1 Results 

The above calculations give several interesting results. First, any nonzero shear 
rate, however small, restores ergodicity for all wavevectors (including ones which 
are transverse to the flow and do not undergo direct advection). This is important, 
since it is the absence of ergodicity that normally prevents MCT-like theories being 
used inside the glass phase, at T < Tg or (j) > (f)g. Here we may use the theory in 
that region, so long as the shear rate is finite. 

In the liquid phase {(j) < (f)g) the resulting flow curve <7(7) shows shear thin- 
ning at yr ^ 1, that is, when the shearing becomes significant on the timescale of 
the slow relaxations. This is basically as expected. Less obviously, throughout the 
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glass, one finds that the limit ^ 7(7 0+) = ay is nonzero. This quantity is called 

the yield stress and represents the minimum stress that needs to be applied before 
the system will respond with a steady-state fiow. (For lower stresses, various forms 
of creep are possible, but the fiow rate vanishes in steady state.) 

The prediction of a yield stress in colloidal glasses is significant, because 
glasses, operationally speaking, are normally defined by the divergence of the vis- 
cosity. However, it is quite possible for the viscosity to diverge without there being 
a yield stress, for example in ‘power law fluids’ where < 7 ( 7 ) ^ 7 ^ with 0 < p < 1 . 
Indeed, a recent model of ‘soft glassy materials’ (designed for foams, emulsions, 
etc., and based on the trap model of Bouchaud [21]) gives a well-developed power 
law fluid region above Tg in which the viscosity is infinite but the static shear 
modulus zero [22]. This does not happen in the present calculation, where the 
yield stress jumps discontinuously from zero to a nonzero value, ay, at (pg. The 
existence of a yield stress seems to be in line with most experimental data on the 
flow of colloidal glasses, although one must warn that operational definitions of 
the yield stress do vary across the literature [23]. Ours is defined as the limiting 
stress achieved in a sequence of experiments at ever decreasing 7, ensuring that a 
steady state is reached for each shear rate before moving onto the next one. The 
latter requirement may not be practically achievable since the equilibration time 
could diverge (certainly one would expect to have to wait at least for times t such 
that ÿt^l). But unless the flow curve has unexpected structure at small shear 
rates, the required extrapolation can presumably be made. 

The existence of a yield stress everywhere within the glass phase follows from 
the structure of the MCT-inspired calculations outlined above and detailed in Ref. 
[18]. However, to calculate an actual value for ay requires further approximations; 
these avenues are pursued in Ref. [18]. Quantitative results for one such approx- 
imation, called the ‘isotropically sheared hard sphere model’ (ISHSM), are given 
in Figure 3. Such approximations can also give values for the flow curve exponent 
after the onset of yield (a — cry ~ 7 ^ with p ~ 0.15), and for the growth of the 
yield stress beyond the glass transition (cry — cry ~ {(p — cpgY^'^). 



3.2 Schematic MCT models 



It has long been known that the key mathematical structure behind (conventional, 
unsheared) MCT can be captured by low-dimensional schematic models in which 
the full q dependence is suppressed. In other words, one chooses a single mode, 
with a representative wavevector around the peak of the static structure factor, and 
writes mode coupling equations for this mode in isolation. At a phenomenological 
level, one can capture the physics similarly even with shearing present (despite the 
more complicated vectorial structure that in reality this implies). Specifically one 
can define the F ^2 model - the sheared extension of a well known static model. 



Fi 2 - via 



Mt) + r 




m{t - t')é{t')dt' 



= 0 



( 12 ) 
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Figure 3: Flow curves for the ISHSM. The shear stress is given in units of kT /S' 
with d the particle diameter. The parameter Peo is /Dq with Do the bare 
diffusion constant. The curves marked with circles are within the fluid phase (with 
dashed asymptotes showing the Newtonian limit); the curves marked with squares 
are in the glass. The diamonds denote the critical case. Each curved is marked by 
its distance from the glass transition, (j) — (j)g. The yield stress at the glass transition 
(jy (here called ) is indicated on the left. 

with memory function (schematically incorporating shear) 

m{t) = [vi^{t) + V 2 ^^(t)]/ (1 + (13) 

The vertex parameters are smooth functions of the volume fraction (f) (and 
any interactions). To calculate flow curves, etc., one also needs a schematic form 
of Eq.ll; here we take the first moment of the correlator to fix the time scale for 
stress relaxation (which is, in suitable units, simply the viscosity): 

rOO 

rj= ^{t)dt (14) 

Jo 

(Note that a different choice, e.g. with 4>(t)^ in this equation to closer resemble 
Eq.ll, would yield quite similar results.) This schematic model gives very similar 
results to the ISHSM, with a — ay ^ ay — cry ^ {(j) — ^^)^^^ [18]. The 

qualitative reproducibility of these results within different types of approximation 
scheme is reassuring. 
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4 Shear Thickening said Januning 

The calculations described above predict, generically, shear thinning behaviour: 
advection kills fluctuations, reducing the a relaxation time, which causes the sys- 
tem to flow more easily at higher stresses. However, in some colloidal systems, the 
reverse occurs. This is shear thickening, and gives a flow curve cr('j) with upward 
curvature. In extreme cases, an essentially vertical portion of the curve is reported 
[24]. One interpretation of the latter scenario (called ‘discontinuous shear thick- 
ening’) is that the underlying flow curve is actually S-shaped. Since any part of 
the curve with negative slope is mechanically unstable (a small increase in the 
local shear rate would cause an acceleration with positive feedback), this allows a 
hysteresis cycle in which, at least according to the simplest models, discontinuous 
vertical jumps on the curve bypass the unstable section (see Figure 4). 




Figure 4: Three possible flow curves for a shear thickening material. The monotonie 
curve corresponds to continuous shear thickening. The remaining two curves are 
S-shaped; one expects, on increasing the shear rate, the stress to jump from the 
lower to upper branch at (or before) the vertical dashed line shown in each case. 
One curve shows the full jamming scenario: the existence of an interval of stress, 
here between 0.45 and 0.63, within which the flow rate is zero, even in a system 
ergodic at rest. (Stress and strain rate units are arbitrary.) 

If this viewpoint is adopted, there seems to be nothing to prevent the upper, 
re-entrant part of the curve to extend right back to the vertical axis (see Figure 
4) in which case there is zero steady-state flow within a certain interval of stress. 
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The system has both an upper and a lower yield stress delimiting this region. (If 
it is nonergodic at rest, it could also have a regular yield stress on the lower part 
of the curve near the origin - we ignore this here.) This case has been called Tull 
jamming’ [25]. Although mostly a theoretical speculation, one or two experimental 
reports of this kind of behaviour have appeared in the literature recently [26]. 

The above discussion suggests that shear thickening and full jamming might 
be viewed as a stress-induced glass transition of some sort. If so, it is natural to 
ask whether this idea can be accommodated within an MCT-like approach. Since 
the analysis of Ref. [18] gives only shear thinning, this is far from obvious. In 
particular, a stress-induced glass transition would require the vertex V to ‘see’ the 
stress; this might require one to go beyond harmonic order in the density, that is, 
it might require improvement to Eq.l. Indeed, since it is thought that jamming 
arises by the growth of chainlike arrangements of strong local compressive contacts 
[27], it is reasonable that correlators beyond second order in density should enter. 

In very recent work, an ad-hoc schematic model along the lines of Eqs. 12-14 
has been developed to address shear thickening. The vertex is ascribed explicit 
dependence not only on 7 (as suggested by the shear thinning calculations of Ref. 
[18]) but also on the shear stress a. It is found that a vertex which is monotoni- 
cally decreasing with 7 but monotonically increasing with cr can indeed result in 
shear thickening, and, under some conditions, in full jamming [28]. This work is 
preliminary, but interesting in that it suggests how new physics (beyond two-point 
correlations) may need to be added to MCT before the full range of observed col- 
loidal flow behaviour is properly described. Of course, even for systems at rest, it 
is known that some important physics is missing from MCT, in particular various 
kinds of ‘activated dynamics’ in which the system can move exponentially slowly 
despite being in a region of phase space where, according to MCT, it cannot move 
at all (see e.g. [10]). Jamming seems very different from this: so perhaps there are 
more things missing from MCT than just activated processes. 

5 Conclusion 

Mode Coupling Theory (MCT) has had important recent successes, such as pre- 
dicting, in advance of experiment, the re-entrant glass/gel nonergodicity curves 
that arise in colloidal systems with short range attractions [13, 14]. 

Theoretical developments directly inspired by MCT now offer a promising 
framework for calculating the nonlinear flow behaviour of colloidal glasses and 
glassy liquids [18]. In fact, this is the only quantitative framework currently in 
prospect for the rational prediction of yield behaviour and nonlinear rheology in 
this or any other class of nonergodic soft materials. (Other work on the rheology 
of glasses [22, 29] does not, as yet, offer quantitative prediction of experimental 
quantities.) While promising, many things are missing from our approach: velocity 
fluctuations, hydrodynamic forces, anharmonicity in H (possibly implicated in 
shear-thickening) etc., are all ignored. Further hard theory work is needed here. 
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The eventual goal lies beyond the steady state flow curve (shear stress as 
a function of shear rate, < 7 ( 7 )) discussed in this article. Ideally we would like a 
full constitutive equation that relates the stress tensor at a given time to the 
preceding deformation history (or vice versa). To do this, starting from statistical 
mechanics, is difficult for any class of material; its achievement for the case of 
entangled polymers [30], which are ergodic, was a highlight of theoretical physics 
in the late 20th century. To obtain a well-founded constitutive equation for a 
significant class of nonergodic soft materials is a worthy goal for the early 21 st. 

As a framework for theoretical prediction, MCT is easy to criticize, but much 
harder to improve. The key question that should exercise the minds of theorists 
is no longer ‘is it correct’ (it clearly is not, in a technical sense) but ‘why does it 
work so weir? Only by engaging with this question are we are likely to improve 
our understanding of nonergodic colloidal materials and, in the longer run, come 
up with something better. 

Acknowledgments. I am deeply indebted to Matthias Fuchs, much of whose work 
is reviewed above, for introducing me to MCT. 
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Decoding the Genome 



Eric D. Siggia 



Biology is the only natural science built on an accident. All life forms share so 
many biochemical factors, that the best explanation for what we now see is evolu- 
tion from a common ancestor. Biochemistry is defined by the genes (eg enzymes) 
’listed’ in the genome and thus by a string of letters A,C,G,T. The most com- 
pelling argument for evolution, at least as measured by bites of information comes 
from comparing genomic sequence. The most widely used of these molecular phy- 
logénies is built from the sequence of ribosomal RNA, a component of the protein 
synthesis machinery common to all forms of free living life. Though the algorithms 
for computing the phylogeny from the extant species has been under development 
for several decades [1], methods which treat the entropy of all possible trees, on 
par with the energy (number of mutations and perhaps insertion-deletions) and 
thus optimize a free energy are still a subject of active research. Also unsettled are 
general ways of testing for so called horizontal transfer of genes between ancestral 
species, which put loops into the so called ’tree of life’ for certain genes. However 
they are computed, these molecular phylogenies all agree that the vaist diversity 
of life is embodied in single celled organisms and all the multicelled plants and 
animals are a relatively recent addition to the biosphere. They cluster together in 
one small branch of the ribosomal RNA phylogeny [2, 3]. 

Biology has been energized in the past decade by several genome scale tech- 
nologies all of which stem form earlier biochemical discoveries, but whose extension 
to the genome scale required a significant injection of chemistry, physics and en- 
gineering. Genome sequencing is now routine and costs a few cents per assembled 
base. The most common strategy is to randomly fragment the genome into pieces 
in various size ranges, amplify each fragment, and then sequence ~ 600 bp on 
each end. The computer then assembles the fragments into ’contgis’ (continuous 
stretches of sequence) and then targeted methods are used to fill in the gaps be- 
tween these to complete the chromosomes. Repeats in the genome (since much 
evolution has occurred by gene duplication and divergence, and there are various 
self replicating repetitive elements in the genome) render the reassembly process 
challenging and there is clearly good engineering required to balance the capacities 
of the various steps and get an answer at minimal cost [4], Table 1. 

The second technology (with now many variants) that is essential for the 
research I will report on is expression analysis; the measurement of the messenger 
RNA level genome wide. All technologies exploit the specificity of base pairing 
interactions and by various means place gene specific strings of bases at defined 
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bacteria 


yeast 


worm 


fly 


plant 


human 


Genome size (10® bases) 


1-4 


12 


100 


120/180 


115/125 


3000 


Genes (10®) 


1-4 


6 


19 


15 


26 


35 


Genes (% genome) 


~80% 

{Ecoli} 


75% 


- 


20% 


- 


2% 


Transcription factors (TF) 
TF with sites 


300 

60 


> 250 


> 700 


> 650 
200 


> 1530 


> 2000 



Table 1: Genome statistics for some common model organisms. Transcription factors are DNA 
regulatory proteins, and the ’genes as a percent of genome’ entry is defined as the ratio of the 
number of base pairs translated into proteins vs total genome size. 



positions on a centimeter scale slide or ’chip’. The total message RNA in the 
sample, after certain preprocessing, is then allowed to hybridize (ie base pair), 
with the chip and the amount of each message is converted into a fluorescent 
signal for each gene specific tag on the chip, which can then easily be read out. 

The genome has been dubbed a ’parts list’ by the popular press, but it is in 
fact closer to an assembly manual; a rather delphic one. Certain parts have to be 
preassembled (ie we all start from an egg), but thereafter through development, 
proteins (and some RNA) direct the expression of specific genes, some of which 
build the organism and others act as gene control factors (so called transcription 
factors). With now many genomes to compare, it has become apparent, that the 
’parts list’ is not so different between humans and flies, or simple worms with 
1000 cells. The increase in life’s complexity has more to do with how these parts 
are coordinated, ie regulated, than with the creation ab initio of new parts. Since 
development is very much about the unfolding of complexity, there has been a 
very fruitful convergence between evolution and development as described in two 
recent books [5, 6]. 

Gene control ultimately derives from sequence, but the scale of this depen- 
dence is uncertain. The best characterized signals derive from the binding of small 
groups of proteins and can range from under lObp for a single protein site, to 
~ 1000 bp for the well characterized developmental ’modules’ in fly which bind 
multiple copies of ^ 5 different transcription factors. All strategies for finding 
these regulatory sites from sequence exploit statistical irregularities ie there are 
too many (or too few) copies of some pattern based on some model. The data used 
is either repetition within one genome (see eg [7]) or more effectively a comparison 
between the regulation of homologous genes in different organisms. In the latter 
case evolution does the work; what matters for fitness varies less rapidly than what 
does not. The most informative evolutionary distance has to be discovered empir- 
ically. Typically the genes, being hundreds to thousands of bases in length, are 
easy to locate when the degree of evolutionary separation is optimized to expose 
the changes in regulation on much shorter scales. 
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The most complete set of comparative data exists for the ~ 10 fully se- 
quence bacteria related to Ecoli (eg the agents of cholera, salmonella poisoning, 
plague, . . . ) Although Ecoli is by far the best studied bacterium, less than 25% 
of its transcription factors have characterized binding sites, and fewer than 15% 
of the independent transcription units have been dissected for their regulatory 
inputs. The search for regulatory features, proceeds first by comparing the regu- 
latory sequences for homologous genes [8, 9], gene by gene. A second step then 
groups the thousands of sites thus found into several hundred clusters (one for each 
regulatory factor) [10], which then suggest which groups of genes are co- regulated. 

The second assembly step, makes apparent a fundamental difficulty in any 
effort to deduce regulatory function from sequence. The cell functions by proteins 
binding to DNA, hence every genome encodes for a decoding machine tuned to 
recognize certain sequences. We can not begin to calculate protein-DNA affini- 
ties, we have to resort to clustering of similar sequences. For sufficiently diverged 
sequences, clustering will be impossible, even though knowing the centers of the 
clusters (ie by having the proteins) its possible to correctly classify each binding 
site by the closest binding factor. By one of those meta consequences of evolu- 
tion, sites will become as fuzzy as possible consistent with the cell’s (and not the 
bioinformat ician’s) ability to function. Hence clustering data from a single genome 
(without other information to limit the collection of sequences being searched) is 
impossible for most factors. Using multiple species is a way around this dead-end, 
they provide multiple somewhat independent samples for the same factor, and 
hence better statistics. The clustering problem can be recast as a balance between 
energy (similar things like to sit together) and entropy (all the ways of making clus- 
ters). Interspecies data allows us to cluster bigger units, thus with more binding 
energy, but with the same entropy. Hence one does better [10]. 

The development of multicellular life involves a very strong transcriptional 
program to differentiate the many different cell types, which express very sets of 
proteins from the same genome. Experiment has shown that 100 — 1000 bp seg- 
ments of sequence can autonomously confer a posit ion- time specific expression. 
(The experiment is to put the sequence module in proximity to a foreign reporter 
gene, incorporate into the genome, and monitor whether its expression recapit- 
ulates some portion of the native pattern). If one uses information about the 
factor binding sites that play a role in some stage of development, then looking 
for collections of sites without regard to order does recover a significant number of 
modules in the fly [11]. The algorithm computes the free energy of all possible ways 
of generating the data from a model of independently positioned factor sites or 
’background’ and optimizes their probabilities. Many aspects of sequence analysis 
are ultimately statistical in character [12], and physical analogies and calculational 
devices from statistical mechanics have a definite role to play. 
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Physical Basis of Interference Effects in Hearing 



Thomas Duke, Daniel Andor and Prank Jiilicher 



Abstract. To capture faint sounds, the ear uses an active system of amplification. 
We and our colleagues have put forward the idea that the amplifier comprises a 
set of ‘self-tuned critical oscillators’: each hair cell contains a force-generating dy- 
namical system which is maintained at the threshold of an oscillatory instability, 
or Hopf bifurcation. The active response to a pure tone is perfectly suited to the 
ear’s needs, since it provides frequency selectivity, exquisite sensitivity and wide dy- 
namic range. However, the intrinsic nonlinearity of the mechanism causes tones of 
different frequency to interfere with one another in the cochlea. In order to provide 
a framework for understanding how the ear processes the more complex sounds of 
speech and music, we have examined the response of a critical Hopf oscillator to 
two tones. Our calculations indicate how the response to one tone is suppressed 
by the presence of a second tone of similar frequency. They also show how a char- 
acteristic spectrum of distortion products is generated. The results are in accord 
with experimental observations of basilar membrane motion. Given the complexity 
of the nonlinear response, how does the ear distinguish the frequency components 
of a sound source? We propose a simple model of pitch extraction based on the 
timings of neural spikes, and investigate to what extent psychophysical phenomena 
such as the sensation of dissonance and auditory illusions can be attributed to the 
physical nature of the peripheral detection apparatus. 



1 Introduction, Model ^uld Results 

It has recently been proposed that the active cochlear amplifier consists of a set of 
nonlinear mechanical oscillators, each of which is adjusted by a self-tuning mecha- 
nism to be on the verge of vibrating [1, 2]. At this critical point, the response dis- 
plays a generic compressive nonlinearity, with the displacement varying as the cube 
root of the stimulus amplitude. Evidence supporting this amplification mechanism 
has recently been obtained from micromanipulation experiments on hair bundles 
of the bullfrog sacculus [3], which indicate that the oscillatory dynamical system 
resides within the bundle itself. The resonant response of a critical oscillator am- 
plifies faint sounds and provides frequency tuning, but the intrinsic nonlinearity of 
the system causes interference effects when stimulus tones with different frequen- 
cies are present. We have calculated the response of a critical oscillator to two 
tones [4] and demonstrated that two interference effects in hearing are character- 
istic features of this active detection mechanism, whose generic properties may be 
described mathematically: 
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1. Two-tone suppression: By extinguishing the nonlinear amplification, the 
presence of a second tone systematically suppresses the response to a test 
tone (Fig. lA). 

2. Distortion products: Nonlinear interactions between tones generate a char- 
acteristic spectrum of distortion products (Fig. IB). 




erptuüc: or stlnjàc tiQ nodJng boncj fîHrj 



Figure 1: Two-tone interference effects. A: Active nonlinear response to a single 
tone ((dotted line), and suppressed response to the same tone (full line) in the 
presence of a masking tone of similar frequency and fixed amplitude. B: Spectrum 
of distortion products generated by two tones of similar frequency (1000 Hz and 
1050 Hz, arrowed) and equal amplitude. 



2 Discussion 

We have proposed that, in the mammalian cochlea, the basilar membrane is driven 
by a set of critical oscillators (most likely identified with outer hair cells) and 
propagates an active traveling wave [5] . Thus we submit that the two-tone response 
of a critical Hopf oscillator has considerable bearing on human psychoacoustics. 
A simple algorithm for pitch extraction [4], based on the timings of neural spikes, 
suggests that the sensation of dissonance and the phenomenon of the residue pitch 
might both have their origins in the physical nature of the active cochlear amplifier. 
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Active Behaviors in Living Cells 



Prank Jülicher 



Abstract. We review work on the force and motion generation in cells. Starting from 
the physics of individual motor molecules, we discuss how complex dynamic behav- 
iors emerge from the interplay of many active elements. The concepts developed for 
simple systems can be used to adress the active mechanical properties of complex 
cellular structures. 



1 Introduction 

One of the most visible feature of living cells is their ability to generate motion 
and exhibit active behaviors [1, 2]. Examples are muscle contractions, cell loco- 
motion on substrates and cell division. Cells constantly explore their environment 
by generating small protrusions, such as so-called lamellipods, which are sheet-like 
structures that emerge from the edges of a cell. They grow and shrink and lead 
to constantly readjusted cell shapes. Such structures are built around filaments of 
the cytoskeleton which form networks and gels. Their complex dynamic behaviors 
are the result of an interplay of a large number of enzymes which interact with 
cytoskeletal filaments. 

The cytoskeleton is the prototype system for the study of force and motion 
generation in cells. The dynamical properties of the cytoskeleton and its associated 
proteins are governed by phenomena on different scales. On the molecular level, 
highly specialized motor proteins consume chemical energy of ATP and are able 
to generate motion and forces [3, 4, 5]. These forces are stochastic in nature. On 
larger scales, where the cytoskeleton forms complex structures, dynamical behav- 
iors emerge from an interplay of many active processes on the molecular scale. 
Certain types of such behaviors can be described as arising via self-organization 
phenomena. This can be illustrated by focusing on simple examples, where com- 
plex patterns in space and time can be understood to be generated by molecular 
motors interacting with cytoskeletal filaments [6, 7, 8]. 

A simple situation arises if many aligned filaments form a bundle. Such a 
bundle can have active properties if motor molecules or aggregates of motors form 
mobile cross linkers [9, 10, 11]. These motors slide filament pairs with respect to 
each other which introduces a rich dynamics in the system and active mechanic 
properties. With the presence of motors, the filament bundle has been transformed 
in a nonlinear dynamic system which undergoes bifurcations and dynamic insta- 
bilities. Dynamic systems have interesting properties in the vicinity of dynamic 
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(b) 




Figure 1: Schematic representation of molecular motors and track filaments, (a) 
Many myosin molecules interacting with an actin filament, (b) Kinesin moving 
along a microtubule. Both types of filaments are polar and periodic, their two 
different ends are denoted ‘‘plus” and “minus” . 



instabilities which cells could use to achieve particular goals. An important exam- 
ple are spontaneous oscillations which can be used to detect and amplify periodic 
signals [12, 13]. Mechanosensory cells of the ears of vertebrates exhibit complex 
cytoskeletal structures which form bundles of rod- like stereocilia [14]. These hair 
bundles have been shown to generate spontaneous mechanical oscillations [15, 16]. 
In the subsequent sections, we discuss physical approaches to understand basic 
features of such active phenomena in cells. 

2 Motor proteins: single motors and collective eflfects 

A motor protein of the cytoskeleton interacts specifically with a certain type of fil- 
ament along which it is able to move in presence of Adenosinetriphosphate (ATP) 
which is a chemical fuel [3]. The filaments serve as guides or tracks for the motion. 
Two types of filaments play this role: microtubules and actin filaments. Both are 
formed by a polymerization process from identical monomers (actin and tubulin 
monomers, respectively), leading to a regular and periodic structure. An impor- 
tant feature is their polarity: The filaments are asymmetric with respect to their 
two ends. This symmetry has its origin in the asymmetry of the monomers which 
form a polar filament structure with two different ends which are denoted “plus 
end” and “minus end”. This polar symmetry is essential for motor operation as 
it defines the direction of motion. A given motor molecules moves in a particu- 
lar direction along a filament. Motor proteins are classified into several families: 
myosins, kinesins and dyneins. Myosins move always along actin filaments while 
kinesins and dyneins move along microtubules, see Fig. 1. The energy source for 
motion generation is the hydrolysis of ATP. The motor protein (or more precisely, 
the head domain containing the ATP binding site) undergoes a chemical cycle. 
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It binds ATP, hydrolyzes the bound ATP and releases the products ADP and P 
(phosphate). After completion of the cycle the motor is unchanged. The different 
conformations which occur during the chemical cycle have different structures and 
can in particular have different interaction characteristics with respect to the fila- 
ment. As a result, the motor protein undergoes chemistry-driven changes between 
strongly and more weakly bound states ( “attachments” and “detachments” ) . This 
coupling between chemistry and binding permits the creation of motion along a 
polar filament [17, 18, 4, 5]. 

The main features and principles of this mechanochemical coupling can be 
captured by a simple two-state model where we assume that two chemical confor- 
mations, an attached state 1 and a weakly bound state 2 are characterized by two 
energy landscapes [19, 20, 21, 22]. These energy landscapes are periodic potentials 
which have broken symmetry. This broken symmetry reflects the polarity of the 
track filaments. Transitions between the states represent conformational changes 
driven by the ATP hydrolysis cycle. Since the system operates far from equilib- 
rium, it attains a steady state with nonvanishing average velocity v and mechanical 
properties characterized by the relationship between v and an externally applied 
force /ext- If the periodic potentials are symmetric, no net motion can occur by 
symmetry in the absence of applied external forces. 

In many interesting situations, a large number of motor molecules operates 
collectively. Generalizing the simple two state model to situations where large num- 
bers of motors are coupled rigidly, we have predicted the possibility of dynamic 
instabilities in the force- velocity curve [23, 24]. An interesting possibility is that 
even if the potentials are symmetric, motion can occur via spontaneous symme- 
try breaking. In this case, two moving states with opposite velocities coexist. In 
practice, this dynamic symmetry breaking transition is concealed by noise. If the 
number of collectively operating motors is finite, they generate fluctuations which 
induce transitions between the two oppositely moving states. As a consequence, 
the system exhibits bidirectional motion where it moves a certain time in one 
direction before it switches stochastically to motion in the opposite direction. 

Recently, such bidirectional motion has been observed in so-called motility 
assays [25]. Motor molecules are attached to a substrate at high density and drive 
the motion of microtubules which adhere to the motor-coated substrate. Usually, 
the generated motion occurs with one particular end of the microtubule in front. 
For a particular mutant of a kinesin motor, the observed behavior was significantly 
different. Microtubules switched their direction of motion after times of several 
seconds until up to a minute and motion was bidirectional. At the same time, 
individual motors of this type were not able to generate motion. This suggests 
that the individual motors have lost their directionality because of their mutation. 
At the same time, collections of these motors are still able to generate motion via 
a symmetry breaking dynamic instability. We have shown that this interpretation 
provides a natural explanation of the experimental observations [26]. Interesting 
collective effects of many motors can also occur due to crowding of many motors 
on a filament along which they advance [27, 28] 
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3 Self-organization of motors and filaments 

If motors form small aggregates, they can interact with two or more filaments at the 
same time. Such aggregates then play the role of cross-linkers of a filament network. 
Since a motor can move along a given filament, these cross-links are mobile and 
the resulting polymer network intrinsically dynamic [29, 30, 31]. Experimentally, 
such filament systems in the presence of mobile cross linkers can be studied using 
artificial constructs linking motors together. It has been shown that such systems 
self-organize and generate spatiotemporal patterns such as asters and vortices 
[7, 8, 32, 33]. 

A case of particular importance is the situation where filaments are aligned 
and form a bundle [9, 10, 11]. The dynamic and mechanic properties of active bun- 
dles can be discussed in very simplified physical models. Using a one-dimensional 
description of the bundle, we denote by c+(x) and c~(x) the densities of filaments 
pointing their plus ends to the right and left, respectively. Assuming that mobile 
cross-links induce interactions predominantly of filament pairs, we can write non- 
linear dynamic equations for the filament densities. In this description, filament 
currents are resulting from active behaviors of motors, which induce the relative 
sliding of filament pairs. 

The interaction terms which induce this sliding can be obtained from sym- 
metry arguments. We have to distinguish between two types of interactions. In- 
teractions between filaments of equal orientation, characterized by an interaction 
strength a and interactions between filaments of opposite orientation with strength 
j3. We find that the interaction between equally oriented filaments generates in 
general a contractile tension and induces the shortening of a bundle. For suffi- 
ciently large a, a homogeneous density profile becomes unstable and contracts to 
become a localized distribution. If furthermore the interaction between oppositely 
oriented filaments is acting, described by a finite value of j3, a dynamic instability 
still occurs. However, the homogeneous state now becomes unstable with respect 
to propagating modes if periodic boundary conditions are imposed. These solitary 
waves can upon further increase of a become again unstable and oscillating waves 
appear. 

Even though we start from a simplified description based on basic rules of 
filament sliding, we obtain a rich scenario of bifurcations and dynamic transitions. 
We can incorporate in this description the mechanical properties of a bundle using 
momentum balance and balances of internal forces. We show that in a homogeneous 
density profile, the mechanical tension 

E = ar?£3((c+)2 + (c")2) , (1) 

is generated by the interaction of parallel filaments of strength a. Here, ry is a 
friction coefficient and i the filament length. This tension is partly relaxed when the 
density profile becomes unstable and a localized filament distribution is generated. 

Self-organization of motors and filaments can also be studied in other ge- 
ometries. Cilia and flagella are hair-like appendages of many cells which contain 
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microtubules arranged in a cylindrical geometry together with a large number of 
dynein motors [1]. These structures called axonemes are able to generate bending 
waves and periodic motion which are used by sperm and some microorganisms to 
swim. Such systems can be described as elastic filaments which undergo bending 
deformations as a result of internal forces generated by motors. We find that such 
systems naturally undergo an oscillating instability or Hopf bifurcation [34, 35]. In 
the oscillating state, we can calculate propagating bending waves which can lead 
to self- propulsion of the beating filaments. 

4 Nonlinear amplification by auditory hair cells 

In the previous sections, we have argued that cytoskeletal structures which contain 
many filaments and molecular motors exhibit rich dynamic behaviors and can un- 
dergo dynamic instabilities. Such phenomena are also relevant for mechanosensory 
cells of the inner ears of vertebrates where oscillatory instabilities and spontaneous 
oscillations play an important role [36, 12, 13]. These are highly specialized cells 
which possess at their surface a bundle of rod-like structures mainly formed by 
densely packed act in filaments. This hair bundle is a mechano-electrical transducer, 
capable of detecting bundle deflections of a few nm and generating an electrical 
membrane potential in response [14]. 

It has been shown recently that hair bundles in frog ears exhibit spontaneous 
oscillatory motion due to an active process in the hair bundle [15, 37, 38, 39]. It has 
been suggested that these spontaneous movements are the signature of an active 
process which plays the role of an amplifier of mechanical stimuli. Indeed, the 
sensitivity of the oscillating hair bundle increases for small stimuli if the stimulus 
frequency is in the vicinity of the spontaneous frequency of oscillation of the cell 

[37]. 

This signal amplification can be understood if we assume that the hair bundle 
profits from the nonlinear properties of an oscillator in the vicinity of a Hopf 
bifurcation [12, 13]. If the system is at the verge of oscillating, it responds to 
periodic stimuli of amplitude / with a deformation amplitude x that obeys a 
power law 

X ~ /1/3 (2) 

if the stimulus frequency is close to the oscillation frequency. Consequently the 
sensitivity x// ~ becomes large for small stimulus amplitudes. This non- 

linear response is generic for critical oscillators near the oscillation frequency. An 
open question is how dynamic oscillators in the ear achieve proximity to the criti- 
cal point. We have proposed that a general self-regulation mechanism which poses 
the system close to the critical point by a feedback control [12]. 

There exists evidence that all vertebrate ears use active systems for signal 
amplification. Most important is the observation of spontaneous sound emissions 
which can be recorded from the ears of many different species [40, 41]. How these 
oscillations are generated in the ears of different animals and wether different 
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species use different physical mechanisms to generate oscillations is still a matter 
of debate. The fact the ear uses active amplifiers was indeed foreseen by Thomas 
Gold [42] who remarked that the viscous damping in the inner ear would not allow 
the exquisite capabilities of the ear if it would rely on passive resonance only. 

5 Discussion 

The cytoskeleton represents an soft biological material which is inherently dynamic 
due to the presence of active proteins such as molecular motors and also due 
to the polymerization and depolymerization of filaments. Such active materials 
are able to generate motion and forces. General physical principles which govern 
such systems can be addressed by studying simplified situations where a small 
number of components is taken into account. In general, one finds that inert or 
immobile states become unstable and dynamic instabilities occur. In many cases, 
spontaneous oscillations are generated and propagating modes appear. 

Such simplified situations can in principle be studied experimentally in so- 
called in vitro systems where several important components are purified and ob- 
served under controlled conditions outside the cell. An important example are 
motility assays where motors that are attached to a solid substrate set in mo- 
tion filaments that adhere to the substrate [43, 44, 45]. Such in vitro experiments 
together with theoretical studies of simple systems makes it possible to better 
understand the interplay and self-organization of several components in the cell. 

Addressing the dynamic and mechanic behaviors of complete cells remains a 
big challenge. However, the example of hair cell oscillations reveals that general 
concepts governing the behaviors of nonlinear and stochastic systems can prove 
very useful and provide important insights for the study of cellular dynamics. An 
important goal is to understand the dynamic phenomena involved in cell locomo- 
tion. Many cells adhere to solid substrates and are able to grow protrusions at an 
advancing edge while retracting at the back. This phenomenon involves polymer- 
ization and depolymerization of filaments, the action of motor molecules and the 
formation and breakage of adhesion bonds with the substrate [46, 47]. 

I thank M. Badoual, S. Camalet, T. Duke, J.A. Hudspeth, K. Kruse, P. 
Martin, A. Parmeggiani, J. Prost for stimulating collaborations. 
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Soap Froths and Crystal Structures 



Randall D. Kamien 



Abstract. We discuss the physics and mathematics of sphere packing and minimal 
surfaces and use these to explain the crystal symmetries found in macromolecular 
and supramolecular micellar materials and charged colloids. In the case of molecular 
assemblies, we argue that the packing entropy of the hard micellar cores is frus- 
trated by the entropie interaction of their brush-like coronets. The observed crystal 
structures correspond to the Kelvin and Weaire- Phelan minimal foams. We show 
that these structures are stable for reasonable areal entropy densities. 



Over 400 years ago, Kepler conjectured that the densest way to pack equal-sized 
hard spheres was in a face-centered-cubic (FCC) or equivalent hexagonal-close- 
packed (HOP) lattice. Though this conjecture was thought to be correct, it was 
not until recently that a rigorous proof was presented [1] . Because a denser packing 
implies a greater free volume for hard spheres, they will pack in one of these dense 
lattices to maximize entropy or minimize free energy. However, a new class of 
highly monodisperse dendritic polymers [2] have been characterized. Though the 
monodendrons form spherical micelles, these spheres assemble into an A15 lattice 
with Pm3n symmetry. This lattice has the same density as the simple cubic lattice 
and is very far from dense packing. 

We have proposed an explanation of the equilibrium structure by appealing 
to the mathematics of minimal surfaces [3, 4]. We view the dendritic micelle as 
a hard sphere of diameter a with a soft, repulsive corona. We would then expect 
our conclusions to hold for a large class of materials with similar morphology, e.g. 
diblock copolymer micelles [5], nanoparticles decorated with hydrocarbon chains 
[6], and charged colloids [7, 8]. If we consider a sample of dendritic micelles at a 
fixed number density p then in a fixed volume K, there are N = pV spheres and 
so 



Kc 



= V - -a^pV = V 

6 



TTCr^ 



( 1 ) 



is the volume left over the soft coronas. This coronal volume shrouds the hard 
spheres forming a thick surface of area A and thickness r. Since 



At - Pec 



( 2 ) 



is fixed, it follows that to minimize the repulsive potential or maximize r, A should 
be minimized. The area per micelle depends on the shape of the Voronoi cell and 
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scales as the volume — p 

( 3 ) 

( 4 ) 






A 

N 



1 

r 






P 

IP 



7T(7 

“ 6 “ 



This last equality allows us to write the separation r in terms of the density and 
the geometric quantity 7. We see that to maximize r at fixed density we should 
minimize 7. 

The problem of minimizing the total area of equal volume cells that fill space 
was first proposed by Kelvin [9]. He conjectured that the body-centered-cubic 
(BCC) lattice of orthic tetrakaidecahedra (or the more modern orthic decatetra- 
hedra [10]) minimized the area of the cells. However, in 1994, Weaire and Phelan 
[11] discovered that the A15 lattice, composed of equal volume Goldberg decate- 
trahedra and dodecahedra, had a still smaller surface area. As of this writing, it 
is not known whether there is a structure with a yet smaller area. However, based 
on this discovery, we see that the repulsive energy of the coronas is smaller for the 
A15 lattice than the FCC or BCC lattice. Quantitatively we have 7^*^^ = 5.345, 
^Bcc _ 5 3 Qg^ ^ai5 _ 5 288, while the volume fractions of close-packed FCC, 
BCC and A15 lattices are pFcc = 0.74, pbcc = 0.68 and pAis = 0.52. Therefore 
there is an intrinsic frustration between maximizing the free volume and minimiz- 
ing the surface area or, in other words, between the entropy of the hard cores and 
the interactions of the coronas. 

We have found that for physical values of the interaction parameters, the 
A15 lattice can be more stable than either the BCC or FCC lattice [3, 4] and 
that the BCC lattice is more stable than FCC for charged colloids at low salt 
concentrations [8]. This trend has been born out by experiment [5, 6, 7] and by 
simulation [4, 10]. 

Future work will focus on the relative stability of non-cubic lattices [12]. 
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Sliding Phases: From DNA- lipid Complexes 
to Smectic Metals 

T.C. Lubensky 



Abstract. Materials consisting of a stack of “two-dimensional” layers that exhibit 
2D power-law order in the absence of coupling between layers include systems as 
diverse as layered magnets, DNA-lipid complexes, and coupled Luttinger liquids. 
It is common belief that these systems exhibit ’’three-dimensional” order whenever 
there is a nonvanishing interlayer coupling. However, with appropriately chosen 
interlayer gradient couplings between layers, they can exhibit unusual equilibrium 
“Sliding Phases” characterized by 2D-like power-law correlations. After reviewing 
the various types of sliding phases that can occur, this paper will discuss in detail the 
properties of the simplest sliding phases, the sliding xy phases in three-dimensional 
stacks of planar xy models. 



There are many materials in nature consisting of a stack of two-dimensional planes, 
within which order that breaks a continuous x^-like symmetry can develop. Exam- 
ples include layered magnets with in-plane spin order [Fig. 1(a)], layered supercon- 
ductors [1], DNA-lipid complexes in which strands of DNA intercalated between 
lipid bilayers form a periodic stripe or smectic pattern [2, 3, 4] [Fig. 1(b)], and 
lyotropic lamellar phases consisting of stacks of bilayer membranes punctured by 
transmembrane proteins that can form hexagonal lattices [5] [Fig. 1(c)]. If there 
were no coupling between layers in these stacks, then each layer would be an iso- 
lated two-dimensional system with power-law destruction of long-range order in the 
low-temperature phase and a Kosterlitz-Thouless transition to a high-temperature 
disordered phase. If the coupling is strong, then these systems are true three- 
dimensional systems with true long-range order in the low-temperature phase and 
transitions to a high-temperature disordered phase in a universality class deter- 
mined by the symmetry of the 3D order parameter. What happens when there 
is weak coupling between layers? Does arbitrarily weak coupling drive the system 
immediately to three-dimensional behavior at the longest length scales? Or is it 
possible for there to be a regime of non-zero couplings in which the system contin- 
ues to behave like a two-dimensional system in spite of interlayer couplings. In this 
talk I will review the case for the existence, at least in theory, of phases, called 
sliding phases [6, 7], in three-dimensional stacks of two-dimensional layers with 
2D-like power-law inplane order in spite of the presence of appropriate interlayer 
coupling. 

Similar questions arise in arrays of one-dimensional wires. Coulomb interac- 
tions in isolated one-dimensional wires lead to a destruction of the Fermi surface 
and to Luttinger liquids [8] that can be described by quantum xy-models in two- 
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Figure 1: more to come 



dimensional space time. Examples of systems that can be described as coupled 
Luttinger liquids include the proposed stripe phases in high-Tc superconductors 
[9, 10] and in Quantum Hall fluids [11, 12]. What happens when tunneling in- 
teractions are turned on between neighboring wires in a two-dimensional array 
of parallel wires? Does this system become a two-dimensional metal as soon as 
interwire interactions are turned on? Or is it possible that the unusual behavior of 
one-dimensional wires (with such exotic properties as spin-charge separation) sur- 
vive over some range of couplings? Again, within the context of theoretical models, 
sliding phases of arrays of coupled arrays Luttinger liquids with power-law correla- 
tions characteristic of one-dimensional wires, can exist with appropriate coupling 
between wires [13, 14]. 

The traditional wisdom is that three-dimensional stacks of two-dimensional 
classical systems or two-dimensional arrays of one-dimensional quantum systems, 
respectively, become true three- and two-dimensional systems as soon as inter- 
actions are turned on. The basic argument for this behavior is best seen in the 
context of a model system consisting of a stack of xy-mode\s whose dynamical 
variables are angles ^n(r) at the two-dimensional coordinate r in layer n. Isolated 
layers are characterized by power-law decay of spin correlation functions at low 
temperature and a Kosterlitz Thouless transition to a disordered phase at temper- 
ature TkT’ Interactions between neighboring layers are described by a Josephson 
coupling of the form —V f d^rcos[^„(r) -^^^i(r)]. To lowest order in V, this cou- 
pling evaluated in the low-temperature phase of the uncoupled planes scales to 
zero with system size and is irrelevant for temperatures greater than a decoupling 
temperature Td; for temperatures less Td, it is relevant, and the system flows to 
true three-dimensional behavior. Thus, for Td less than Tkt, coupling between 
layers is irrelevant, and in-plane power-law order persists when Td < T < Tkt- 
On the other hand, for Td > Tkt, the individual planes melt before the Josephson 
coupling becomes irrelevant, and the system is either ordered three-dimensionally 
at low temperature, or it is disordered at high temperature. If the only coupling 
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between layers is a Josephson coupling, then is always greater than Tkt and 
the system behaves like a true three-dimensional system in spite of the strong 
anisotropy imposed by the existence of layers. It is only when additional non- 
Josephson couplings between layers of the form ^ i^nm / d^rV^n(r) • V^rn(r) 
are added that the existence of sliding phases becomes possible. Figure 2 contrasts 
phase sequences in typical three dimensional systems with those of systems that 
can exhibit sliding phases. 

The Hamiltonian for such a system consists of (1) a sum of independent 
2D-xy, Hamiltonia 

^0 = y X^ydV[V±6l„(r)]^ (1) 



and interlayer couplings 



UjlSn] = -K/[s„] j cfr COS 



^ ^ Sp^n+pi^) 1 

. V 



( 2 ) 



where Sn is an integer- valued function of layer number n satisfying = 0 if 

there are no external fields inducing long-range order. If all Vj[sn] are zero, then 
in the low-temperature phase (^^(r))o = ? 7 log(L/^>) and cos[^n(r) — ^n(O)] ^ i 
where 



T] = 



T 

2 ^’ 



(3) 



L is the sample width, 6 is a short-distance cutoff in the x-y plane, and (.)o refers to 
an average with respect to Tio. The averages of the Josephson Hamiltonians with 
respect to Ho scale as {Hj[sn]} ~ where r][sn] = 2 Sp Clearly, the 

most relevant Josephson coupling is the one with the smallest value of r/[s^], which 
results when Sn is non-zero on the smallest number of planes. Since = 0, the 

smallest value of 77 ( 0 ^] is obtained for couplings between two layers separated by 
p layers with Sn = = ôn^o ~ For these two-layer couplings, =- p[s^] - p 

for all values of p. Thus, the decoupling temperature above which all Josephson 
couplings are irrelevant is determined by r; = 2 or = 4:TtK. The 2D KT transition 
for decoupled layers is Tkt — 7rKf2; this implies Tkt /T a = 1/8 < 1, and there is 
no decoupled phase with power-law correlations. 

Josephson couplings are not, however, the only ones permitted by symmetry. 
Gradients of On in different layers may also be coupled. The Hamiltonian for the 
ideal sliding phase is Hs = Ho T Hg, where 



Hg 




^ [Vx {9n+m{r) - 0„(r))]' . 



(4) 



This Hamiltonian is invariant with respect to ^n(r) ^ for any angle ipn 

depending on n but not r, i.e., the energy is unchanged when angles in different 
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Figure 2: (a) Phase sequences in three-dimensional systems with low-temperature 
ordered and high-temperature disordered phases. In the xy model, there is a single 
transition with a divergent specific heat from a low-temperature xy ferromagnet 
to a high-temperature paramagnet. In a hexatic smectic liquid crystal, the low- 
temperature phase is a true crystalline solid. At intermediate temperatures, there 
is a hexatic phase [15] with long-range hexatic order. Finally at high temperatures, 
there is a simple smectic phase with no inplane order. In lamellar columnar systems 
such the DNA-lipid complexes, the low-temperature phase is a columnar crystal 
with the DNA stands forming a true two-dimensional lattice in the plane perpen- 
dicular to the layers. The columnar crystal can melt to a lamellar nematic in which 
the DNA columns lose their long-range positional order but retain their directional 
order. Finally at the highest temperature, the DNA strands loose all order, leaving 
only the lamellar smectic-A phase. The final figure shows the phase sequence of 
coupled spin-gap Luttinger liquids at zero temperature from a superconducting 
to a CDW phase, (b) Phase sequences in layered systems with sliding phases. In 
the xy model, a sliding phase with power-law spin ordering intervenes between 
the ferromagnetic phase and the paramagnetic phase. The transitions from both 
the ferromagnetic phase to the sliding phase and from the sliding phase to the 
paramagnetic phase are of the Kosterlitz-Thouless type with no visible anomalies 
in the specific heat. In the hexatic system a sliding crystal with power-law posi- 
tional order and a sliding hexatic phase with power-law orientational order can 
intervene, respectively, between the crystal and hexatic phases and between the 
hexatic and smectic-A phases. Similarly in the columnar phase, sliding columnar 
and sliding nematic phases appear; and in the Luttinger liquid system, a sliding 
Luttinger liquid or smectic metal phase intervenes between the superconducting 
and CDW phases. 
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layers slide relative to one another by arbitrary amounts. The sliding Hamiltonian 
can be written as 



Hs = ^ ^ /rfV • Vx^„'(r), (5) 

nn' 

where Knn> = Kn-n', with Kq = K 2 Um and Kn = -Un for n 7^ 0 . Spin 
correlation functions with respect to Hs are easily calculated: 

G{r,p) = (cos[6l„+p(r) - 6»„(r)]) = | ^ ^ ® (6) 

[(r/b’) p = 0, 

where 

vip) = 

'J(P) = (7) 

with 

j dk cos kp 

^ = io 

K{k) = (8) 

P 

Thus, we see that if system is described by the sliding phase Hamiltonian lïc with 
no Josephson couplings, it has the following properties: (1) Correlations within 
the same layer die off with a power-law with separation as they do in a true two- 
dimensional system, (2) Correlations between layers die to zero with a power of 
the systems size, and (3) the exponents controlling the power-laws are determined 
by the full functional form of Kp. 

To determine is the sliding phase can be stable, we must (1) determine at 
what temperature the Josephson couplings first become relevant and (2) calcu- 
late the KT transition temperature of the sliding phase, which now also depends 
in general on the interlayer couplings Kp for p ^ 0. Calculation of the decou- 
pling temperatures is straightforward. The expectation of the general Josephson 
coupling [Eq. (2)] relative to the sliding phase is 

(9) 

'J' ^ 

V[Sn] = ^ SnSmK~]_ 

n^m 



where 



( 10 ) 
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The decoupling temperature for the particular case of simple Josephson couplings 
between layers n and p of the form Hp = —Jp f cos[^n+p(r) — ^n(r)] is 



t4p) = 



47T 



( 11 ) 



If we restrict ourselves to couplings of the form of Hp, the decoupling temperature 
is then, 

Ta = max[Td(p)]. (12) 

P 

Recall that the sliding phase will be stable if Td < Tkt- The value of Td can be 
adjusted by changing the gradient interactions Kp between layers. Thus, the goal 
will be to find a set of couplings that decreases Td while not significantly altering 
Tkt- 

As in the true two-dimensional power-law xy phase, vortices proliferate and 
eventually destroy the sliding phase. Vortices in sliding phases are more complex 
than those of pure 2D systems. There can be composite vortices that extend across 
several layers. For example, two vortices in adjacent layers could form a bound state 
with energy less than the sum of the energies of two independent vortices in the 
two layers. The entropy of bound vortices like that of single-layer vortices, grows 
as the logarithm of the system size. The transition to the disordered phase occurs 
when the energy of the lo west-energy vortex becomes less that T times the entropy 
of the vortex. Thus, we need to calculate the energy of all composite vortices. Let 
kni be the strength of a vortex at position r^z is layer n, and let kni be 

the total vortex strength in layer /. Then standard procedures yield 

Ev=nJ2K„ 

— m^n In^T^&j 7T Di.'f-i — Tn^nl^TTi ^ ttI 

n,m 



for the energy of all vortices, where F{r) is function that tends to zero as r ^ 0 
and is proportional to Inr as r — > oc. 

The vortex energy Ey and Boltzmann statistics imply that the number of 
times a given configuration of vortices occurs in the system scales with system size 
as L‘^-'nKT[crn] ^ where 

riKT[(Pn] = fEf En-n’O-nCrn' , (14) 

n,n' 



and the factor of counts the number of places in the 2D plane the configuration 
can be placed. Clearly, if r}[(Jn] < 2, the particular vortex configuration {(Jn) will 
proliferate. The “Kosterlitz-Thouless unbinding” for {cTn} therefore occurs at a 
temperature 



TKT[^n\ — ^ ^ 



(15) 



If there is only one vortex in layer 0, then an = cf^ — ^n,o and Tkt^^u] — 7tA"o/ 2. If 
there is a -|-1 vortex in layer zero and a ±1 vortex in layer p, an = a'^ = (5^,0 
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and Txrlo'n'^] = 7r{Ko±Kp). Note that when Kp is nonzero TiCT[cr^^] is not twice 
TKr[o'n]’ possible for TktIc^u] to be less than T/ct[ct^] for one or more 

configurations {cTn}- The interactions between layers lead to composite multi-layer 
vortices that cost less energy to create than a single vortex in an individual layer. 
Unbinding of bound pairs of any set of individual- layer or composite vortices will 
destroy the rigidity within those layers. Thus, the transition temperature to the 
disordered state is 

Tkt = (Ifi) 



For the models, we consider it turns out that the minimum T/crl^n] occurs for 
composite vortices formed by binding two +1 vortices in neighboring layers to- 
gether for which an — Sn^p + <^n,p+i for some p. 

As discussed earlier, the sliding phase exists provided all Josephson interac- 
tions between layers become irrelevant before the layers Kosterlitz-Thouless melt, 
i.e., provided 



TkT _ Tkt[<^Ti] ^ ^ 

Td maxpTd{p) 



(17) 



The goal then is to find a set of potential Kn that minimize Td while not decreas- 
ing Tkt too much. From Eq. ( 11 ) for Td{p), it is clear that to minimize Td, we 
need to maximize which in turn implies choosing potentials Kn that 

produce minima in the Fourier transform K{k) at appropriate wavenumbers k. 
For simplicity, we consider a model with only nearest-neighbor and next-nearest 
neighbor couplings for which K{k) can be written as 



K{k) = K[1 7 i (1 — cos /c) + 72 (1 — cos 2k )] , (18) 

and we choose 71 and 72 such that K{k) has a minimum with value KA at k = ko. 
Minima in 1/Td{p) occur when minima in K{k) occur at ko = 27r{p/l) for integers 
p and 1. The value of T^^ varies with ko; it has minima at ko = 27r{p/l) and grows 
as [ko — 27 t{p/1)\^ away from these minima. Figure 3 plots /3 as a function of ko for 
A = 10“^ and clearly shows that there are regions in which j3 > 1, and the sliding 
phase is stable with respect to Josephson couplings between any two layers. 

The analysis just presented for sliding xy- models applies with minor modifi- 
cation to sliding columnar phases in DNA lipid complexes arrays of coupled Lut- 
tinger liquids. In sliding columnar phase, each layer is treated as a two-dimensional 
smectic with a mass-density- wave phase variable Un(r) replacing the angle variable 
On{r). The fact that a smectic breaks both translational and rotational symmetry 
leads to some complications not encountered in the xy model, but the basic con- 
clusions about the existence of the sliding phase and the existence of power-law 
correlations remain. 

Luttinger liquids are conveniently described in terms of a bosonized action 
with dual phase variables 9n{^,t) and (f)n(x,t) on wire n describing, respectively, 
charge and superconducting fluctuations. The two-dimensional variable r is re- 
placed by the two-dimensional space-time variable (x, t) of a one-dimensional wire. 
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kg/n 



Figure 3: P = Tkt/T (1 is plotted versus Uq/'k. Local minima near k^/n = 21 /p 
are labeled by (/,p). Other possible integer pairs either do not fall in the range 
0.24 < ko /it < 0.40 or yield larger values of p than those shown above [7] 



The analog of the sliding phase of the stack classical xy-models in a 2D array of 
coupled Luttinger liquids in the spin-gap phase is a smectic-metal phase [10] char- 
acterize by superconductivity with vanishing resistivity pxx along the wires and 
vanishing conductivity cjyy perpendicular to the wires [13]. At finite temperature, 
these quantities vanish as a power- law in T : 

where Acdw > 2 and 2Asw > 3. Figure 4 shows a phase diagram for a 2D array 
of parallel Luttinger wires as a function of the wavenumber k of the analog of the 
coupling K{k) of the xy stripe phases, and a coupling constant K along the wires. 

Crossed arrays of Luttinger liquids can also have sliding phases with an 
isotropic conductivity that diverges as a power law with temperature [16]. In addi- 
tion, arrays of Luttinger liquids in an external field provide an interesting approach 
to the hierarchy of fractional quantum Hall states [14]. 

In this talk I have reviewed the properties of sliding phases that can in prin- 
cipal exist in systems as diverse as DNA-lipid complexes and quantum Hall fluids. 
Classical systems that can exhibit sliding phases consist of stacks of coupled two- 
dimensional layers with x^-like order; they exhibit power-law correlations char- 
acteristic of two-dimensional systems even though layers are coupled in three- 
dimensions. Quantum systems that can exhibit sliding phases consist of arrays 
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Figure 4: Phase diagram, showing superconducting (SC), Fermi Liquid (FL), 
charge-density- wave (CDW), and Smectic metal or striped phases, for an array 
of parallel Luttinger-liquid wires as a function of coupling constant K along the 
wires, and the wavenumber go of fho interwire coupling [13] 



of coupled one- dimensional wires; they exhibit power- law behavior in space and 
time characteristic of one-dimensional systems. I showed that sliding phases can 
be stable with respect to a restricted class of interlayer (or interwire) couplings 
that couple only two layers. The region of stability of the sliding phase is is re- 
duced when multi-layer couplings are included. The stability of sliding phases with 
respect to all multi-layer couplings remains to be investigated. 

This work was supported in part by grants from the US National Science 
Foundation under grant Nos. DMR 97-30405 and DMROO-96531. 
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Abstract. The implications of the chaotic evolution of the Solar System are briefly 
reviewed, both for the orbital and rotational motion of the planets. In particular. 
Why Venus spins backward ? can be now understood while considering the transi- 
tion through a highly chaotic state during its history; chaotic state that the Earth 
itself would have experienced in absence of the Moon, while the large variations of 
Mars’ obliquity were probably at the origin of considerable climate variations that 
may have left some geological traces on its surface. 

The limits of predictability for a precise solution of the planetary orbits is an 
obstruction to the use of the astronomical insolation computations as an absolute 
geological time scale through paleoclimates reconstructions beyond a few tens of 
millions of years. On the opposite, as the paleoclimate geological records increase 
in duration and quality, they may provide an ultimate constraint for the dynamical 
model of the Solar system. 



1 The chaotic motion of the Solar System 

The orbital motion of the planets in the Solar System is chaotic. After Pluto 
(Sussman and Wisdom, 1988), the evidence that the motion of the whole Solar 
system is chaotic was established with the averaged equations of motion (Laskar, 
1989, 1990), and confirmed later on by direct numerical integration (Sussman 
and Wisdom, 1992). For a review, see (Laskar, 1996), and for some historical 
considerations (Laskar, 1992a). The most immediate expression of this chaotic 
behavior is the exponential divergence of trajectories with close initial conditions. 
Indeed, the distance of two planetary solutions, starting in the phase space with 
a distance d(0) = do, evolves approximately as d{T) ^ doe^^^ or in a way which 
is even closer to the true value. 



d{T) « 



( 1 ) 



where T is expressed in million of years. Thus, an initial error of leads to 

an indeterminacy of « 10“® after 10 millions of years, but reaches the order of 
1 after 100 millions of years. When specific planets are considered, these results 
may vary, as the coupling between some of the planets is small, and the outer 
planets (Jupiter, Saturn, Uranus, Neptune) are more regular than the inner ones 
(Mercury, Venus, Earth, Mars). For the Earth the above formula is a very good 
approximation of the divergence of the orbits (Fig.l) (Laskar, 1999a). 
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Figure 1: Error in the eccentricity of the Earth resulting from an initial change of 
10“^^ radian in the perihelion of the Earth at the origin. After about n x 10 millions 
of years, the exponential divergence of the orbits dominates, and the solutions are 
no longer valid. Error in eccentricity is plotted versus time (in Ma). (Laskar, 1999) 



1.1 Secular resonances 

In (Laskar, 1990, 1992b), it was demonstrated that this chaotic behavior arise from 
multiple secular resonances in the inner solar system. In particular, the critical 
argument associated to 

(S4 - 53 ) - 2{g4 - gs) (2) 

where ^ 3,^4 are related to the precession of the perihelion of the Earth and Mars, 
53 S 4 are related to the precession of the node of the same planets, is presently in in 
a librational state, but can evolve in a rotational state, and even move to libration 
in a new resonance, namely 



(S 4 - S 3 ) - (P4 -g^) = 0 , 



( 3 ) 
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showing that a region of relatively strong instability extend over these two res- 
onances. This result was questioned by Sussman and Wisdom (1992) which did 
not recover such a large variation for the secular frequencies, but reached only 
2(54 — S3) — 3(5^4 — g-i) = 0 instead of (3). Recently, in our group, we performed 
precise new numerical integrations of the complete equations of motion of the Solar 
System, including all 9 main planets, the Moon as a separate object. Earth and so- 
lar oblateness, tidal dissipation in the Earth Moon system, and the effect of general 
relativity. This new solution was also adjusted and compared to the JPL ephemeris 
DE406 (Standish, 1998). Different numerical integrations were conducted, with in- 
clusion or not of the oblateness of the Sun ( J2 = 10“^) (num2002 and num2001 in 
Fig. 2). The secular equations were also slightly readjusted and integrated over the 
same time (sec2001 in Fig. 2). In all cases, the resonant argument corresponding 
to (s4 — S3) — 2(p4 — ps) was in libration over 30 Myr. The transition to circulation 
occurred at 30 Myr and about 40 Myr for sec2001 and num2002, while no transi- 
tion was observed for num2001 (over 100 Myr). In both cases when the transition 
occurs, the orbit reaches the resonance (54 — S3) — (^4 — ^3) = 0 before 100 Myr, 
comforting the conclusions of (Laskar, 1990, 1992b). The other important resonant 
argument ((^1 — g^) — (si — S 2 )) that is at the origin of the chaotic behavior of the 
inner planets (Laskar, 1990, 1992) presents a similar behavior. 

The chaotic motion of the solar system is mainly due to the interactions 
of the secular resonances in the precessing motion of the inner planets, while the 
secular motion of the outer planets is very regular. In order to evaluate the possible 
chaotic diffusion of the planetary orbits over the age of the Solar system, numerous 
numerical integrations have been conducted over several billions of years (Gyr), 
extending even the age of the Solar system. From these integrations, it appears 
that Venus and the Earth present some moderate chaotic diffusion, while the 
lightest planets. Mars and Mercury, can experience very large changes in their 
orbit eccentricity, allowing even for collision between Mercury and Venus in less 
than 5 Gyr (Laskar, 1994). From these integrations, it appears that the chaotic 
diffusion of the orbits in the earlier stages of the solar system formation could 
provide some clues for the planetary distribution in semi major axis (Laskar, 1997, 
2000). 

1.2 The outer planets 

The secular motion of the outer planets is very regular. Nevertheless, there exists 
also some instabilities among these planets, due to resonances of high order in 
their orbital motion around the Sun (mean motion), although these instabilities 
do not lead to significative changes in the orbits (Murray and Holman, 1999). 

A global view of the short period dynamics of the outer planets is given in 
figure 3 using frequency analysis (Laskar, 1999b, Laskar and Robutel, 2001, Robu- 
tel, 2003). In each plot, three of the planets initial conditions are fixed, and the 
initial conditions of the remaining one ( respectively Saturn, Uranus, or Neptune) 
are changed in a large area of the phase space, spanning semi- major axis and 
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Figure 2: Evolution of he critical angle related to 2(p4 — gs) — {s 4 — S3) (top 
and middle) and (^4 — gs) — (54 — S3) (bottom) for three different solutions for 
the Solar system over 100 Myr. sec2001 is obtained with an averaged system, 
while num2001 and num2002 are two direct numerical integrations with very close 
dynamical models. 



excentricity in a regular mesh around their current position. In each case, an inte- 
gration is conducted over 1 Myr, and a stability index, obtained as the variation 
of the mean motion frequency with time (Laskar, 1990, Laskar and Robutel, 2001) 
is reported, from very stable (blue) to very unstable (red), while close encounter 
or ejection is denoted by white dots. The resonances are identified and reported 
in black. It is thus very clear that although the motion of the outer planets are 
essentially stable, they are numerous mean motion resonances and small chaotic 
zones that in the vicinity of the present solar system solution (see also Michtchenko 
and Ferraz- Mello, 2001). 
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Figure 3: Global dynamics of the giant planets. The abscissa and ordinate corre- 
spond to the initial semi major axis and eccentricity of respectively Saturn (top), 
Uranus (middle) and Neptune (bottom), while in each case the initial conditions 
of the other planets are taken at their actual values. The current position of each 
planet is given by a circle. The colors correspond to a stability index obtained by 
frequency analysis on two consecutive time interval of 1 Myr, and the black regions 
the mean motion resonances locations (Robutel, 2003). 

2 Obliquity of the planets 

The planetary perturbations induce also some effect on the tilt (obliquity) of their 
equatorial plane over their orbital plane . For the Earth, as the precession frequency 
of the axis is far from the main orbital secular frequencies of the precession motion 
of the ecliptic, the obliquity presents only small variations of about 1.3 degrees 
around the mean value of 23.3 degrees (Laskar et aL, 1993a). In the absence of 
the Moon, the situation would be very different, as multiple resonances then occur 
between the precession of the axis and the precession of the orbital plane, and a 
very large chaotic zone would exist, ranging from 0 to about 85 degrees (Laskar 
et al.^ 1993b). This is also what will happen as the Moon recedes from the Earth 
due to tidal dissipation (Néron de Surgy and Laskar, 1997). This situation is very 
similar for all inner planets (Laskar and Robutel, 1993, Laskar, 1995). 
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Figure 4: Two different solutions for Mars obliquity over 100 Myrs, obtained with 
direct integration of the planetary orbits, and with initial conditions of the planet 
spin within the uncertainty of the most recent determinations (Folkner et al, 1997, 
Laskar et al, 2002) 



2.1 Mars obliquity 

Mars presents a very large chaotic zone for its obliquity ranging from 0 to more 
than 60 degrees (Laskar and Robutel, 1993, see also Tourna and Wisdom, 1993). On 
Figure 4 are plotted two possible solutions for the past evolution of the obliquity 
of Mars, obtained using our latest numerical integration for the orbital motion 
of all the planets, and initial conditions and parameters within the uncertainty 
of the best known values for Mars rotational parameters (Folkner et al, 1997). 
It is clear that the climatic history of the planet would be very different in the 
two situations. It is presently particularly important to understand what are the 
possible past evolutions of the obliquity (and thus climate) of Mars in the past, as 
the recent Martian spacecrafts provide very detailed observations of the Martian 
surface (Malin and Edgett, 2001), and give some accurate account of the weather 
on the planet. In particular, the polar caps of Mars present crevasses that reveal the 
layered features of the ice (Fig. 5). This layers are thought to be induced by climate 
variations on Mars surface, and the succession of layers can indeed be related to 
the insolation variations on its surface (Laskar et a/., 2002), although the possible 
relation from insolation to surface feature on Mars remains very uncertain. 

2.2 Venus final states 

In 1962, using radar measurements, the slow retrograde rotation of Venus was 
discovered. Since, the understanding of this particular state becomes a challenge 
as many uncertainties remain in the dissipative models of Venus’ rotation. Various 
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Figure 5: Mars North polar ice cap, and detailed view of a crevasse, showing the 
layered nature of the ice. The picture was taken by the Mars Orbital Camera 
(MOC) on april 13, 1999 (Malin and Edgett, 2001) 



hypothesis were proposed for its evolution, aiming to search wether Venus was 
born with a direct or retrograde rotation. The most favored scenario assumes that 
its axis was actually tilted down during its past evolution as a result of core mantle 
friction and atmospheric tides (for a detailed review, see Correia et al., 2003 and 
references therein). Nevertheless, this requires high values of the initial obliquity, 
and it was proposed that Venus was strongly hit by massive bodies which would 
have tilted it significantly or started its rotation backward (Dones and Tremaine, 
1993). 

The discovery of a very large chaotic zone in the obliquity evolution of Venus 
allowed new possible scenario for driving Venus to 180 degrees obliquity (Laskar 
and Robutel, 1993, Yoder, 1997), but some difficulties remained. Recently, we have 
shown that due to the dissipative effects, there are only 4 possible final states 
for Venus’ rotation, and only 3 of them are really reachable. When the planetary 
perturbations are added, most of the initial conditions lead to the two states 
corresponding to the present configuration of Venus, one with period —243.02 
days and nearly 0 degree obliquity, and the other with opposite period and nearly 
180 degree obliquity. We thus demonstrate that a large impact is not necessary 
to have a satisfying scenario for the reverse rotation of Venus (Fig. 5) (Correia 
and Laskar, 2001, 2003, Correia et al, 2003), and that most initial conditions can 
lead to the present reverse rotation. On the opposite, we demonstrate that the 
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present observed retrograde spin state of Venus can be attained by two different 
processes that cannot be discriminated by the observation of the present rotational 
state of Venus. In the first scenario, the axis is tilted towards 180 degrees while 
its rotation rate slows down, while in the second one, the axis is driven towards 
0 degree obliquity and the rotation rate decreases, stops, and increases again in 
the reverse direction, being accelerated by the atmospheric tides, until a final 
equilibrium value. 




Figure 6: examples of the possible evolution of Venus spin axis during its history. 
The precession frequency (in arcsec/years) is plotted versus obliquity (in degrees). 
The initial obliquity is 1 degree, and initial period 3 days. The initial precession 
frequency is about 16 arcsec/yr, but due to tidal dissipation and core- mantle fric- 
tion, the planet slows down and the precession frequency decreases. The obliquity 
then enters a zone of larger chaos (in grey) where the variations of the obliquity 
increases, until the dissipations drives the obliquity out of the chaotic zone, for a 
high obliquity. The various dissipative effects can then drive the axis towards 180 
degrees (Correia and Laskar, 2003). 



3 Earth paleoclîmates and chaos 

For Mars, the chaotic variations of the obliquity are large, and certainly induce 
some dramatic effects on its climate. For example, it is probable that at large 
obliquity ( > 45 degrees), the polar caps entirely sublimate, the ice been eventually 
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redisposed in the equatorial regions (Levrard, 2003). For the Earth, the situation 
is very different. The variation of the obliquity is also chaotic, but mainly because 
it is driven by the chaotic orbital motion of the planet that acts as a forcing 
term on the obliquity. The variations of the obliquity and orbital parameters are 
small, but they induce significative changes of the insolation on Earth at a given 
latitude, that are thought to be at the origin of large climatic variations in the 
past (Hays et al, 1976). Even more, the astronomical solution is used to provide 
an absolute time scale for the geological paleoclimate records, over a few tens of 
millions of years, when the effect of the chaotic behavior of the solution is not yet 
sensitive (e.g. Schackleton et al, 1990, 1999). The quality of the geological data is 
now becoming sufficient to allow on the other hand to constrain some geophysical 
parameters of the Earth that are not well known, by comparison of the computed 
astronomical evolution of the insolation with the geological data (Lourens et ai, 
2001, Pâlike and Shackleton, 2001). As the age of good quality geological records, 
obtained over long period of time, is now exceeding 30 to 40 millions of years, it 
becomes interesting to search whether it would be possible to use these geological 
data to trace back the chaotic diffusion of the solar system in the past. 

In order to tackle such a challenging problem it is important, as the quality 
of geological records decreases with age, to search in the solution some features 
that would have an important implication on the observed data. Such factor could 
actually be the resonant argument 



(S 4 - S 3 ) - 2(^4 - 93 ) , (4) 

and its evolution with time. It would be indeed fascinating to be able to retrieve in 
the geological data the transitions from libration to circulation of this argument, 
as well as its eventual transition to the resonance 



(«4 - S 3 ) - (ff4 - 53 ) = 0 . (5) 

The direct observation of the individual arguments related to ^ 3 ,^ 4 , 53,54 is 
certainly out of reach. But it may be possible to follow the evolution in time of 
the differences — g 4 and S3 — S4. Indeed, S3 — S4 appears as a beat of about 1.2 
million of years in the solution of obliquity, as the result of the beat between the 
p + S 3 and p + S 4 components of the obliquity, where p is the precession frequency 
of the axis (see Laskar, 1999a) (Fig. 7). In a similar way, gs — p 4 appears as a beat 
with period of about 2.4 Myr in the climatic precession (Fig. 7). One understands 
that because of the occurrence of these beats, the detection of the resonant state 
in the geological data can be possible, as one has to search now for phenomena 
of large amplitude in the geological signal. Indeed, in the newly collected data 
from Ocean Drilling Program Site 926, the modulation of 1.2 Myr of the obliquity 
appears clearly in the spectral analysis of the paleoclimate record (Zachos et al, 
2001 ). 

The search for the determination of these resonant angles in the geological 
data is just starting, but we may expect that a careful analysis, and new data 
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Figure 7: The main components of the obliquity of the Earth are given by two terms 
of period close to 40 000 years (p + S3 and p + S4) than induce a beat of about 1.2 
Myr period. For the climatic precession (esino;, where e is the excentricity and u> 
the longitude of perihelion from the moving equinox), the main terms are of period 
close to 20 000 years, and the beat ps — p 4 has a period of about 2.4 Myrs. 



spanning the last 60 to 100 Myr, will allow to determine the possible succession 
of resonant states in the past, allowing, for example, to discriminate between the 
solutions displayed in figure 2. Of particular importance would be the detection of 
the first transition from libration to circulation of the resonant argument (S4-S3)- 
2(p4 — Pa) - This program, if completed, will provide some extreme constraint for the 
gravitational model of the Solar System. Indeed, the observation of a characteristic 
feature of the solution at 40 to 100 Myr in the past, because of the exponential 
divergence of the solutions, will provide a constraint of 10~^ to 10“^° on the 
dynamical model of the Solar System. 
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Orbit Bifurcations and Quantum Fluctuation Statistics 



J.P. Keating 



Abstract. Recent results concerning the semiclassical influence of bifurcating clas- 
sical periodic orbits on quantum fluctuation statistics are reviewed. These point 
to the appearance of a new class of universal scaling exponents that characterize 
the divergence of spectral and wavefunction statistics in systems where chaotic and 
regular dynamics coexist (i.e. in systems with a mixed phase space) in the limit as 
0. 

According to a conjecture of Berry (Berry 1977), statistical fluctuations on the 
scale of the de Broglie wavelength in the quantum wavefunctions of classically 
chaotic systems can be modeled in the semiclassical limit by those of gaussian ran- 
dom functions (e.g. random superpositions of plane waves). This in turn prompts 
the conjecture that spectral statistics on the scale of the mean level spacing should, 
in generic chaotic systems, coincide with those of the eigenvalues of random ma- 
trices (Bohigas et al. 1983). 

On scales larger than a de Broglie wavelength, wavefunctions can be scarred 
by classical periodic orbits (Heller 1984, Kaplan 1999). According to a theory 
developed by Bogomolny (Bogomolny 1988) and Berry (Berry 1989), in completely 
chaotic systems with two degrees of freedom, where all of the periodic orbits are 
isolated and unstable, these scars have a width of the order of and, in the 
square of the modulus of the wavefunctions, an amplitude also of the order of 
It follows from the Gutzwiller trace formula (Gutzwiller 1971), which relates the 
quantum density of states of a system to a sum over its classical periodic orbits, 
that the periodic orbits also strongly influence spectral statistics on scales larger 
than the mean level spacing in the semiclassical limit (Berry 1985). 

The question arises as to what happens when, as one changes system param- 
eters, periodic orbits bifurcate; that is when combinations of stable and unstable 
orbits collide and transmute, or annihilate. Such phenomena are characteristic of 
dynamics in systems with a mixed phase space, where both regular and chaotic 
motion occurs. The contribution from a periodic orbit to the Gutzwiller trace 
formula and to the Bogomolny scar formula diverges when the orbit bifurcates, 
hinting that bifurcations are likely to give rise to fundamentally new behaviour in 
quantum fluctuation statistics in the semiclassical limit. 

That individual orbit bifurcations can have an important and sometimes 
dominant influence on spectral statistics was first demonstrated by Berry et al. 
(1998). Let N{E) denote the spectral counting function (i.e. the number of energy 
levels En < E) and let N{E) = (N{E)) denote the local average of N over an 
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energy window centered at E containing a semiclassically increasing number of 
levels. The kth moment of N is then defined by 

Mk = {{N(E)-N{E)f^). (1) 

For fully chaotic systems one would expect, on the basis of random matrix theory, 
that Mk diverges like \og{l/h)^ in the semiclassical limit. For completely integrable 
systems (in which the level statistics are expected to be Poissonian) with two 
degrees of freedom one would expect instead that Mk diverges like 1/h^. For mixed 
systems one would expect (Berry & Robnick 1986) that Mk is a sum of these two 
contributions: ^ 

Mfc ~Ci (^log(l)^ +C2^ (2) 

as /ï 0, where ci and C 2 depend, respectively, on the relative volumes of the 
chaotic and regular regions of phase space; that is, c\ and C 2 are system-specific, 
or non-universal It was shown by Berry et al. that a single periodic orbit under- 
going a tangent bifurcation (the simplest of the generic bifurcations), gives rise to 
a contribution to Mk proportional to and that this holds over a parame- 

ter range with a size proportional to around the bifurcation value. Thus one 
single orbit undergoing a tangent bifurcation dominates the contributions from 
all of the unstable non-bifurcating orbits (which collectively give rise to the loga- 
rithmic contribution in (2)), but is dominated by the regular contribution. Also, 
tangent bifurcations make a contribution to the parameter average of Mk that is 
semiclassically of the order of 

This picture extends to all of the other generic bifurcations possible in Hamil- 
tonian systems. For each bifurcation there is an exponent describing how the size 
of its contribution to Mk scales with /i, and a second exponent representing the 
h-scaling of the hypervolume in parameter space infiuenced by it. The explicit 
values of these exponents for several classes of generic bifurcations in two-degree- 
of- freedom systems were computed by Berry et al. (2000). The general trend is 
that smaller hypervolume exponents go with larger size exponents. 

In mixed systems the moments Mk are expected to be a sum of contributions 
from all of the generic bifurcations, each contributing in a way described by its 
size and volume exponents. In the semiclassical limit there is thus a competition 
between the bifurcations to determine which, if any, dominates; this is called ’’the 
battle of the bifurcations” in Berry et al (2000). It is not obvious that there will 
be a winner; it could be that the increasing sizes and decreasing volumes do not 
balance semiclassically. It turns out, however, that there is: it was found by Berry 
et al (2000) that for increasingly elaborate bifurcations the trade-off between 
decreasing volume and increasing size factors has a maximum. This leads to the 
result that the twinkling exponents Uk defined by 



lim 



= 



( 3 ) 
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exist and are given by 



/ 2mK 

\kTï 




( 4 ) 



where K runs through the positive integers. 

There are two things to note about this result. First, i^/c > /c for /c > 3. This 
means that for /c > 3 the bifurcation contribution to Mk dominates all others, 
including the contribution from the stable islands. Second, the final expression (4) 
is universal for two-degree-of- freedom systems, provided that there is a full and 
fair competition between the generic bifurcations. 

Turning now to eigenfunctions, it was shown by Keating Prado (2001) that 
the scar amplitude associated with a bifurcating orbit is of the order of and 
the width is of the order of , where a and lu are bifurcation- dependent scar 
exponents whose values were calculated for several classes of generic bifurcations. 
Crucially, a; < 1/2 and a < 1/2 for all the bifurcations studied, and for most 
a < 1/2. In this sense bifurcations may be said to give rise to superscars (recall 
that a = = 1/2 for non-bifurcating orbits in chaotic systems). 

As in the case of the spectral counting function, these bifurcations dominate 
the moments of wavefunctions. Let 



C2m{h) 



i / ((i-A"W)i’> 



0(q';F;) 






( 5 ) 



where ijjn is the eigenfunction associated with the energy level En^ the q'-integral 
is over an /i-independent volume Aq, Q is the volume of the surface in phase space 
of constant energy E, and where the angular brackets denote an average with 
respect to position (over a region which shrinks as /i ^ 0, but which contains an 
increasing number of de Broglie wavelengths) and energy (semiclassically many 
levels). 

Assuming that the wavefunctions are quantum ergodic (Shnirelman 1974) on 
the scale of the local q-average implies that C2m(^) — ^ 0 as /i — > 0 when m > 0. 
It follows from the fact that the scar exponents for non-bifurcating orbits are 
a — UÜ = \/2 that the individual contributions from these orbits to the moments 
scale as limit. The corresponding contribution from a bifurcating 

orbit is of the order of ^ and so is semiclassically larger. 

The moments defined by (5) are implicitly functions of the system parameters. 
Averaging them with respect to these parameters produces an opportunity for a 
competition between the various generic bifurcations. The contribution of each 
bifurcation must be weighted by the associated hypervolume in parameter space, 
and so scales as where 7 is the hypervolume scaling exponent. The 

bifurcation that wins the competition, and hence which determines the rate at 
which the parameter- averaged moments tend to zero in the semiclassical limit, is 
the one for which 2ma -h u; -h 7 is minimized, assuming a minimum exists. It was 
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shown in Keating & Prado (2001) that a minimum does indeed exist, and that 



with 



log h 



— Mm? 



liyn = min 



/ 2m + l K\ 
\2{K + l)^j) 



( 6 ) 

( 7 ) 



where (C 2 m(^)) denotes the parameter- averaged moments and the minimum is 
with respect to the positive integers K. 

It is natural to compare /i^ to the corresponding exponent for Gaussian ran- 
dom functions, which are often taken as models of quantum chaotic wavefunctions. 
In that case, the moments (5) are semiclassically of the order of This follows 
from the results of Section IIIB of Eckhardt et al. 1995, if the operator considered 
there is the characteristic function of the region over which the local q-average in 
(5) extends. The same rate of vanishing also holds for the eigenvectors of random 
hermitian matrices (see Section IIIA of Eckhardt et ai). Crucially, < m for 
m > 1 and so then, if the background to the scars due to individual periodic orbits 
is modeled by a Gaussian random function, bifurcations dominate the semiclassical 
asymptotics. The contributions from individual non-bifurcating orbits are always 
subdominant. 

To summarize, the exponents /am determine the asymptotic scaling of the 
parameter- averaged moments C 2 m in the limit as M 0 when m > 1. Note that, 
like the twinkling exponents, they are also universal^ that is, system independent, 
because they are determined solely by generic bifurcation processes. As pointed 
out earlier, these processes are characteristic of mixed phases-space dynamics, and 
so one might expect the exponents (7) to describe the semiclassical deviations of 
the irregular (in the sense of Percival 1973) eigenfunctions (i.e. the eigenfunctions 
associated with the chaotic regions of phase space) in mixed systems from their 
ergodic limit. (They do not describe the regular eigenfunctions, for which the 
corresponding moments have a different origin.) 

Finally, Backer et al. (2002) have recently shown that the autocorrelation 
length-scale of the of the wavefunctions 'ipn around a bifurcating orbit scales semi- 
classically as where a is the corresponding scar amplitude exponent. This is 

somewhat reminiscent of the relations between scaling exponents in the theory of 
critical phenomena. 
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Nondeterministic Dynamics and Turbulent Transport* 

A. Kupiainen 



Abstract. Velocity fields occurring in fully developed turbulence are spatially rough 
and give rise to the phenomenon of nonuniqueness of trajectories in the motion of 
fluid particles even in the absence of noise. We review progress in understanding of 
this phenomenon in the context of synthetic gaussian velocity ensembles. 



1 Richardson law 



The paradigmatic case of deterministic dynamics is given by the flow induced by 
a smooth (possibly time dependent) vector field 



dt 



v(t,R) . 



( 1 ) 



Such flows often exhibit the phenomenon of deterministic chaos signaled by the 
exponential separation of nearby trajectories: 

p(0~e^V(0). (2) 



where p — R' — R for two solutions R and R' of (1) and the equation holds 
for sufficiently small separations. The exponential separation leads to the lack of 
predictability in the long time behavior of the flow. However, such dynamics is 
still deterministic: in the limit of zero initial separation p(0) ^ 0 the trajectories 
coincide: p{t) 0. 



A very different phenomenon seems to occur in the dynamics of small test 
particles suspended in a turbulent fluid. Classical experiments on this phenomenon 
were performed by L.F. Richardson [1] in the 1920’s. He was studying the motion 
of balloons released in the atmosphere. Such motion is described by the equation 
(1) where v(t, R) is the velocity field of the air. 

The long time behavior of the trajectory of a single balloon once the mean 
drift is subtracted tends to be diffusive 

(3) 

Diffusive asymptotics is observed in a variety of dynamical systems and its occur- 
rence here is not a surprise. 

* Research partially supported by EC grant FMRX-CT98-0175 
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The surprising observation that Richardson made concerned the relative mo- 
tion of two balloons. Richardson found this separation to behave as 

p{t)^ ^ At^ (4) 

for a wide range of separations from the smallest scale to the scale where the dif- 
fusive behavior (3) sets on. Thus the separation grows slower than exponentially 
but faster than ballistically. Moreover, the constant A in (4) seems to be indepen- 
dent on the initial separation p(0), seemingly having a nonzero limit as p(0) 0! 

The Richardson law (4) seems quite well confirmed by later experiments as well as 
numerical simulations (for references see [2]) although the power 3 might only be 
approximately correct in three dimensions. We conclude that the motion of test 
particles in a turbulent fluid seems to exhibit properties that are very different 
from those observed in smooth dynamical systems. To understand the source of 
this discrepancy we need to digress on the properties of turbulent velocity fields. 

2 Turbulent velocities 

It is an experimental observation that the velocity field of a fluid in the regime 
of homogeneous isotropic turbulence exhibits approximate scale invariance on a 
wide range of length scales. Such fields are characterized by two length scales, 
the dissipative scale r] and the injection scale L. For atmospheric flows such as 
considered by Richardson 77 can be of the order of a fraction of a millimeter whereas 
L can be of the order of a kilometer. For spatial scales between 77 and L the 
velocity field is approximately self similar: if we consider the difference Av(t,p) = 
v(t, p) — v(t, 0) then for 77 << |p|, \^p\ « L 

Av(t, p) - £-°^Av{t, £p) . (5) 

where ~ means statistically, i.e. as a spatial, temporal or ensemble average and 
û; (5) holds only approximately for small moments 

<(Av(t,p)-pr>^aipr (6) 

with significant corrections in the exponent for large n (so called intermittency ) . 
The ratio ^ is proportional to where R is the Reynolds number of the flow (in 
atmospheric flows R can easily be of the order 10^). Hence 77 ^ 0 as R ^ 00 and in 
that limit the turbulent velocities loose their smoothness and become only Holder 
continuous in their spatial dependence, that is, the difference of the velocity at 
two arbitrary nearby points scales as a sublinear power of the distance between 
the points. Although R oc is a mathematical limit it should be stressed that 
smoothness of the velocity field is not the right assumption for scales larger than 
a fraction of a millimeter in atmospheric flows. For such scales the right model is 
one of Holder continuous velocities. 
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It is well known that the differential equation (1) need not have a unique 
solution if v is only Holder continuous. Indeed, the standard textbook example 
V = a G (0,1), in one dimension has two solutions x = 0 and x = ((1 — 
a)t) starting from the origin. It is therefore natural to expect non- uniqueness 
of trajectories in the turbulent fluid and seek an understanding of Richardson’s 
observations from this phenomenon. 



There is also a natural timescale related to the spatial scale r fluctuations 
(“eddies”) in the velocity field. This is the so-called eddy turnover time r(r) = 
~ For turbulent velocities ~ T Thus it would be natural 

to expect decorrelation of velocity differences in such time scales: 

<Av(i,p)Av(0,p)>=lpr/(^) (7) 

with / falling off at infinity. There seems to be, however, scant experimental data 
on (7), in particular on whether it might be true only in the Lagrangian frame. 



3 Synthetic velocity ensembles 



To gain further understanding how Richardson’s observations might be explained 
in terms of the roughness of the turbulent velocities we will consider the equation 
(1) with a velocity field with explicitly given gaussian statistics mimicking the 
properties of real turbulent velocities described in the preceding section. It should 
be noted however that high Reynolds number solutions to the Navier-Stokes equa- 
tions are far from gaussian so gaussianity is a simplifying assumption. 

Thus we assume v(x,t) is gaussian with zero mean and covariance satisfying 
(7). Concretely, we take 



lv\t,T)v^{t',T')) = A [ e 

Let us discuss the various parameters in (8). 



i k-(r— r') 






dk 

{27rY 



( 8 ) 



1. L is the largest length scale in the problem, modeling the injection scale of 
turbulent velocities. It dominates the mean square velocity field: 

/v(t,r)2\ = const f lk| ~ 

Therefore we expect strong L dependence in the one particle motion. 



2. 77 is the smallest length scale in the problem, modeling the dissipative scale of 
turbulent velocities. For ry 0 the velocities are smooth, but as 77 0 they loose 

their smoothness and become only Holder: 



(Av(t,r)2) 



= const 



L 



L-i<|k|<r7-i 



(1 — cos(k • r))|k| 



-2a-d 



dk 

{2iTr 
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which, as 77 — 0 is proportional to for |r| « L. Note that this velocity 
difference has both the 77 ^ 0 and the L ^ 00 limits. In those limits our velocity 
ensemble becomes scale invariant: 

Av(t,r) - A-^Av(A^^t,Ar) (9) 

where ~ means in law. It is believed that real turbulent velocities, i.e. solutions 
of the Navier-Stokes equations, have the 77 ^ 0 limit but not the L cx) limit. 
Indeed, it is believed that 



((f • Av(t, r))") = |r|« (1 + o(M)) 

with Cn > 0 for > 3 . This aspect {intermittency) of turbulent velocities is thus 
not modeled by our gaussian ensemble. Since the two particle motion ( 15 ) involves 
velocity differences we expect it to be less sensitive to the large scales than the 
one particle motion. 

3 . The factor 

involves the compressibility degree 0 < p < 1 which can be used to interpolate 
between incompressible velocities corresponding to p = 0 and ones corresponding 
to p = 1 which are gradients of a potential (in one dimension necessarily p = 1). 

4 . The two parameters A and B can be used to discuss limiting cases of the 
ensemble. For ^ = 0 we have the frozen ensemble of time independent velocities 






/ îk-(r-x') 



whereas in the opposite limit A 00^ B 00 with ^ held constant one obtains 
the Kraichnan ensemble of delta correlated velocities 



C 5 {t-t') j' 



i k-(r— r') 



( 11 ) 



which are Holder continuous with any exponent smaller than with 



^ = 2{a + p). 
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4 Kraichnan ensemble 

The one and two particle motion can be analyzed completely explicitly in the 
Kraichnan model. The first question we must face is the meaning of the equation 
(1) when the velocity field is not smooth enough (Lipschitz) for the uniqueness 
results of differential equations to be applicable. That is, we need to regularize (1). 
There are two natural ways to do this. The first is to reintroduce the viscous scale 
rj and study the rj ^ 0 limit. The second is to introduce molecular noise, i.e. to 
replace (1) by the stochastic equation 

— = v(t,R) + v^/3. (12) 

where [3 is Brownian motion. Both regularizations and their mixture are physically 
relevant, the relative size of the viscosity and diffusivity is described by the Peclet 
number. There are also some subtleties in the order of the limits, see [3]. Here we 
content ourselves to study the equation (12). 

The solution of (12) with a fixed realization of v defines a Markov process 
x(t|v). (This was done rigorously in [4].) Let us denote by P^(x,y|v) its transition 
probability density. It satisfies the equation 



= (/^A-vV)F,. (13) 

Due to the white noise nature of v one has to be a bit careful with the product 
in (13): the right definition is in the Stratonowich sense. This means in particular 
that the average of Pt over the velocity realizations satisfies the equation 

dt{Pt) = (kA + \D^i{Q)didj)(^Pt). (14) 

Note how the drift term in (13) contributes a second order term involving the ve- 
locity two point function. This leads to a renormalization of the molecular diffusion 
coefficient At by the “eddy diffusion constant” 

^ren = At -|- Ateddy 

where as in (9) we get that Ateddy is proportional to Thus the single particle 
motion is diffusive, given by eq. (3), and the diffusion persists even in the limit 
of vanishing molecular diffusivity At ^ 0 and is dominated by the large scale 
fluctuations of the velocity field. 

Let us next consider the motion of two particles in the velocity field picked 
randomly from the Kraichnan ensemble. Denoting the position of the first particle 
by R and the second by R -h p where p is the separation vector of the particles 
the latter satisfies the equation 



^=v(t,p + R)-v(t,R), 



(15) 
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We may write this in terms of the velocity field in the frame of the first particle, 
the so called quasi- Lagrangian velocity 

=\{t,r + R{t)) , (16) 



so that 

(17) 

In general, the probability law of the quasi- Lagrangian velocity field is a compli- 
cated function of the law of v, in particular not gaussian. However, in the Kraich- 
nan model due to the delta correlation in time, it is not hard to check that the 
quasi-Lagrangian velocity field has the same law as the velocity itself. Therefore 
p satisfies the single particle equation with the velocity v replaced by the velocity 
difference Av. Since the spatial part of the covariance of the latter is 

2(L>^^(0) - D^^{p)) = 2æ^{p) 

we conclude that in the limit of vanishing molecular diffusivity the transition 
probability density for p averaged over the v, Vt{p, p') satisfies 

dtVt^é^{p)d,d^Vt (18) 

i.e. a diffusion equation where the diffusion constant depends on the separation of 
the particles. In the limit 7/^0 and L ^ oo the function (p) is homogeneous 
of degree ^ i.e. 

æ^{p) = \p\^éHp). 

This leads to the scaling 

'^t{po,p) = ^p) (19) 

which implies 

= t^ (20) 

The function / turns out to be sensitive to the compressibility degree of the velocity 
field [5]. 

For incompressible velocities /(O) ^ 0 and therefore 
< p{t,p^f >= 



where 

lim H > 0. 

This fits to Richardson’s observations and yields the Richardson law provided 
^ = I which in turn follows from a = ^ = /3, the Kolmogorov values. 
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Since (20) represents the average over velocity realizations of the trajectory 
separation we may conclude that with nonzero probability the trajectories in a 
fixed incompressible velocity field are non unique. Thus for such velocities, the 
limit of the Markov process x(t|v) as the molecular diffusivity aî ^ 0 remains a 
stochastic process. The solution to the deterministic looking differential equation 
(1) is not a single trajectory, but a probability measure on a set of trajectories! 

If we reintroduce the viscous scale r) below which the velocity field is smooth, 
the equation (20) remains true for separations larger than rj. For smaller separa- 
tions the standard exponential separation (2) holds, with an explicitly calculable 
Lyapunov exponent A [10]. 

The Kraichnan model has served as a testing ground for many other ideas 
in turbulence. Just to mention one, the study of the multi-particle motion leads 
to understanding the problem of intermittency in the statistics of a tracer concen- 
tration carried by the velocity field [6], [7], [8], [9]. However, the delta correlation 
in time of the velocity field is certainly an unphysical idealization and therefore it 
would be nice to be able to relax this assumption and test the robustness of the 
mechanisms uncovered in the Kraichnan model. Thus we will turn to the study of 
the time correlated self similar ensemble (8). 



5 Beyond the Kraichnan ensemble 

Let us now consider the equation for the trajectory separation (16) for more general 
velocity fields [11]. If we wish to model this problem with velocities taken from the 
self similar ensemble (8) we need to address the question of the difference of the 
statistics of v and In general, these agree only in the delta correlated case. 
We have two choices: 

(a) Model the Eulerian velocity v by (8). 

(b) Model the quasi-Lagrangian velocity by (8) . 

We consider first the case (b). This means we replace (16) by the equation 

f=Av(i,p), (21) 

where v has the statistics (8) which implies 



r) r') 

2A [ e 

J\k\>L 



1 - cQs(k ♦ (r - r')) 

Jçd-\-‘2oi 



p«(k,p) 



dk 

{2ttY 



( 22 ) 



where we took r} = 0. 
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In the Kraichnan model the separation process p is self similar as seen in 
(19). Let us therefore define the rescaled separation 



Px{t) = X-^p{Xt) 


(23) 


and study its dynamics. We obtain 




^ = Avx(t,px), 


(24) 


where 




^'Vx{t,p) = A^“‘^Av(Af, A^'p). 


(25) 



Now Ava is distributed again as in (22) with the parameters 

^(A) = 

B{\) = 

L(A) = A-'^L. 

We can now ask when is this rescaled velocity field Ava identical in law to Av. 
This happens when 

L = oo 

and either 

(1) a + 2/? = 1 , a = 

or 

(2) B = 0 , a = ^ 

or 

{3) A ^ oo B oo with ^ fixed, a = 

In all these cases we conclude that p{t,po) and p\{t,X~^ po) should be iden- 
tically distributed and therefore 

(26) 

where the scaling function g would again presumably depend on the compressibility 
and possibly also the exponents a and j3. 

The case (1) consists of a line of fixed points under the scaling (25), whereas 
the case(2) is the frozen ensemble and case (3) the Kraichnan ensemble. Note that 
the Kolmogorov value (^, ^) lies on the line (1) and gives rise to the Richardson 
value <j = I in case g{Q) > 0. 

Let us finally ask the question of what can be said of the behavior of the pair 
separation when the parameters don’t correspond to the fixed point values above 
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i.e. let a + 2/3 1 and A, B finite, nonzero. As usual in such contexts we expect the 

short and long time behavior to be controlled by appropriate fixed points. Indeed, 
since A ^ 0 behavior of p\ gives the short time asymptotics and A ^ oo behavior 
the long time asymptotics we need to study what happens to the velocity Av^ of 
(25) in these limits. It will be convenient to distinguish various regions in the a, [3 
plane as depicted in Figure 1. 



P' 

1 V 

A 

B \ 

\ 

c 

0 




i, 

2i 



Fig. 1 

Let us consider first A — > oo at L = oo. Taking <r = fixes the ratio 

with >1(A), B{\) — >oo if a + p > 1 (domain A in Fig. 1) or if a + 2P < 1 
(domain C in Fig. 1). The latter case leads to a non-singular Kraichnan ensemble 
of velocity differences with Ç = 2{a + /?) whereas the former one does not (it 
would correspond to ^ > 2, L = oo). We may then expect that 

Urn V{X^ po, A°'p; At) = V^’^{po, p\ t) 

A— >oo 

^ = 2(i-L-/ 3) a + 2/3 < 1 (27) 

where pertains to the Kraichnan model with ^ = 2(a + /3) . This is indeed 
consistent with the scaling properties of the Kraichnan model dispersion. 

Taking a = keeps A{\) constant while B[\) ^ 0 if a+2/5 > 1 (domains 
A and B in Fig.l). We then expect that 

lim X^V{X^po, A^p; Xt) = V^'^{po,p; t) , for <j = -r^ and a; + 2/3 > 1, (28) 

A-^oo ^ ^ 

where stands for the PDF of the frozen velocity model with Holder exponent a. 

Inquiring about the short-time asymptotics of the trajectory dispersion re- 
verses the asymptotics. We should then have 

lim X^ V{X^ pQ,X^ p\Xt) = V^^{po,p;t) for a = and o + 2/3 < 1 (29) 
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(i.e. in domain C in Fig, 1) with the same value of the Holder exponent a, and 



y V{V poA" p-At) 



'P^'{Po,P,t) 



for a = 



1 

2{l-a-p) 



and 



(a-\-p <1, 
\a^2f3>l 



(30) 



(i.e. in domain B in Fig. 1) with ^ = 2{a + /3) for the Kraichnan model. Again, 
this is consistent with the scaling of the limiting PDF’s. 

To summarize, the scale invariance of the statistics of the pair dispersion 
although broken away from the a + 2/? = 1 line, should be restored at long and 
short times. In the regions B and C the long (short) time asymptotics of the 
separation is controlled by the frozen (Kraichnan) and Kraichnan (frozen) fixed 
points respectively. In the region A, although the velocities there possess equal 
time statistics that is only Holder continuous, the trajectories seem to be unique 
due to the rapid decrease of time correlations in short scales. 

It is worth stressing, that the relations (27) to (30) are conjectural. The PDF 
V is â complicated nonlinear functional of the velocity statistics and the conjec- 
tured relations assume their continuity in an appropriate topology, which is not 
obvious. In particular, since the convergence of the rescaled velocity covariances to 
the one of the Kraichnan model is very slow at long distances, there is a potential 
threat for the corresponding convergence of the rescaled PDF 'P{po,p;t) coming 
from the contribution of trajectories that venture far apart, if such contributions 
are important. Similarly, the slow convergence to the frozen model at long dis- 
tances could create problems for the corresponding convergence of the rescaled 
pair dispersion PDF. Whether such effects invalidate some of the conclusions (27) 
to (30) could be, in principle, studied in perturbation theory around the Kraichnan 
or frozen model. 

Let us finally note that the regions B and C have a simple interpretation 
in terms of the so called eddy turnover time. This is the typical timescale of 
length scale r fluctuations in the velocity field Teddy The 
correlation time for such fluctuations in our ensemble is Tc(r) = Thus on the 
line a-\-2j3 = 1 these times coincide, whereas e.g. in the region B Teddy ( t) > 
for small separations i.e. such fluctuations are decorrelated, in accordance with the 
picture that this region in small times should be governed by the Kraichnan fixed 
point. 



Let us now turn to the caise (a) above i.e. modeling the Eulerian velocity 
field by the gaussian self similar ensemble. The quasi-Lagrangian velocity field 
governing the evolution of the pair separation is defined as the Eulerian v relative 
to the frame of reference of one of the particles. The motion of this particle, as we 
saw before, is dominated by the large scale fluctuations in v that have variance of 
the order Indeed, for short times this motion is ballistic as can be seen by 
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rewriting the trajectory equation (1) as 



R(i) = I 
0 



v(s, 0) ds 



+ 



J [v(s, R(s)) — v(s, 0)] ds . 
0 



(31) 



Since the r.m.s. value of velocity t’(s,0) in the ensemble (8) is proportional to 
we may expect that, for fixed t, the first integral is of the order . On 
the other hand, the r.m.s. equal-time velocity differences on scales much smaller 
than L are of the order distance^. In particular, on the scales ~ they are of 
the order and the second integral is of the order <C L^. More precisely, 
let us observe that the Gaussian process with the components L”“v(t,0) and 
[v(t,L^r) — v(t,0)] converges in law when L ^ oo to the t-independent 
Gaussian process (vq, w(r)) with the 2-point functions 

J (2n)‘i ’ 

/• (1 _eik r)(i _e-ik.r') 

0 . (32) 



{ Vo Vo ) = 

(w(r)w(r')) 

(vow(r)) 



Note the independence of vq and w(r). It is then natural to conjecture that the 
following convergences in law take place, describing the leading terms in the single 
trajectory statistics for large L : 

L-“r(f) ^ vot, (33) 

L—>‘Og 

t 

- L^vot] — ^ [w{vos)ds. (34) 

L^oo J 

0 



Turning next to the pair separation, we expect it to show L dependence in 
its statistics due to the strong L dependence of the trajectory of the reference 
particle which we dent expect to decouple as in the Kraichnan case. It appears 
quite difficult to study this L dependence in the general gaussian ensemble above 
and particularly for long times. Here we will consider just a simple case of (8), 
namely the frozen model in one dimension. 

We shall consider two particle trajectories x{t) and x(t) -fpo staring at time 
zero at origin and po > 0, respectively, and we shall try to estimate the behavior 
of their separation p{t). First notice that p{t) > 0, i.e. the order of the particles 
on the line will never change. For large L, the dominant events are when the 
velocities of the particles and at the intermediate points are all of the order 
and of the same sign during the time interval (0, t). Let us suppose that they are 
positive (the case of negative velocities can be treated in a symmetric way). 
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The crucial fact resulting from the one-dimensional geometry is the identity 



Po a^(^)+p(i) 




0 x{t) 



The left hand side is the time At that the first particle takes to reach the initial 
position of the second one. The best way to understand the above identity is 
by releasing the second particle after the delay At so that both particles move 
subsequently together. The delay changes nothing in the movement of the second 
particle since the velocity field is frozen. The delayed particle will then arrive at 
position x{t) at time t (together with the first particle) and at position x{t)-\-p{t) 
at time t -h At. But the right hand side of Eq. (35) is the time that the second 
particle takes to move from x{t) to x{t) -h p{t). Hence the identity which may be 
also proven more formally by noticing that the time derivative of its right hand 
side vanishes. 

Writing for large L 

x{t) = L^vot + 0{L^ ), ^(^(0) — ^^^0 + w{vot) -}- 0(L^^), (36) 



see relations (33) and (34), and anticipating that p{t) = 0(1), Eq. (35) may be 
approximated as 



/^Q —2a\ 

L'^uo + L^^w{vot) 

from which we infer that 

p{t) - Po = L°‘^~°'pov^'^w(v^^t) + 



(37) 



(38) 



The process w{x) is the two-sided fractional Brownian motion, i.e. the Gaussian 
process with mean zero and 2-point function 

{w{x)w{y)) = + y^°‘ - \x - y\^°‘) = G{x,y) (39) 



for X, y > 0. Note the scale invariance under w{x) i— ^ p ^w{px) . The precise 
conjecture would then assert the convergence in law 

^ ■")[/?({)- ^o] — > poWo“^M^). (40) 

L—*^oo 

Note that the above calculations indicate that not only a single particle motion, 
but also the separation of trajectories in the Eulerian frozen one-dimensional ve- 
locity ensemble are dominated by the scale L velocities, i.e. by the large eddy 
sweeping, but the effect on the separation is inverse to that on the single particle 
motion. Whereas the latter one becomes very fast for large L, the trajectory sep- 
aration becomes essentially frozen to the initial value in a localization- type effect. 
It would be interesting to know if the localizing tendency persists in the more 
general Eulerian Gaussian ensembles (8). 
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6 Outlook 

The Richardson law for pair dispersion seems to be intimately connected to the 
irregularity of turbulent velocity fields. The framework of classical chaos is not 
applicable in such situations, being restricted to the viscosity dominated scales 
where the velocity field presumably is smooth. 

The simple gaussian ensemble of velocities that are delta correlated in time 
(the Kraichnan model) reproduces this law and so does the model where quasi- 
Lagrangian velocities are modeled by a more general scale invariant gaussian en- 
semble with time correlations. Such gaussian models with time correlations how- 
ever lead to strong sweeping effects by the large scale velocity fields and might not 
be good models for turbulent velocities where the sweeping effects should be less 
pronounced. It is still a major open problem how to quantitatively understand the 
Richardson law in real turbulent velocity fields. 
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Prime Corrélations and Fluctuations 



R Leboeuf 



Abstract. Based on the expansion of the counting function of the prime numbers 
in terms of the zeros of the Riemann zeta function, several aspects of the fluctua- 
tions of the primes are reviewed and further developed. In particular, the average 
density-density pair correlation, and the moments and autocorrelation of the fluctu- 
ations of the counting function are considered. The Riemann hypothesis is assumed 
throughout the paper. 



1 Introduction 

In the semiclassical theory of bounded quantum systems two closely connected 
“spectra” are of central importance, namely the set of quantum eigenvalues and the 
set of classical periodic orbits. Trace formulae express the quantum density of states 
in terms of the classical periodic orbits, both for integrable and chaotic systems 
[1, 2]. Properties of one set have been identified to corresponding properties of the 
other set, like for example the classical Hannay-Ozorio de Almeida sum rule with 
the quantum linear growth of the form factor close to the origin [3]. 

In the realm of analytic number theory, since the work of Riemann two dis- 
tinct sets are attached to the study of prime numbers, namely the primes them- 
selves and the zeros (and pole) of the Riemann zeta function. Their relationship 
with the theory of complex quantum systems received a decisive impulse after 
Montgomery’s work on the pair correlation of the zeros. It establishes the valid- 
ity of the GUE ensemble of random matrices [4], thus opening the road to their 
interpretation as the eigenvalues of some “chaotic” hermit ian operator. In this dy- 
namical analogy, the prime numbers play the role of the classical periodic orbits of 
some unknown chaotic system. This interpretation is based on the formula express- 
ing the density of Riemann zeros as a sum over prime numbers, whose structure 
is very similar to Gutzwiller’s trace formula for chaotic systems [5]. 

The primes and zeros are, however, “doubly-connected”, since the density 
of prime numbers can in turn be expressed as a sum over the Riemann zeros 
(and pole). Although on large scales the properties of prime numbers are very 
regular, as described for example by the prime number theorem, on local scales 
comparable to their mean spacing they show erratic fluctuations that make their 
prediction quite difficult. These fluctuations are controlled by the zeros, and it 
is therefore quite natural to relate as much as possible the properties of both 
sets. It is our purpose here to review and further develop some of the statistical 
properties of prime numbers, keeping in mind the applications and potential use 
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of the results in the context of dynamical systems. Two main quantities are to be 
considered, the average pair correlation of the prime numbers and the distribution 
and autocorrelation of the counting function 7t(x) (the number of primes < x). 

After introducing the basic formulae in §2, the results for the pair correlation 
of primes are presented in §3. It is shown that the average density i? 2 (y) of prime 
pairs separated by a distance y and located around x (with x oo) is, locally, 
given by the formula 



R2{y) 



-p\x) 



Si [27Txlog(gg^) 



:log 



±yH\ 

-vl-i) 



o^{x) 



1 - 



Si(27Tj/)' 



( 1 ) 



where Si(rr) = sinu/u du is the sine integral function and p{x) = log“^ x is the 
asymptotic average density of primes. The last approximation on the r.h.s. holds 
when 2 / <C X. If, moreover, y > 1, Si(27ry) ^ tt/ 2 and R 2 {y) ~ p^(l - 1/2^), a 
result in agreement with the average asymptotic pair correlation conjectured by 
Hardy and Littlewood [6] (cf §3 for more details). Eq.(l) expresses the presence 
of anti-correlations, i.e. on average the prime numbers repel each other on local 
distances. However, since the average distance between primes increases with x, 
the effective action of these interactions diminishes as x ^ oo. 

Equation (1) is the analog for prime numbers of the well known random 
matrix formula 






sin^(7r p^{t) e) 



rr2^2 



( 2 ) 



that describes the density of Riemann zeros separated by a distance e and located 
around a point f ^ oo on the critical axis (p^(0 = (27 t)“^ log(t/27r) is the aisymp- 
totic average density of the zeros) . This behavior has been conjectured to be valid 
generically for quantum chaotic systems [7]. 

In the same way as Eq.(2) is known to hold only on a limited range in e 
(with non-universal structures appearing at large c’s), Eq.(l) also has a limited 
range of validity. This is a further consequence of the properties of the Riemann 
zeros on the prime correlations. Denoting ti = 14.1347. . . the imaginary part of 
the lowest non-trivial zero of the Riemann zeta function, in §3 it is shown that at 
y ^ xjt\ a transition takes place, and that for y ~ x/^i Eq.(l) does not hold any 
more. Instead, a different behavior, which includes positive correlations, occurs. 
This new regime is also marked by the appearance of sharp peaks of negative 
correlation at positions determined by the lowest prime numbers. Again, this is 
similar (although less complex) than the hierarchical structure of anticorrelation 
peaks observed in the non-universal fluctuations of [8]. 

§3 contains also a detailed analysis of the Fourier transform of the pair cor- 
relation function, the form factor of the primes. 

Similar results to those presented here for the pair correlation of primes were 
previously obtained in Refs. [9, 10, 11]. In this respect the material of §3 should 
be considered to a large extent as a review. The Hardy-Littlewood average tail 
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has been discussed in most of the above mentioned articles. Some aspects of the 
long-range fluctuations (i.e., the presence of anticorrelation peaks) can be found in 
[11]. Eq.(l) is not in the literature, but could in fact be derived from the inversion 
arguments of Ref. [10]. Though similar in nature with respect to other works, the 
techniques used here are distinct and provide a systematic, global and alternative 
perspective of the correlations acting among primes. 

In §4 another aspect of the fluctuations of prime numbers is considered. 
Here the central object is the counting function 7 t(x), and more specifically the 
fluctuations of that function around its mean value. From a statistical point of 
view, the basic problem is to compute the distribution of the fluctuations that 
occur in a window centered around a point x. In the asymptotic limit x oo this 
problem was solved recently. Under the hypothesis of the linear independence of 
the complex part of the Riemann zeros, a closed form for the Fourier transform of 
the distribution was obtained in [12]. The distribution was found to be symmetric 
with respect to its average value, and different from a Gaussian law. 

Here we concentrate on the computation of the moments of the distribu- 
tion, and on the finite-x corrections to the asymptotic law. The moments can be 
expressed in terms of sums over the Riemann zeros. It is shown that the domi- 
nant contributions to these sums come from the low-lying zeros. Assuming, as in 
Ref. [12], their linear independence, explicit expressions are obtained for the lead- 
ing order behavior as x ^ oo. We find that for large but finite x the distribution is 
asymmetric, with a negative third moment which is of order \fx with respect to 

the leading behavior of the distribution. This is in agreement with numerical ob- 
servations. Finite-x corrections to the even moments are also computed, including 
those arising from the correlations acting among the zeros. 

The autocorrelation of the fluctuating part of 7t(x) is also computed. This 
provides a complementary information with respect to the distribution, it gives 
information on how the value of this function depends on neighboring points. An 
explicit formula is obtained that shows that the correlations never decay. The 
structure of those correlations is also remarkable, a series of parabolae joined by 
discontinuities of the first derivative. The origin of this structure is explained by 
making a connection with the fluctuations of the sum of prime numbers < x. We 
also provide a numerical verification of these oscillations and discontinuities, which 
incidentally also demonstrate the existence of the anticorrelation peaks in the pair 
correlation of the primes. 

§5 summarizes the results and adds some concluding remarks. 

2 Prime number density and counting function 

The density of prime numbers is defined as 

- p) , 

V 



( 3 ) 
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where the sum extends over all prime numbers p >2, and 0{x) is Dirac’s distribu- 
tion. The work of Riemann provides an explicit formula for this function [13], 



p{x) = 



X log X ^ r 



( 4 ) 



The a;’s label the location in the complex s-plane of the zeros and pole of the 
Riemann zeta function C(^)- The index = +1 for the pole, and = —1 for 
the zeros. For a fixed x, the series in r is finite, and restricted to values satisfying 
the condition > 2. /i(r) is the Mobius function defined as 

{ 0 if r has one or more repeated prime factors 

( — 1)^ if r is a product of k distinct primes 
1 if r = 1 . 



The structure of Eq.(4) is better displayed when p{x) is split into a smooth 
and an oscillatory part, p(x) = p + p. The smooth part p is a monotonie function 
of X and is obtained by keeping in Eq.(4) only the contributions of the real cj’s 



p{x) 



1 fMO/iA 1 ^ 

X log X ^ r V “ 1 / 

r—l ^ ' 



( 5 ) 



The first term inside the parentheses is the contribution of the pole located at cj = 
1, while the second comes from the trivial zeros located at a; = —21, I = 1, 2, 3, . . .. 
In the limit x — > oo the contributions of the latter are negligible. 

In contrast to the pole and real zeros, the complex zeros of (^(s) located at 
uj = 1/2 da it a, (y = 1,2,3,..., introduce oscillatory terms whose interference is 
responsible for the singular structure of p(x). 



p{x) 



2 

xlogx ^ r 

r=l 




( 6 ) 



We will assume Riemann hypothesis (i.e., the are real) and concentrate on 
asymptotic properties of prime numbers as x ^ oo. Most of the results that follow 
are valid to leading order in x. In general, we do not explicitly write down any 
correction terms, unless explicitly stated. The leading order behavior of p, obtained 
by keeping only the r = 1 term in Eq.(5), provides the well known result 



p{x) 



1 

logx 



( 7 ) 



while the leading r = 1 contribution to the oscillating part is [14] 



2 



y/xlogX 



oo 

COs(ta log x) . 

a=l 



p{x) = 



( 8 ) 




Vol. 4, 2003 Prime Correlations and Fluctuations 



S731 



In a quantum mechanical context, loosely speaking it is possible to give to 
Eq.(8) a “semiclassical” interpretation, as the oscillating part of the trace formula 
for the spectrum of an Hermitian operator whose eigenvalues are the prime num- 
bers. In this analogy, the complex zeros of ({s) are playing the role of periodic 
orbits. If X is regarded as the energy, the action of the orbits is Sa = ta logx and 
their period, defined as dSa/dx, is 



Ta — tajx . 



( 9 ) 



Since, as we are going to see in more detail in §3, the primes asymptotically tend 
to behave as an uncorrelated sequence, then this suggests that the underlying 
classical dynamics should be integrable (i.e., a non-chaotic system). 

From Eq.(8) it follows that each complex zero contributes to the prime num- 
ber density with an oscillatory term whose period of oscillation, in the neighbor- 
hood of some point x, is given by 






a 



2ttx 



2tt 

Ta 



( 10 ) 



The structure of p{x) is thus governed by several scales. The first one is 
simply X, the neighborhood around which the properties of primes are evaluated. 
Asymptotically x ^ oo, and this is the large parameter in the problem. The 
second, usually intermediate in dynamical systems but here of order x, is given 
by the complex zero with the lowest imaginary part, ti, that produces in p(x) 
oscillations of wavelength Xi = 27rx/ti. The third relevant scale is the mean 
distance 5 between consecutive primes. According to Eq.(7), asymptotically it is 
given by 

6 = p~^ = logx . (11) 



To resolve structures in a scale of order S, Riemann zeros with imaginary part of 
order 




27TX 

logx 



( 12 ) 



should be included in the sums (6) and (8). 

There are expressions analogous to Eqs.(4)-(8) for the counting function of 
the prime numbers, 

^(^) = 

p 



related to the density by p{x) — d7r{x)/dx. 0(x) is the unit step Heaviside function. 
7t(x) increases by one at each prime p. The exact “trace formula” obtained by 
Riemann for the counting function reads [13], 



= E ^ E Li (a;"'^’-) 

r—1 uj 



( 14 ) 
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where \A{z) — dt/ log t is the logarithmic integral function. As for the density, 
for a fixed x the series in r is finite, and restricted to values satisfying the condition 
> 2 . 

As the density, the counting function can be decomposed into two parts 
(smooth and oscillatory), corresponding to the contributions of the real and imag- 
inary ct;’s, respectively 

7t(x) = W(x) -h w(x) . 

The smooth part is given by 



r=l 



ti(r) 



-^Li 

/e-1 



(15) 



The first term inside the square brackets is the contribution of the pole at a; = 1, 
while the second one, a sum, originates from the trivial zeros. In the limit x ^ oo 
the contribution of the latter are negligible. 

The complex zeros of C(s) give oscillatory terms. 



oo 

7t{x) = -2 Rey^ 

r=l 



/i(r) 



oo 

I] Li [a: 



{l/2+ita)/r 



(16) 



The same three scales x, X\ and 6 mentioned before are also relevant for the 
counting function. In the limit x ^ oo the dominant terms of the two contributions 
are 



7t(x) = Li(x) , 



(17) 



and 



2^/x 

loga:;^ [2 






; COs{ta log x) + ta siïi{ta log x) 



(18) 



When only the term r* = 1 in Eq.(14) is considered and 7t(x) is approximated by 
(17) and (18), it reproduces in fact the asymptotic behavior of the function [13] 

— X^r=i increases by 1 at primes, by 1/2 at prime squares, 

etc. Our asymptotic results of the forthcoming sections describing the behavior of 
7t(x) are therefore also valid for J(x). 

The functions p(x) and 7t(x) are thus described in its simplest approximation 
by a smooth part associated to the pole of (^(s). Superimposed to this monotonie 
behavior are oscillations controlled by the complex zeros. The singular structure of 
these functions arises from subtle interferences between the zeros, and imaginary 
parts of order ts are necessary in order to resolve the spectrum on a scale Ô. 
Aside from these small-scale structures, there are long-range fluctuations in the 
density and counting function on scales of order Xi, determined by the lowest 
complex zero. The terms with r > 1 in Eqs.(6) and (16) produce oscillations in 
even larger scales, but these are asymptotically negligible compared to the leading 
contributions. 
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3 Pair Correlations 



The fluctuations of the prime numbers are quite different from that of the Riemann 
zeros. In a simple model proposed by Cramér [15] and later on developed in [16], 
the correlations are ignored and, locally, the primes are considered as random 
independent numbers. If that were true, the density of prime pairs separated by a 
distance k, and located around a point x ^ oo, would be simply proportional to 
the square of the probability to find one prime number in that region 



R2{k) - 



1 

log^ X 



(19) 



Cramer’s model provides accurate predictions for some properties concerning the 
primes. However, correlations among primes are believed to exist. In a celebrated 
paper [6], Hardy and Littlewood conjectured that the density of prime pairs is 
given by 

^ . (20) 

log X 

For k even, a{k) is expressed in terms of products extending over all odd primes p 



a{k)=2C2 n "^here C 2 = [] = 0.660161 , 

p\k p>2 ^ ^ / 

p>2 

(21) 

whereas a{k) = 0 if /c is odd. 

(j{k) is an erratic function of A;, and definite trends are difficult to extract. 
However, it is known [9] that when averaged by a suitable function, and for A: ^ 1, 
it takes the simple form 

= 1 - ^ • ( 22 ) 

This relation expresses the presence of anti- correlations, i.e. on average the prime 
numbers repel each other at large distances (see also [17, 18] for some related 
results) . 

Our aim is to investigate the correlations between the primes by focusing our 
attention on the average behavior of <j(A:). The approach used is based on Eq.(8), 
that expresses the fluctuating properties of the prime numbers in terms of the 
imaginary part of the non trivial zeros of the Riemann zeta function. 



3.1 Prime Correlations: smooth part 

The Hardy-Littlewood conjecture Eq.(20) describes a departure from Cramer’s 
model. As a function of the distance between primes, i?^^(A:) predicts highly 
irregular oscillations while, on average, it leads to a slow decay of correlations 
according to Eq.(22). 
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Analytic methods like Riemann’s expansion of p(x) are not well adapted 
to describe the highly irregular number-theoretic oscillations of the correlations, 
since these are encoded in subtle interferences acting among the In contrast, 
they are well adapted to describe coarse-grained properties because appropriate 
averages are natural in a wave-oscillatory description. We thus concentrate on 
average or mean- values of the correlations, that provide less accurate but more 
simple and elegant results. 

Our purpose is to compute the asymptotic x ^ oo behavior of the local 
average density of pairs of primes separated by a distance defined as 

R 2 {y) = {p{x - y/2) p(x + y/2))Ax ~ P S(y) . (23) 

In contrast to the variable k used for the argument of R 2 in Eqs.(19)-(22), y is 
here a continuous variable. The angular brackets denote a local average over the 
position of the reference point x, 




The window size Ax must satisfy two conditions. It must contain a sufficiently 
large number of oscillations of f(u) to guarantee the convergence of the average. 
This means that it should be large compared to Xi. Moreover, it should be small 
compared to x to avoid variations of the smooth part. Although the latter condition 
is not strictly satisfied due to the proximity of both scales, in the following we will 
neglect the variations of the prefactors in Eq.(8), which introduce lower-order 
corrections in x. Alternatively, the full average {f{x)) = x~^ f{u)du could be 
considered; it produces equivalent results. For simplicity, from now on we omit the 
subindex Ax in the angular brackets. 

The asymptotic behavior of R 2 {y) is obtained by approximating in Eq.(23) 
p = p p hy Eqs.(7) and (8). The product of the smooth parts gives p^{x). The 
average product of the cross terms p(^ is zero, since all oscillations occur on a 
scale smaller than Ax. Then 

R 2 {y) = ^ + Y 2 (y) = ^ + T 2 {y) - p 6{y) , (24) 

log X log X 

with 

T 2 {y) = {p{x - y/2) p{x + y/2)) = 

■ ^2 ( El cos [ta log(a: + y/2)] cos [i„/ log(x - y/2)] ) . (25) 

X log X 

^ a, a' 

Using cos a cos 6 = [cos(a + 6) T cos(a — 6)]/2, the average over the cosine of the 
sum of the arguments vanishes. Then 

Î 2 (y) = , ^ 2 " ( E cos [ta log(x + y/2) - ta' log(x - y/2)]) . 

X log X 



( 26 ) 
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There are two distinct contributions to this double sum. The first, diagonal one, 
is considered in this subsection. The second is computed in §3.2. 

The diagonal terms a = a' give 







COS 




(27) 



To evaluate the sum, we simply approximate it by an integral (to simplify the 
notation we omit the brackets). 



f “"(y) = 



X log X Jo 



^oo 

/ dtp^{t)i. 
Jo 



nog 



x + y/2 
x-y/2)\ 



(28) 



where we have introduced the smooth density of the complex Riemann zeros, /9^(f). 
We only need the asymptotic behavior of , given by 



It is convenient to re-write the integral in Eq.(28) in terms of the variable r — tjx 
(cfEq.(9)) 



pdiag 



(y) = ‘2^ p- 



noo 

/ dr 
Jo 



XT log 



/ x + y/2 Y 
\x-y/2j 



Using Eq.(29), we have P(^{xr) = p,^{t) + (27rp) Then the diagonal part of T 2 (y) 
is 



roo 

=pS{y) + 2p'^ dr p^(r) cos 
Jo 



XT log 



/ X + y/2 
\x - y/2 



) 



(30) 



Now we evaluate the integral. When it is written as = limi^^oo fo* , the highly 
oscillatory terms arising from the upper limit are washed out by the average. 
Taking into account the definition (24), the result is. 






1 

2a;log^xlog(f^) 



1 1 
log^x 2y 



(31) 



The last approximation holds when y x. 

This is the first result concerning the two-point correlation function of the 
prime numbers, that coincides with Eq.(22) (see also [11]). It confirms, in agree- 
ment with the Hardy-Littlewood conjecture, the presence of (anti)correlations. It is 
a smooth function valid, in the approximation used to obtain Eq.(31), for arbitrary 
distances. However, it is accurate only for large values of y. This is because the 
correlations between Riemann zeros can only be ignored for small t^s (compared 
to the scale associated to the mean spacing). This corresponds, through a Fourier 
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transform, to y logo: (cf §3.3 for details). Therefore, Eq.(31) can only be trusted 
in that range (in §3.3 we will show that it holds in the range \ogx y x/ti). 

The derivation of Eq.(31) depends only on a global property of the com- 
plex Riemann zeros, namely their average density. It does not require any further 
knowledge, and assumes the Riemann hypothesis. In contrast, the computation of 
the off-diagonal terms (to be done in §3.2) requires additional information. 

It is important to stress an elementary point here. Since the average distance 
between primes increases as x increases, in Eq.(31) it is natural to re-scale all 
distances by the mean spacing between consecutive primes. Defining s = y/6 = 
y/\ogx, the rescaled two-point function 7 ^ 2 ( 5 ) = R 2 {y = <5<5)/p^ is given by 



n2{s) = l- 



1 

2s log X 



(32) 



For a fixed (and finite) value of s, clearly 7 ^ 2 ( 5 ) — > 1 as a: ^ 00 . Therefore, asymp- 
totically the prime numbers become uncorrelated, in agreement with Cramér’s 
model. Eq.(32) also means that asymptotically the only relevant part of the av- 
erage prime pair correlation function is its tail, described by Eq.(31) (and further 
elaborated in §3.3). 

The prime correlations are thus a finite-rr effect, that vanishes asymptotically. 
Further finite-a; corrections to the Hardy Littlewood tail may be computed from 
the higher orders in the expansion of the logarithm in Eq.(31) . 



3.2 Prime correlations: oscillations 

In §3.1 we have computed the diagonal contributions to Eq.(26). Here we improve 
the calculation by considering the off-diagonal terms a ^ o'. 



rpOf f 



(y) = 



X log X 



{ y] cos [ta log(x + y/2) - ta' log(x - y/2)]) 



(33) 



a^a' 



We set 



+ € , 



i.e., e is the (positive or negative) distance between two different Riemann zeros. 
For positive values, it varies in the interval {emin^oo) (where emin > 0 is the dis- 
tance to the nearest neighbor), while for negative values it varies in the range 
—{ta — ti)). Large values of e produce highly oscillatory frequencies in 
Eq.(33), and these are eliminated by the averaging procedure. We are therefore 
allowed to extend the negative range to (— Cmm, — 00 ) with negligible error. More- 
over, Y 2 should be an even function of e. Re-arranging the sum in terms of positive 
values of e only, Eq.(33) becomes 









00 00 

(E E 



X log X “ ^ 

° a— 1 



COS 



iclog 



/ x + y/2 \ 
\x-y/2j 



cos(eloga;)) . 
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Similarly to the diagonal terms, in the simplest approximation the double sum 
is replaced by integrals. The substitution requires the average density of pairs of 
Riemann zeros separated by a distance c, Because can 

be arbitrarily small we end up with 



nOff 



(y) = 



xlog X Jo 



f 



dt cos 



nog 



x + yt‘2 

X - y/2 



roc 

/ de Y 2 {^) cos(elogx) 
Jo 



(34) 



We now need an explicit expression for T 2 ^(c). Following Montgomery’s conjecture 
[4] , we make use of the pair correlation function of the Gaussian Unitary Ensemble 
of random matrices [19], 



Y2H0 






(35) 



Making the change of variable 6 — irp^e, the second integral in Eq.(34) becomes 



rOO 

/ de 4^2^ (e) cos(6logo:) = 

Jo 

roc 

Jo 



P({t) 

IT 



dO cos(a 6) = 

0"^ 



p<(0 1 



|o-2|-(a-2)] 



where a = logx/TTp^{t) = [tt p{x) pç{t)] ^ > 0. The value of the integral is non- 
zero only if a < 2. According to the definition of a, this implies t > 2ttx. Then 



roc 

/ de T 2 ‘"(e) cos( 
Jo 



elogx) = - ^ 






1 



0(t — 27Tx) . 



2tt p{x)\ 

Inserting this result in Eq.(34) and changing to the variable r = t/x it follows that 



rOO 

T 2 ^{y) = - 2 dr p^{t) cos 

Jo 



XT log 



/ x-\-y/2 \ 
\x--y/2) 



0(r — 2tt) . 



When this expression is combined with the contribution coming from the 
diagonal terms, Eq.(30), we remark that the integrand exactly cancels for values 
of r > 27 t, leading to the following expression for T 2 = + T 2 



T 2 {y) = P S{y) + 



r2'l\ 

' / dr p^{t) COS 

Jo 



XT log 



/ x + y/2 \ 
\x - y/2) 



(36) 



Using Eq.(29) the last integral is straightforward and leads, by the definition (24), 
to 



Y2{y) = - 



Si [27ra:log(|^) 
TTxlog^xlog 



1 Si(27Tÿ) 
log^ X 'ïïy 



(37) 



The last approximation holds when y <^x. 
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This is the second result concerning the two-point correlation function be- 
tween the prime numbers, given by Eq.(l) in the Introduction. It confirms the 
presence of (anti) correlations, even for small values of y. Contrary to the diagonal 
part, the computation of the off-diagonal terms requires the pair-correlation func- 
tion between different non-trivial Riemann zeros. Their effect is to superimpose to 
the smooth part Eq.(31) some oscillations, and make moreover the function finite 
at small values of y, where the behavior is 



Y2{y) 




-2 + 




y ^ 0 . 



However, in the relevant range y > 2 the function — Si(27r?/)/7rj/ does not signif- 
icantly deviate from its smooth behavior —l/2y, to which it clearly tends when 
y > 1. 

An alternative view of the prime pair-correlations is offered by the Fourier 
transform of T 2 , called the form factor. It is defined as 



/*oo 

K{t) = 47t / dy cos(yr) T 2 (y) , (38) 

Jo 

whose inverse relation is 

1 

72(y) = ^ J dr cos( 2 /t) K{t) . (39) 

Comparing Eqs.(36) and (39) we deduce that the form factor of the prime numbers 
is, in this approximation, 

K{t) = 27Tp + 27rp^ log ©(27 t — r) . (40) 

For r < 2 tt this function behaves as K{r) — 2 ttp‘^ log(xr/27r). In the limit r ^ 0, 
K{t) — oc. This reflects the fact that the area under the function Y 2 {y) is infinite 
(cf. Eq.(38)). At r = 27t, K{t) has a discontinuity (of the first derivative), and 
saturates at the value K{r) = 27rp = 27t/ logx, which is proportional to the inverse 
of the mean spacing between prime numbers. The discontinuity of K (r) does not 
take place for values of r of order 27vp (as it happens for example for the form factor 
of GUE random matrices, see e.g. [19]), but on the much larger scale r = 27 t 27rp. 

In contrast, as it does happen in the GUE case, the diagonal approximation is 
“exact” until the saturation takes place. In the limit x ^ oo and for a fixed r, 
K{r) — 2 tï~p^ log(rrr/27r) 27rp, and the whole function tends to a constant. 
This behavior corresponds to Cramér’s model. The function K{r) is represented 
in Fig.l, using the value x = e^^. The behavior close to the origin is displayed in 
the inset. The function in fact vanishes when < r < t\/ x — 2.6 x 10~^. This is 
explained in the next subsection. 
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T 



Figure 1: Form factor of the prime numbers (at x = e^^). Inset: behavior close to 
the origin. 

3.3 Long-Range Deviations 

The form factor and the two-point function calculated in the previous sections 
were obtained by a straightforward approximation of the sums by integrals. It is 
possible to go beyond that approximation and make more accurate evaluations. In 
particular, we will show that for large values of y there are deviations with respect 
to the smooth tail predicted by Eqs.(31). 

A better description of the form factor is obtained when it is written in terms 
of the complex part of the Riemann zeros. Inserting, when x ^ oo. the relation 
(26) in Eq.(38), and taking the limit y <C x, we get 

K{t) = s(t- ■ ( 41 ) 

® a, a' ^ 

This is a very useful relation, that expresses the form factor as the average of 
a series of delta peaks located at r = {ta + ta')l2x. It is the analog for prime 
numbers of the semiclassical expression of the spectral form factor [3]. For small 
values of r (of the order of ti = ti/x) only the lowest Riemann zeros contribute 
to K{r). Assuming incommensurability between the ta’s, the non-diagonal terms 
involving these low-lying zeros are washed-out by the average and the form factor 
is approximated by the diagonal contribution. The latter takes the simple form of 
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a sum over delta peaks located at the rescaled t„’s 



K{t) ~ Kd{t) 




r 0 . (42) 



When the sum is approximated by an integral, the previous result, 

Kd{t) = 2-ïïf' log , (43) 

is recovered. It holds in fact not only in the limit r ^ 0 but in the whole range 
T < 27t. However, Eq.(43) is not an accurate description for r ~ ti. In particular, 
it ignores a basic property of A"(r), present in Eq.(42) but not in (43), namely 
K(t) = 0 for T < T\. With this property incorporated, K{r) takes the form 



{ 2ttp + 2ttp^ log(^) ©(27 t-t) 



if T < t [/x , 
if r > t\/x . 



(44) 



This expression justifies the behavior of K(r) close to the origin represented in 
Fig.l 

It is easy to find out the modifications induced in ^ 2 ( 2 /) by the vanishing of 
K{r) for T < T\. This simply amounts to impose a lower bound, at i = ti, in the 
integral in Eq.(28). Then picks up some additional terms related to the 

lower bound, and instead of Eq.(37) we now get, in the limit y x, 



Y2{y) = 




-Si(27ry) + Si{tiy/x) - logti sin{tiy/x) 
Try 



(45) 



In the range tiy/x 1 the contribution of the correction terms tends to 
(logti — l)/('7Txlog^ x), i.e. they add a (small) constant which is of order 1/x 
with respect to the value of —Si(2'Ky) / iry . However, when t\y/x ~ 1, the decay of 
the two Si functions cancel out and the sine function dominates, giving 



Y2{y) = - 




logti sm(tiy/x) 
ny 



y - ^/ti . 



(46) 



Thus, the smooth tail —ll2y predicted by Eq.(37) holds only for separations up to 
y ~ x/ti. For larger values of y the behavior is modified. The exponent of the tail 
is unchanged, but now Y 2 {y) oscillates with mean value zero according to Eq.(46). 
It implies the presence of regions of positive correlation, where the primes tend to 
attract instead of repel each other. 

Equations (45) and (46) arise from the existence of a lower limit in the integral 
in Eq.(28), but otherwise make use of the average density of the Riemann zeros. 
Such an approximation ignores the subtle interference mechanisms between the 
taS that could be at work in Eq.(27). And indeed such processes exist. In order to 
see them, we remark that the structure of Eq.(27) resembles that of the oscillatory 
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part of the density of states, Eq.(8). At some fixed height x, the comparison with 
Eq.(8) suggests that a constructive interference is going to occur at values of y 
defined by {x H- y/2) f{x — yl2) = p, with p some prime number (in the asymptotic 
approximation employed here constructive interference would also happen, though 
with a lower weight, at integer powers of primes, see [14]). This gives 

y = 2x^, p = 2,3,5,... (47) 

In contrast to p(x), in Y 2 {y) the sum will not produce delta peaks at these values 
because the sum in Eq.(27) is truncated by the average, and only zeros satisfying 
tc ~ 2'kx^ j y bix contribute significantly. Instead of delta peaks, some broaden 
and finite negative peaks of anti-correlation will be located at the positions (47). 
These are superimposed to the oscillations described by Eq.(46). The peaks were 
also found in Ref. [11]. An indirect numerical test of their existence is given in 
§4.2. The structure found is reminiscent of the complex zeros of the Riemann 
zeta function [8], where low-lying Riemann zeros create a hierarchical structure 
in the tail of their two-point correlation function, although in the present case, 
and contrary to what happens for the Riemann zeros, the shape of the correlation 
peaks depends on the averaging window size. 

4 Fluctuations of 7 t(x) 

The fluctuations of 7t(x) around its mean value are now considered. Many (and 
old) problems of number theory are directly related to them. As mentioned in 
the introduction, our purpose is to compute the moments of the distribution of 
the fluctuations of 7r(a;), as well as their autocorrelation. Before that, we briefly 
considered a related problem. 

There is a long-standing question about the relative value of 7t(x) with re- 
spect to its leading asymptotic average term, Li(x), and in particular about the 
first time 7t(x) becomes greater than Li(x). From Eq.(14), and to leading order in 
X, the asymptotic difference is 

tt{x) - Li{x) = 2-^ — - cos(ta logx) + sin{ta logx) 

logx logx ^ (I [2 

(48) 

In this approximation, the first crossing between 7t(x) and Li(x) is therefore defined 
as the lowest value of x for which 

nr,{x) > 1 . (49) 

We have introduced the normalized asymptotic function (cf Eq.(18)) 

7t(x^ '^11 

7T„(a;) = ^-j = -2 Ô cos(«a logx) + ic, sin(t« log a:) . (50) 

^X/ iogx „=l( 4 +*aj 
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1.3980 1.3982 1.3984 

X (10**316) 



Figure 2: 7 T;v( 2:) as a function of x in the neighborhood of the first crossing found 



It is possible to investigate numerically Eq.(49) for increasing values of x, 
by first using a relatively large step in x (of about 10^ points in each interval 
10-^ ^ 10-^^^) and then reducing the step whenever is greater than some 

arbitrary fixed value. By this procedure, the first crossing we could find was located 
at 

X, 1.39815 X . (51) 

A plot of ^^r(x) in this neighborhood is shown in Fig. 2. ^;v(^) has been computed 
truncating the sum to the first 2 x 10® Riemann zeros, which were provided by A. 
Odlyzko [20]. 

The value of x^ is in agreement with the lowest bound known up to date [21], 
which considerably reduces the previous bounds found by Lehman [22] (1.65 x 
10“®®) and by te Riele [23] (6.650 x 10®^®). 

4.1 The moments 

Our aim is to compute the moments of the asymptotic, x — > oo, distribution of 
the fluctuations tt{x) = 7t(x) — 7f{x). To leading order, it is useful to consider the 
normalized function (50), but lower-order terms will be taken into account as well 
later on. To characterize the distribution of the fluctuations, we shall compute the 
average of the k th moment over a window Ax, 

= A; = 1,2,3, ... . (52) 

The angular brackets denote an average over the position of the reference point x, 
as defined in §3. 
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Because of the average, any oscillatory behavior on a scale equal or smaller 
than Xi has zero average. Therefore, the first moment of the fiuctuating part 
vanishes, as it should 

Ml = {nN{x)) = 0 . 

The second moment is written 



M2 = 2 



ai ,0:2 



1 

(i 





cos [(tai 



taj log a;] 



sin -t„ 2 ))loga:l|) ■ 

(53) 



This equation is obtained by reducing the products to single trigonometric func- 
tions of the sum and differences of the arguments, and by further ignoring the 
terms involving the sum of the arguments because their average vanishes. 

In Eq.(53) only the terms satisfying ~ ^02 = ti/(27rx) have a 

non-zero average and therefore contribute to the double sum. In the large-x limit 
this difference is small, and it is legitimate to ignore the difference between tai 
and tcc 2 in the prefactors. Then the terms involving the sine do not contribute. 
Using the asymptotic definition of the form factor of the prime numbers, given by 
Eq.(41), the second moment takes the simple and compact form 



M 2 



xlog^x J 

Jo (ÎT^ 



(54) 



Different degrees of precision are obtained for M 2 according to the approx- 
imation used for K{r). The simplest one is the smooth approximation discussed 
in §3.2, and given by Eq.(44). The latter, together with Eq.(54), allow to make a 
simple qualitative estimate of the second moment. We notice that since K(r) sat- 
urates to a constant for large values of r, in that limit the integral is convergent. 
The main contributions come therefore from the lower limit r ^ 0, where the 
divergence is stopped by the short-time cutoff of K{r) dX r — t\/x. When K{t) is 
replaced by (44), the integral in Eq.(54) is somewhat cumbersome. To simplify the 
result, we only give the leading order of an expansion where x and t\ are assumed 
to be large parameters. The result is 



M 2 



1 

Tit I 



1 + log (^) 



1 

2'k‘^x 



0.04078 



1 

2'k‘^x 



(55) 



In the last equality we have used the value t\ — 14.1347 ... for the imaginary part 
of the lowest zero. In Eq.(55) the first term in the r.h.s comes from the diagonal 
smooth approximation of the form factor, Eq.(43). A more accurate estimate is 
obtained when, instead, a direct evaluation of the diagonal sum a\ = a 2 is made 
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in Eq.(53) , 

oo 1 1 - 

where we have used the well known result [13] V (i + ^a) — ~ + 

1 -\r (7 is Euler’s constant). 

In contrast to the diagonal contributions, the term — (27 T^x)“^ arises from 
off-diagonal terms in Eq.(53), and contain therefore information about the corre- 
lations acting among the Riemann zeros. This information is embodied through 
the discontinuity of the form factor of the prime numbers at r = 27 t, and gives rise 
to a lower-order correction to the asymptotic second moment. This, however, is 
not the leading order correction to M 2 . If in the expansion of the Li(a:) in Eq.(16) 
the term of order x/ log^ x is kept, it leads to a “diagonal” correction to M 2 given 
by logT^ X (I + • The specificity of the corrections arising from correla- 

tions, Compared to contributions from lower-order diagonal contributions, is the 
negative sign. 

The same principle can be extended to compute the higher order moments. 
The important point is that, as for the second moment, the main contributions 
come in all cases from the lowest terms in the sums. These are given by a gener- 
alization of the diagonal approximation that has been used for M 2 . Off-diagonal 
corrections provide asymptotically negligible contributions. In the following we 
ignore those corrections and concentrate on the main terms. 

The third moment. When the third power of is computed from Eq.(50), the 

product of three trigonometric functions is decomposed as a sum of trigonometric 
functions involving the sum (which has zero average) and differences of the argu- 
ments, which are of the type ~ (plus permutations). The generalized 

diagonal approximation consists in finding the terms for which the argument of 
the trigonometric functions vanishes. By assuming incommensurability of the 
there is no way to cancel linear combinations as those just mentioned. We then 
conclude that, 

Ms = 0 , X ^ 00 . (57) 

The same conclusion can be reached for all odd moments. The distribution of the 
values of 7t(x) is therefore asymptotically an even function, as was also found in 
[ 12 ]. 

As we now show, at a finite x the odd moments are non zero. To capture the 
non-vanishing terms the repetitions in Riemann’s formula (16) should be kept. 
Instead of Eq.(50), we now have 



(x)- 



- cos 



tct 



logx 



+ ta sin 




(58) 



The main qualitative difference introduced by the terms r ^ 1 are the 1 /r fac- 
tors in the argument of the trigonometric functions, that introduce asymmetries 
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in the behavior of When the third power of is computed, the ar- 

guments of the trigonometric functions have, up to a permutation of indices, the 
form ta^/ri + The cancellation of the argument of the oscillatory 

functions imposes the condition 



r\ T2 r-3 

For incommensurate the only possible solutions are a\ = a 2 = (^ 3 , 
two given (positive) integers and 



riT2 



which should also be an integer. The leading asymptotic solution that leads to an 
integer value of ra is ri = V 2 = 2, with — 1. All other solutions are of lower 
order in the limit x ^ 00 . Then the asymptotic non- vanishing expression for the 
third moment becomes 



Ms 




1.113 X 10“^ 
yjx 



(59) 



To evaluate the r.h.s., we made use of [12, 24] V (i + = 7 ~ log(47r) + 

271 + 7^ + 3 - 7tV8 - 3.71 ... X 10"^ (71 = -0.072816 . . .). 

To conclude this section, we give the result for the fourth moment. It is 
obtained by similar arguments as for the lower moments (repetitions are not nec- 
essary). The leading order contribution is found to be 



( 00 1 \ ^ 

S(3+«â)j ^5 (3 



^1 (3 + 

CO - 

3M| - 6 V ^ ~ 6.178 X 10“® . (60) 

t'iil+tlŸ 



The value of the fourth moment differs, by a factor —6 (| + 1^) ^ ~ —2.23 x 

10““*, from the value required by a Gaussian distribution. Similar results are found 
for the higher moments. 

To summarize, the distribution of 7 t(x) — 7f(x) is asymptotically an even func- 
tion whose momenta differ from the Gaussian law, as illustrated by Eq.(60). At 
finite values of x the distribution looses its parity and becomes an asymmetric 
function, with the third moment given by Eq.(59). Correlations between zeros in- 
troduce corrections, that were explicitly computed for the second moment, Eq.(56). 

To check these results we have numerically computed the distribution of the 
fluctuating part of 7t(x). This was done by subtracting the smooth part Eq.(15) 
to the exact value of 7r(x) (obtained with Mathematica) , and then computing 
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Figure 3: Normalized asymptotic distribution of the fluctuations of 7t(x) (full line), 
compared to a Gaussian law having the same variance. 



the distribution of the re-scaled quantity (7t(x) — n(x))/(y/x/ logx). The result, 
for 63000 points computed in the interval 10^ ^ 10^°, is shown in Fig. 3. We 
have numerically computed the moments of that distribution. We obtained M 2 = 
4.577 X 10-^ Ms = -3.123 x 10-^ and M 4 = 5.778 x 10“^ The second and 
fourth moments are in good agreement with the asymptotic predictions M 2 = 
4.620 X 10“^, and M 4 = 6.180 x 10“^. The third moment is too large compared to 
the average value expected from Eq.(59). 



4.2 The autocorrelation 

The function 7t(x) wildly oscillates around its mean value as a function of x. In 
the previous section we have studied the distribution of those oscillations. An- 
other important aspect of this process is how the oscillations are correlated at 
different points x. These “memory effects” may be characterized by computing 
the autocorrelation function of the fluctuating part of 7t(x), defined as 

C(y) = (tTnIx - y/2) + y/2)) . (61) 

The average window satisfy the same conditions as those defined for the moments. 

C(y) is asymptotically evaluated by using the expression (50) for ^^(x). 
Because of the same reasons as for the second moment, the resulting double sum 
is asymptotically dominated by the diagonal terms (the off-diagonal terms add 
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(l+s/2)/(l-s/2) 

2 3 456789 




S = y/x 



Figure 4: Autocorrelation of the normalized oscillatory part of 7t(x), as a function 
of s (lower scale) and (1 + 5/2)/(l — s/2) (upper scale). Dots: numerical values. 
Solid line: Eq.(62). 



some asymptotically vanishing corrections). This approximation leads to 

1 -f s/2^ 






ta log 



1 - s/2 



(62) 



which depends only on the rescaled variable 5 = y/x. To test this result, the 
exact values of the fluctuating part n{x) = 7t(x) — 7t(x) were computed using 
Mathematica for some 60000 values of x in the range 10"^-10^°. Then the average 
autocorrelation function was computed for several windows in that range. The 
result is represented, as a function of the rescaled variable s = y/x, in Fig. 4. It 
is compared to the prediction Eq.(62). Notice that, because of the large values of 
X used in the numerical evaluation, the relative distance y takes also very large 
values and therefore correlations between points separated by large distances exist. 

The first change of sign of C{s) occurs at s 7 t/(2^i) = 0.11 As seen in the 

figure, (7(s) is a non decaying oscillatory function. This behavior is due to the 
dominance of the lowest Riemann zeros in the sum (62). 

The structure of C{s) as a function of s, which shows a series of (inverted) 
parabolae connected by points where the first derivative of the function is discon- 
tinuous (edge points) has its origin in the following connection. 
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Consider the sum of the prime numbers smaller than x, measured with respect 

to X, 

u (x) = ~ p)Q{x - p) . (63) 

P 

Prom a physical point of view, U (x) represents (up to a sign) the grand-potential at 
zero temperature of a non-interacting Fermi gas confined to an external potential 
whose single-particle energy levels are given by the prime numbers. In this analogy, 
the variable x represents the chemical potential. An analogous quantity for the 
zeros of the Riemann zeta function has been studied in detail in [25]. 

The previous sum can be alternatively written as U{x) = 7r(y)dy. We are 
int^ested in the fluctuations of this function with respect to its average, defined 
as U (x) = U{x) - f^W(y)dy, or 

U(x)= / TT{y)dy . 

Jo 

The qualitative behavior of this function is clear from this expression. Since ^(x) 
has a sawtooth shape (with teeth of negative slope and vertical discontinuities), 
its integral consists of a series of parabolae (each corresponding to a tooth of ^(x)) 
connected by discontinuities of the first derivative of /7(x), which occur when x is 
equal to a prime number. This behavior coincides, qualitatively, with the behavior 
of C{s) observed in the figure. 

We now show that the asymptotic behavior of U (x) (appropriately normal- 
ized and with a rescaled argument) coincides with Eq.(62). To see this we first 
notice that, according to Eq.(16), 7t(x) ~ Li(x‘^/^), with u = 1/2 + it a- Since 
/ dx Li(x^A) = — Li(x^+‘^/'^) + x Li(x‘^/^), keeping in the limit x ^ oo the ap- 
proximation Li(x) ~ x/logx, we obtain 



U{x) 



2 ^ 1 

iï+tDiî + tl) 

^ '3 



- ) COs(^a logx) +2 ta Sm{ta log x) 



(64) 



Since in each bracket t^ is much larger than its companion constant, we can neglect 
those constants without introducing an important error. By the same reason, the 
term containing the sine is neglected compared to the one with the cosine. Then 

2 t3/2 ^ 1 



Up to normalization of this function by the factor x^/^/ logx and by the transfor- 
mation X ^ (1 -f s/2)/(l — s/2), the oscillatory part of U{x) therefore coincides 
with C(s), given by Eq.(62) (when the 1/4 in the denominator is neglected). To 
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explicitly show that in C{s) the discontinuities of the derivative joining the parabo- 
lae are, as predicted by this connection, related to prime integers, in the upper 
part of Fig.4 we have displayed the variable (1 + s/2)/(l — s/2) for the values of 
s indicated in the lower axis. The discontinuities observed at powers of integers 
in the plot of Eq.(62) are spurious and due to the asymptotic approximation used 
(cf [14]). 

Finally, we also notice that the diagonal expression of the autocorrelation of 
7t(x), Eq.(62), and the diagonal part of the density of prime pairs, Eq.(27), satisfy, 
to a good approximation, the relation 



d^cjs) 



—X log^ X 



= xs) 
(s2/4- 1)2 



It follows from this that the structure of C{s) discussed above and displayed in 
Fig. (4) is related to the long-range deviations of F 2 (y) considered in §3.3. In partic- 
ular, the abrupt changes of slope observed in C{s) at values {l-\-s/2)/{l — s/2) — p, 
p — 2,3,5,7, ..., provide a numerical test of the presence of anti-correlation peaks 
(located also at positions determined by the lowest prime numbers) in the asymp- 
totic density of prime pairs. 



5 Summary and concluding remarks 

The properties of prime numbers and those of the complex Riemann zeros are 
intricately interrelated. The duality and close relationships that operate between 
these two sets - one behaving as eigenvalues of random matrices, the other re- 
sembling a random uncorrelated sequence - is one of the more striking features 
of number theory. The present investigation further illustrates and clarifies this 
connection. 

Our first task has been to revisit one of the most basic aspects of the Hardy- 
Littlewood conjecture, namely that for large distances y ^ 1 the average prime 
pair correlation behaves as i? 2 (y) = ~ l/2y). This follows from the diagonal 

contributions to R 2 {y)- It is derived under Riemann hypothesis, and requires the 
average density of the Riemann zeros as a further ingredient. Some rigorous results 
related to this asymptotic behavior can be found in [17]; it was also related to the 
discreteness of the sequence of prime numbers in [11]. 

Montgomery’s random matrix conjecture concerning the correlation function 
of the zeros allows to improve the formula for the average behavior of the pair 
correlation of the primes. To the smooth decay of 7^2 (y) mentioned above, now 
is superimposed an oscillatory structure described by a sine integral function (cf 
Eq.(37)). This provides a more accurate description of the average properties of 
the correlations. The corresponding form factor has been explicitly computed. We 
ignore if this result can be directly obtained from the pair correlation function 
conjectured by Hardy and Littlewood, Eq.(21). 




S750 



P. Leboeuf Ann. Henri Poincaré 



Finally, we argued that the previous expressions of R 2 {y) are only valid in 
a finite domain of distances. As a general rule, the correlations between primes 
described above hold only for distances y ~ x/t\^ where t\ is the imaginary part 
of the lowest Riemann zero. In the limit x ^ oo, this scale is much larger than the 
typical distance, of order logx, between consecutive prime numbers. For values 
of y ^ x/ti, a transition takes place and a different behavior begins. In the lat- 
ter regime, regions of positive correlation exist, and sharp anti-correlation peaks 
appear at distances determined by the lowest prime numbers. 

A related problem is that of the fluctuations of tx{x) around its mean value. 
The moments of the distribution were explicitly computed in terms of convergent 
sums over Riemann zeros. Linear independence of the latter has been assumed. 
Asymptotically the distribution is symmetric and non-Gaussian, in agreement 
with the findings of Ref. [12]. However, at finite x the odd moments are non-zero, 
and arise from terms with r > 2 in Riemann ’s formula. Also, the correlations 
between zeros add finite-x corrections to the moments, which can be efficiently 
computed using the form factor of the prime numbers. Illustrating a different 
aspect of the fluctuations, a numerical search made for finding the first crossing 
between 7t(x) and Li(x) gave a value that is in agreement with the best bound 
known today. 

In the context of dynamical systems, the moments and distribution of the 
fluctuations of the counting function of the quantum eigenvalues have been in- 
vestigated in recent years. The general conclusion [26] is that asymptotically the 
distribution is universal (and obeys a Gaussian law) when the corresponding clas- 
sical dynamics is fully chaotic, while it is system-dependent, non-universal for an 
integrable dynamics. The two basic “spectra” encountered in number theory obey 
to that general rule. The fluctuations of the counting function N(t) of the Riemann 
zeros has been shown to be asymptotically Gaussian [27], in agreement with the 
behavior of a chaotic quantum spectrum. In contrast, the fluctuations of 7t(x) (the 
counting function of prime numbers) are not Gaussian. This is consistent with the 
fact that, asymptotically, the prime numbers behave as an uncorrelated sequence, 
and can therefore be assimilated to the spectrum of an integrable system. 

The autocorrelation of the counting function has a peculiar structure. First, 
it is a non-decaying function, showing from a different perspective the existence 
of long-range correlations. Second, the distribution of the fluctuations give the 
estimate ~ 0.2y/x/ logx for the typical size of the oscillations with respect to the 
mean. The autocorrelation obtained then implies that those oscillations have a 
typical size ~ O.lx in the x direction. Third, the peculiar structure of the autocor- 
relation was explained by making a connection with the fluctuations of the sum of 
the prime numbers < x, U{x). Their numerical verification (cf Fig. 4) also demon- 
strates the existence of the anticorrelation peaks in the density of prime pairs at 
large distances mentioned above. 
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Statistical Mechanics of Many Interacting 
Point Masses in Spherical Geometry 

Yves Pomeau 



Abstract. The long terms dynamics of point masses in a trapping potential inter- 
acting gravitationally cannot be derived from the usual ergodic assumption. The 
microcanonical partition function diverges for more than two particles because of 
the large volume allowed in momentum space when the potential energy of a close 
pair becomes very large negative. In the long term the trajectories explore by dif- 
fusion a phase space of infinite volume. This does not say what happens for a large 
number of self gravitating masses, like the globular clusters or even the galaxies 
perhaps. I reexamine below this question and try to point out the various difficul- 
ties faced by a rational approach. There is a small parameter, the inverse number 
of masses. At the lowest order in this parameter, the dynamics is given by solutions 
of the mean field Newton-Vlasov (N-V) kinetic equation. This yields well defined 
predictions for the steady state under given initial conditions, at least without ini- 
tial angular momentum, because, besides the usual mechanical invariants, the N-V 
equation has as an invariant function, the histogram of the probability distribu- 
tion in phase space. The next step is far more difficult: it requires to take into 
account the irreversible effects due to ‘collisions’. Actually, these irreversible effects 
are mostly coming from the resonant interactions between orbits of commensurate 
or close to commensurate period, not from close pair encounters as usually assumed. 
The evolution there is the one of slow changes of the steady states of N-V. I discuss 
briefly the possible solutions on the long time scale linked to the ‘collisions’. 



The Note [1] explained what replaces Boltzmann’s ergodic assumption in systems 
of classical point masses interacting by Newton’s law in a trapping potential. Be- 
cause of the diverging attraction at short distance, equilibrium statistical mechan- 
ics in its usual sense does not apply there (as seemingly noticed first by Boltzmann 
himself). The case we dealt with involved the ergodic assumption in its purest form, 
that is independently of the limit of a large number of interacting masses. Because 
the constant energy surface of three or more point masses in a confining potential 
has infinite Lebesgue measure, the microcanonical ensemble cannot be defined. 

This concerns few particles and cannot be easily extended to situations with 
Y > 1 masses. I shall review the various problems posed by the statistical me- 
chanics of such a large number of interacting masses. Typically, I shall have in 
mind a situation where the masses are close enough and with a low enough energy 
that the cluster is self gravitating, with no need of an external confining potential. 

The interest of this situation is that a small parameter exists, the inverse 
number of masses. Thanks to this there is a short time scale, called to later on, 
typical of the mean field/ collisionless evolution. This is the time scale of the mean 
field N-V kinetic equation, written below. I shall review briefly the status of this 
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N-V equation, and then turn to effects that appear at higher order in the expansion 
parameter, and on longer time scales though. 

1 The Newton- Vlasov kinetic equation 

The N-V kinetic theory describes the dynamics of the one-body probability distri- 
bution for the particles, a positive function of time t, of the position of the particle, 
r and of its velocity v (boldface are for 3D vectors in Euclidean space- no general 
relativity effects here) . The number of particles in the element of phase space drdw 
at time t is E(r, v, ^)drdv. This probability distribution is normalized in such a 
way that 

J F(r,v,t)drdv = N 

The N-V equation of motion for F(.) is: 

|^+v.V,F-Vr$.VvF = 0 (1) 

In this equation, ^(r) is the self consistent attracting potential depending linearly 
on the mass density distribution p{r,t) — f dvF(r,v,t) and given by: 

$(r) = -gJ (2) 

G being Newton’s constant, and all particles have the same mass m. 

For times large on the scale to the solution of N-V tends to a spherically 
symmetric form (in the absence of total angular momentum) independent of time, a 
not so trivial assumption after all. If the potential is spherically symmetric, steady 
solutions of N-V should be functions as /(^;^, r, (r.v)^). But on the other hand, 
once ^(r) is given, the N-V kinetic equation is time independent only if it depends 
on the constant of the motion of a single mass in this potential. The constant of 
the motion are the one particle energy divided by the mass e = ^ + $(r), and 
the square of the angular momentum. Given and it can be replaced in the 
argument of the probability distribution by (r.v)^ by elementary algebra (notice 
that ^(r) is used instead of $(r) to emphasize the spherical symmetry). The 
potential ^(.) is related to /(e, (r.v)^), by equations (1,2). This class of solutions 
/(u^,r^, (r.v)^) is closed under the evolution. Therefore, the angular momentum 
of each trajectory is conserved in the course of time, and there is no exchange 
of particles between the various values of this angular momentum at the N V 
order. An important remark due to Robert (private communication) is that besides 
the usual mechanical invariants, there is an added invariant of the N-V kinetic 
equation, that is the density in phase space. Here this density is conserved for each 
value of the angular momentum, the invariant density of F becomes a function of 
two variables, /o and Jq: 

P{fo, ./ol-F) = J drd-v0D{F(v^,r, (r.v)^) - fo)SD{Jo ~ (r.v)^) 



(3) 
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In equation (3), is the usual Dirac distribution. In principle, the knowledge 

of the function P(/o, Jo\F) is enough to get the final steady state solution of N- 
V when it depends on e and (r.v)^, although the explicit formulae are far more 
cumbersome than the ones for the case without (r.v)^ Dépendance, to which we 
shall restrict ourselves now. The explicit relationship between P(.|.) and /(e) is: 

PiMf) = (47 t )2 drr^ Ç _ $(,.)) (4) 

The sum in (4) is over the roots ei, 62, ...,ep...of /(cp) = /o, |/p(/o)| is the 
absolute value of ^ for the root e — and S{X) is 1 for X > 0 and zero 
otherwise. This, together with equations (1,2) yields an explicit (but not simple!) 
way to compute P(.|.) once /(e) is known. Reciprocally, given P(.|.) computed 
with the given initial value F of the distribution in phase space, one can at least 
theoretically use equation (4) to get the final form /(e) of the energy distribution. 
Therefore, up to possible degeneracies in the various inversion problems involved 
there, one can say that, given the initial F{y^ (r.v)^), one can compute the final 
steady state solution of the N-V kinetic equation. 

A word of caution however is appropriate here: all this assumes that the 
masses remain trapped inside the self consistent potential. It is an interesting issue 
(and to my knowledge largely untouched) to assess the validity of this assumption. 
Presumably, there are initial conditions such that part of the masses ‘evaporate’ 
in the long run, with some fraction remaining trapped. Then the trapped masses 
are in a kind of marginal state, since some masses should have orbits close to free 
parabolic orbits (they are close to parabolic because they spend most of their time 
far from the cluster). 

In the next Section I shall examine the effect of irreversible collisions and 
interaction of resonances on the N-V equilibria. 

2 How do collisions and other irreversible eflFects 
change the N-V equilibria ? 

The mean field interactions looked at in the previous Section are formally re- 
versible, although it does not preclude at all an irreversible evolution toward an 
uniquely defined steady state. This is because the notion of irreversibility has not 
much significance for systems with infinitely many degrees of freedom, as the one 
under consideration. The foremost example of built-in irreversible effect is the one 
of the close two-body encounters, neglected in N-V, and taken into account in 
the familiar Boltzmann kinetic theory. Two things should be said concerning their 
effect in the present framework. First they are rare in the sense that they occur 
with a frequency far less than the one of the orbital motion in the mean field. Then 
they are not the dominant irreversible effect in the limit N very large: they are 
dominated by the resonances between trajectories with commensurate frequencies 
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in the mean field. Consider a cluster of N identical masses, N large. The typi- 
cal time scale to for the dynamics of this cluster is the period of an orbit in the 

average field. Supposing that the orbiting mass has as much potential energy as 

/ 3 \ 1/2 

kinetic energy, one obtains tz) ~ ( ) , where R is the size of the system 

(assumed constant on the time scale to), rn mass of each individual particle. In 
this approach, one assumes the masses more or less uniformly distributed inside a 
volume of size R, that the escape rate remains small and that R is also the size of 
the single particle orbit. 

In the large N limit, the interval between two close binary collisions along a 
particle trajectory is much longer than tp (ternary collisions are neglected, being 
even rarer, which makes very unlikely the formation of close binaries if there is 
none at the beginning, a situation very different of [1]): the mean free flight time is 
tcoii ~ ^ number density, b impact parameter and v average particle velocity, 

with n ^ ^ and v ^ The impact parameter b should be such that the 
collision changes significantly the trajectory. This happens when the interaction 



energy with the other partner in the collision, is of the same order as the 
interaction energy of the particle with all the others, - . This gives b ^ 
much shorter than the interparticle distance for N large. Therefore, tcoii ^ Ntjj 
is much longer than for A/' large [2]. 

However things are not that simple: it is well-known that the Boltzmann 
kinetic theory for particles (be they electric charges or Newtonian masses) in- 
teracting via a 1/r potential diverges logarithmically because at the long range 
interaction (this divergence is in the total cross section for momentum transfer). 
In an electrically neutral plasma, this divergence is cut-off because the charges are 
screened at distances larger than the Debye length. Here there is no such screening, 
and the full range of interaction parameters should be kept. Therefore the longest 
length scale to be inserted into the diverging logarithm must be the largest pos- 
sible length, namely the diameter of the cluster, proportional to if one scales 
out the total energy. The diverging logarithm is to be replaced by (InN), so that 
the estimate of the relaxation time becomes 



icoii 



N 



This leaves however unsettled the question of the physics of this relaxation process: 
because it is dominated by interaction at distances of the order of the size of 
the cluster, the only irreversible interaction at this scale is by resonance between 
individual trajectories with commensurate frequencies. No theory showing this 
explicitely has been proposed yet, although progress in this direction has been 
made by Luciani and Pellat. 

The coming Section is even more speculative since it builds-up on a kinetic 
theory including these resonant interactions that is yet to be written! Nevertheless 
I shall present remarks and speculations on the possible behaviour of solutions of 
this yet unwritten kinetic equation. 
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3 Long time behaviour of the kinetic equation 
including irreversible effects 

As I just said, no such consistent theory exists yet. Therefore it is quite speculative 
to discuss what such a theory would predict! Although this is absolutely not obvi- 
ous, one may assume that things are not too much dependant upon the detailed 
form of the kinetic operator. 

I shall assume that the irreversible effects add to the N-V equation (1) a 
collision term that is formally small. Such a term has been written by [4] for 
instance. It would not help much to write it here. Its derivation can be found in 
Liboff’s book [5], based on Landau’s kinetic theory for plasmas. The principle of 
the derivation followed by Cohn and coworkers in [4] and related works is based 
upon the assumption that the equilibria of the N-V equation slowly evolve under 
the effect of the small collision term (a similar method is used in [3]). The potential 
$(r) is assumed to be a slowly varying function of the time (to be found at the end 
by a self consistency condition). One writes first formally the steady state solution 
of N-V as a function of the constants of the motion, the angular momentum and 
the energy of each individual mass me. The collision term is separated out of 
the N-V (dominant) term by averaging the full kinetic equation over the periodic 
trajectories in the mean field. The N-V term disappears and only the averaged 
collision term remains. Practically however the details of this calculation are rather 
tricky (see [4]). 

There remains to find the long time behaviour of solutions of this kinetic 
equation. It can be safely said first that, although it is a basically irreversible 
equation, it does not describe evolution toward any equilibrium, just because there 
is no such equilibrium solution. Recently a lot of interest has been devoted to 
related problems [6], [7], for kinetic equations in wave turbulence and for Bose- 
Einstein particles. Various scenarii are possible for the long term behaviour of 
their solutions. The problem of interest is the behaviour at very large negative 
energies. The central concept there is the one of cascade of (negative) energy. It 
can occur in two different ways. In the so-called finite capacity case, the energy 
distribution builds up a tail going to infinitely negative energies in a finite time, 
by a self-similar time process. Later the solution builds a Kolmogorov-Zakharov 
algebraic tail with a constant energy flux toward the infinitely negative energies. 
The scaling exponent for this Kolmogorov-Zakharov solution are of the first kind 
in the Zel’dovich classification: they can be derived purely from scaling relations. 

The other possibility is the infinite capacity case of the large energy part of 
the system. Then the solution goes to infinity in an infinite time, with scaling laws 
of the second kind that are to be found by solving a non linear eigenvalue problem. 

In the first case (Kolmogorov-Zakharov spectrum), this solution looses sense 
at too low densities (and so at too large negative energies): at finite number of 
masses, the tail of the spectrum (that is integrable by definition) stops to make 
sense beyond energies such that the total number of particles of higher energy is 
of order 1. Presumably the continuous cascade is replaced by a constant flux of 
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masses of low energies orbiting inside a dense local cluster at the center of the 
N masses, something reminiscent of the observed core of the globular clusters. In 
the infinite capacity case (cascade to large negative energies building up forever), 
things are likely similar, except that the deeply bound core is not filled at constant 
rate, but at a rate depending on the way the tail grows in time. 

4 Conclusion 

To conclude this rather speculative essay, it seems that we have a well defined 
problem at hand together with a lot of tools borrowed from modern developments 
on solutions of kinetic equations. Hopefully progress on this issue will be made in 
the near future. It is certainly an example where the brute numerical force will 
not help much: the larger is the number of masses, the more different are the time 
scales, which makes it unlikely to reach very long times necessary to explore the 
interesting long term dynamics, the short term dynamics being well understood. 
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Quantum Information Processing with Quantum Optics 
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Abstract. We review theoretical proposals for implementation of quantum comput- 
ing and quantum communication with quantum optical methods. 



1 Introduction 

It is generally recognized that all the microscopic phenomena that we observed 
can be described and explained by the principles of Quantum Mechanics. These 
principles have been extensively tested, and some of them are commonly used in 
several technological applications. Other principles, like the ones related to the 
superposition principle and the measurement process, and which are in the realm 
of most of the paradoxes and strange phenomena related to Quantum Mechanics, 
have only recently become important in some applications. In particular, they 
form the basis of a new theory of information which may revolutionize the fields 
of communication and computation [1]. 

Quantum communication deals with sending quantum states from one place 
to another one in such a way that they arrive intact. The most important appli- 
cation so far in this field is the one in which a sender (Alice) tries to convey a 
secret message to a receiver (Bob). The message is encoded in the state |^} of a 
quantum system. Due to the fact that the quantum state of a system is distorted 
if somebody performs a measurement, Bob will receive a wrong state if a third 
(malevolent) party (Eve) tries to read the message. This way of secret communi- 
cation is called quantum cryptography, and it is the only provably secure way in 
which two partners can share secret messages. In the context of quantum compu- 
tation, the existence of entangled states of several particles offers the possibility of 
performing certain computational tasks in times much shorter than the ones taken 
by common (classical) computers. By acting on a system entangled to other sys- 
tems, one modifies the state of the whole system at the same time, which leads to 
an important speed up in several computations. In particular, if one could build a 
quantum computer one would be able to decompose very large numbers (of n 1 
digits) into prime factors in a time that scales polynomially with n (t ~ an^, with 
a and h constants) [2] , in contrast to the exponential dependence of classical com- 
puters [3]. A quantum computer would therefore allow to break all the classical 
cryptographic protocols which are based on the impossibility of factorizing large 
numbers in relatively short times. There are also other algorithms which make use 
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of the superposition principle in Quantum Mechanics and they are more efficient 
that the classical counterparts. 

The experimental situation in quantum communication and quantum com- 
putation is very different. Whereas in the first case it is already a mature field, 
close to reaching the commercial level, the second one is still in its infancy. For 
the moment, it is only possible to construct very small prototypes of quantum 
computer models which are certainly not useful per se. Nevertheless, pursuing re- 
search in this field does not have the only goal of being useful in the near future, 
but there are several other goals which can be attained on the way. In particular, 
creating a quantum computer means that we can manipulate the quantum state 
of an enormous system at will which apart from being capable of bringing some 
surprises to our present knowledge of Quantum Mechanics, paves the way for some 
other applications based on this theory which may be discovered in the future and 
do not require a large system. On the other side, present experiments on quantum 
cryptography with photons cover distances of the order of 50 km. Their extension 
to longer distances does not seem to be straightforward though, since the systems 
carrying the quantum states (i.e. photons) are eventually absorbed and therefore 
the quantum states are distorted before they arrive at their destination. A way to 
overcome this problem is to use quantum repeaters [4], in which small quantum 
computers amplify in a sense the quantum states so that they arrive safely at their 
destination. 

There are very few systems in which one can implement a small quantum 
computer. Many of the ideas come from the field of Quantum Optics. The reason 
is the spectacular experimental development of this field during the last years, 
which has allowed, so to say, to dominate the quantum world. These methods, 
when combined appropriately, allow, at least in principle, to perform quantum 
computations and to build quantum repeaters. Here we will review on a qualitative 
level some of the quantum optical systems that have been proposed to perform 
quantum computations. We will start with a brief introduction to some of the 
main topics in quantum information theory. 

2 Entanglement 

Most of the counter-intuitive predictions of Quantum Mechanics are related to 
the superposition principle, and this principle, when applied to more than one 
system, may lead to very intriguing phenomena related to non-locality. The basic 
ingredient of such phenomena is entanglement^ i.e. the possibility of having two 
or more systems in a state which displays quantum correlations. These entangled 
states lie at the heart of quantum communication and computation. 

The characterization of entanglement is one of the central theoretical issues 
in Quantum Information Theory. In fact, there are still many open questions re- 
garding the entanglement properties of two or more quantum systems. For pure 
states of two systems, entanglement is well understood, of course. For example. 
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a system B consisting of two state systems or qubits {|0), |1)} a state of the 
form \0) (^\0) e H = Ha 0 Hb is a product state, while the Bell states 

1 ^^) = ^(| 0 , 1 ) ± | 1 , 0 )), |$±> = ^(| 0 , 0 ) ± | 1 , 1 » 

are examples for entangled states. The most important properties of entangled 
states is that they carry correlations, correlations which are stronger than classical 
correlations as expressed by Bell’s inequalities [5]. Entanglement of pure states is 
quantified by the Schmidt decomposition [ 1 ], and the entropy of entanglement^ 
i.e. the von Neumann entropy of the reduced density operator of the composite 
quantum states [ 6 ]. 

In order to create entangled states out of product states we need interac- 
tions because under time evolution with a Hamiltonian H = Ha 0 1b + Is 0 Hb 
a product state will remain a product state. Similarly, we cannot get entangled 
states by measuring observables in A and B, independently since the state after 
the measurement will be changed by local operators. One says that entanglement 
cannot be created by local operations (operations meaning any action on the sys- 
tems). Note, however, that product states can be obtained by local operations (in 
particular, by measurements). 

The quantification of the multipartite entanglement is a more complicated 
question. This can be illustrated by the following example. Let us consider three 
parties and the states [7], 

IGHZ) = ^(|0,0,0>-|1,1,1)) (1) 

|W) = ^(|0,0,1} + |0,1,0) + |1,0,0)). (2) 

Those are entangled states according to the partition (Ai A 2 A 3 ). However it is hard 
to say which one is more entangled. Certainly, the first one possesses a sticking 
non-local behavior, in the sense that it can be used to prove Bell’s theorem without 
using inequalities [7] . However, it is very weak in the sense that if one party does not 
participate in the measurement (or is lost), then all the entanglement disappears. 
However, the second one retains some entanglement even if one particle is lost (in 
fact it is the most robust against particle losses) [ 8 ]. 

In reality, all systems interact with some sort of environment, a process we 
call decoherence. Thus, we should include the state of the environment in our 
description in order to be consistent. In fact, due to the interaction between 
system and environment they will become entangled even if initially they were 
in a product state: |^ 5 '( 0 ))a 0 |^b(0))b ^ |^(^)) 5 B- Since we are only inter- 
ested in our system, all the information that we can acquire (without performing 
measurements in the environment) are contained in the reduced density operator 
ps{t) =tr^[|^(^))B£;(^(^)|], which will correspond to a mixed state. 
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We call a mixed state entangled if it is not separable, i.e. where separable 
states are defined by 

P = 'Y^Pk\ak,hk){ak,hk\ (Pfc > 0). 

k 

It turns out that it is very hard in practice to determine whether a given state is 
entangled or not. There exists, however, an important sufficient criterion that may 
be useful in some occasions. It states [9] that if has a negative eigenvalue (i.e. 
it is not positive) then p is entangled. Here p^^ stands for the partial transpose of 
p with respect to the first system in the basis ^ he. a{^\p^^W) a =a 

{k'\p\k). In general, the converse of this criterion is not true [10]. That is, there exist 
entangled states fulfilling p^^ > 0. However, in low dimensions (if x < 6) this 
criterion (called Peres-Horodecki criterion [9, 11]) gives a necessary and sufficient 
condition: p is separable if and only if > 0. For other separability criteria see 
[12]. 

Given some state, sometimes we would like to know ’how close’ it is to some 
pure state, like for example a Bell state. In order to measure this quantity we 
define the fidelity of p with respect to some state |^) as 

A fidelity F 1 means that our state is very close to the desired one. 

Most of the applications in the field of Quantum Information are based on the 
use of superpositions of pure states. However, in practice, the state that one has at 
disposal are mixed. For example, if one would like to perform quantum cryptog- 
raphy over long distances using entangled photons, when they arrive at the final 
location their state will also be entangled to the environment and therefore mixed. 
The longer the distance the photons have to travel, the more mixed they will 
become. Unfortunately, if they are significantly mixed, the security of the corre- 
sponding cryptographic protocol will no longer be ensured. This fact considerably 
limits the distances over which one can perform secure quantum cryptography. 
Fortunately, there is a method that allows to make the states more pure, and even 
more entangled. The idea is to use several copies of a state which is not useful for 
the applications of Quantum Information, but that it is still entangled [13, 14]. 
Using local operations and classical communication it is sometimes possible to ob- 
tain fewer copies of particles in a state which is closer to a maximally entangled 
states, for example the state |^"^). This process is called entanglement purification 
(or distillation). 

3 Quantum computing 

3.1 What is a quantum computer? 

A computation can be considered as a physical process that transforms an input 
into an output. A classical computation is that in which the physical process is 
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based on classical laws (without coherent quantum phenomena). A quantum com- 
putation is that based on quantum laws (and in particular on the superposition 
principle). In quantum computation, inputs and outputs are represented by states 
of the system. For example, enumerating the state of a given basis as {|1), |2), . . .}, 
the number N would be represented by the Nth state of this basis. A quantum 
computation consists of evolving the system with a designed Hamiltonian inter- 
action, such that the states are transformed as we want. Note that the operation 
that transforms input into outputs has to be unitary. In general, if our algorithm 
consists of evaluating a given function /, we can design an interaction Hamilto- 
nian such that the evolution operator transforms the input states according to 

\j) ® |o) ^ li) ® I/O)) {j = 1, N). 

Note that using this transformation we can, at least, do the same computa- 
tions with quantum computers as with classical computers. However, with quan- 
tum computers we can do even more. We can prepare the input state that in a 
superposition 



N N 

after a single run. In principle, all the values of / are present in this superposition. 
Note, however, that we do not have access to this information since if we perform a 
measurement we will only obtain a result (with certain probability). Nevertheless, 
we see that with a quantum computer we can do at least the same as with a classical 
computer, . . . and even more. This property of using quantum superpositions to 
run only once the computer was termed by Feynman quantum parallelism. 

3.2 Requirements 

A quantum computer consists of a quantum register (a quantum system) that can 
be manipulated and measured in a controlled way. In order to build a quantum 
computer, one needs the following elements (see also Ref. [15]): 

1. A set of qubits: These are two- level systems perfectly identified and forming 
the quantum register. We denote by {|0)/c, |l)/c} two orthogonal states of the 
k-th qubit, so that the state of all the qubits (the quantum register) can be 
written as 



i^)= E Ckik2...kN l^i)i 1^2)2 . . . ( 8 ) |A: 7 v)iv. 

k\ ,k2,.--kN=0 

In the following, we will simplify the notation and write |/ci, A: 2 , . . . kjsj) in- 
stead of the cumbersome notation that uses tensor products. Note that these 
qubits can be in superposition and entangled states, which gives the extraor- 
dinary power to the quantum computer. Note also that the state of the qubits 
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must be kept almost pure since otherwise the power of the superpositions 
would not be effective. This means that the qubits must be well isolated from 
the environment in such a way that decoherence is sufficiently slow. 

2. Universal set of quantum gates: The controlled manipulation of the qubits 
means that we can perform any unitary operation U on the qubits so that 
1^) ^ U|^). In principle, if we want to perform general operations we should 
be able to engineer arbitrary interactions between the qubits. Fortunately, 
this task is enormously simplified given the fact that any U can be decom- 
posed as a product of gates belonging to a small set, the so-called universal 
set of gates. This means that if we are able to perform the gates of this 
set we will be able to perform any computation (unitary operations on the 
register) by simply applying a sequence of them. There are many sets of 
universal gates, and of course, they are all equivalent. The most convenient 
set is the one that contains one two-qubit gate (an operation acting on two 
qubits only) plus a set of single-qubit gates. Let us recall here some of the 
gates of this sort (crx,y,z are the Pauli operators acting on a qubit): 

(a) Single-qubit gates: act on a single qubit. 

i. Phase gate: 

| 0 ) ^ 

|1) 

ii. Excitations: Uz^\0) = 

|0) ^ cos0|O) — isin^jl) 

|1) — » — isin^lO) + cos0|l) 

(b) Two-qubit gates; act on two qubits. 

i. Controlled- not: = |0)(0| ® 1 -I- 11)(1| ® 

| 0 , 0 ) ^ | 0 , 0 ) 

|0,1> - |0,1) 

|1,0) - 11,1) 

|1,1> ^ |1,0) 

This gate changes the state of the second qubit conditioned to the 
state of the first one. The first qubit is therefore called control 
qubit, whereas the second one is called target. 

ii. Controlled-phase: = |0)(0| ® 1 — |1)(1| ® cTj, 

| 0 , 0 ) ^ | 0 , 0 ) 

|0,1) - |0,1) 

|1,0) - |1,0) 
ll,l) - -ll,l) 
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Two different universal sets of gates are [1]: 

S2 = {C/®,C/(i)(7r/4),t/i^)(^),^e[0,27r)}. 

Note that the two-qubit gates require interactions between the qubits, and 
therefore are the more difficult ones in practice. The fact that the operations 
are unitary (and therefore reversible) means that the uncontrolled interaction 
with any other part of the quantum computer must be avoided. 

3. Detection: One should be able to measure on each of the qubits (or, equiv- 
alently, to detect whether they are in state |0) or |1)). Note that this process 
requires the interaction with a measurement apparatus in an irreversible way. 

4. Erase: We must be able to prepare the initial state of the system, for example 
the state |0, 0, . . . , 0). Actually, this is not an extra requirement since if one 
is able to detect and to apply the single-qubit gate Ux^^ir) this is enough. 

5. Scalability: The difficulty of performing gates, measurements, etc., should 
not grow (exponentially) with the number of qubits. Otherwise, the gain in 
the quantum algorithms would be lost. 

For the moment, we know very few systems which fulfill the requirements to 
implement a quantum computer with them. Perhaps, the most important problem 
is related to the necessity of finding a quantum system which is sufficiently isolated, 
and for which the required controlled interactions can be produced. We will discuss 
below examples from quantum optics. We note, however, important proposals in 
solid state systems based on Cooper pairs and quantum dots, as well as the nuclear 
magnetic resonance system, which in its present form is a high temperature system 
and thus not scalable (see Ref. [16]). 

Finally, we remark that in any computation (classical and quantum) or during 
storing of information errors will occur. It is essential for quantum computing to 
make sense that quantum error correction can be developed. We refer to reference 
[1] for details. 

4 Quantum communication 

The situation one has in mind in quantum communication is the following: Alice 
wants to send Bob an unknown state |^). One way of doing this is to send the 
particle carrying the state directly. However, the particle will very likely interact 
with the environment which may result in a different state, generally mixed. There 
are some ways of avoiding this. In the following we will describe a basic tool in 
quantum communication which allows to send a quantum state from one place 
to another provided one has a maximally entangled state shared between the two 
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places, and is able to communicate classically without errors. A high fidelity EPR 
pair can be generated by a quantum repeater protocol. 

By teleportation we define a protocol to transfer an intact quantum state from 
one place to another, by a sender who knows neither the state to be teleported nor 
the location of the intended receiver [17]. Consider two partners, Alice and Bob, lo- 
cated at different places. Alice has a qubit in an unknown state |(/)), and she wants 
to teleport it to Bob, whose location is not known. Prior to the teleportation pro- 
cess, Alice and Bob share two qubits in a Bell state |^“) ~ |0,1) — |1,0). The idea 
is that Alice performs a joint measurement of the two-level system to be teleported 
and her particle. Due to the nonlocal correlations contained in the Bell state, the 
effect of the measurement is that the unknown state appears instantaneously in 
Bob’s hands, except for a unitary operation which depends on the outcome of 
the measurement. If Alice communicates to Bob the result of her measurement, 
then Bob can perform that operation and therefore recover the unknown state (for 
experiments, see [18, 19, 20]). 

It is important to note that Alice ends up with no information about her 
original state in this protocol so that no violation of the no-cloning theorem occurs, 
and there is no instantaneous propagation of information. Bob has to wait until 
he receives the (classical) message from Alice with her outcome. Before he receives 
the message, his lack of knowledge prevents him from having the state. Note that 
no measurement can tell him whether Alice has performed her measurement or 
not. Since teleportation is a linear operation applied to a state, it will also work 
for statistical mixtures, or in the case in which particle 1 is entangled with other 
particles. 

We have now all necessary tools available to introduce the concept of the 
quantum repeater. Our goal is to create an EPR pair of high fidelity between two 
distant locations. Since nonlocal entanglement between distant particles cannot be 
created using only local operations, this involves the usage of a quantum channel, 
which is noisy in general. The bottleneck for communication over large distances 
is the scaling of the error probability with the length of the channel. When using, 
for example, optical fibers and single photons as a quantum channel, both the 
absorption losses and the depolarization errors scale exponentially with the length 
of the channel. The state of the photon or the photon itself will therefore be 
destroyed with almost certainty if the channel is longer than a few half-lengths of 
the fiber. 

To overcome this problem, one can use quantum repeaters. The idea of such a 
repeater is to divide a long quantum channel into shorter segments, which are pu- 
rified separately, before they are connected. Connecting two segments of a channel 
means here to build up quantum correlations across the compound channel from 
correlations that exist across the individual segments. This can be done by telepor- 
tation of entanglement. A quantum repeater must therefore combine the methods 
of entanglement purification and teleportation. Although the combination of these 
methods should, in principle, allow to create entanglement over arbitrary distances, 
it is another question how much this "costs” in terms of resources needed for purifi- 
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cation. Resources means here the number of low fidelity entangled pairs that have 
to be provided for purification of each channel segment. This quantity is related 
to the number of particles that have to be manipulated locally (at the connection 
points between the segments) in a coherent fashion. If the resources grow too fast 
with the length of the channel, not much will be gained by the whole procedure. 
A further important quantity is the error tolerance for the local operations. In 
every real situation, the local operations applied to one or more particles will bear 
some imperfections. Since such operations are the building blocks for any entangle- 
ment purification protocol, their imperfections will limit the maximum attainable 
fidelity for an EPR pair and the efficiency of the protocol. In the context of the 
quantum repeater, a maximum fidelity F < 1 corresponds to a residual amount of 
noise for each segment. When the segments are connected, this noise accumulates. 
To overcome this limitation [4], we can divide the long channel into N smaller 
segments and create less distant entangled pairs across each segment. The number 
of segments N is thereby chosen in such a way that it is possible to create entan- 
gled pairs with sufficiently high initial fidelity F > Fmin over the distance of such 
a segment, such that they can be purified, according to our previous discussion. 
In a next step, we connect these “elementary” pairs by using teleportation. For 
example if we have an entangled purified pair between the nodes Aiand A2, and 
another one between A2 and A3, we teleport the state of the first particle in A2 
to A3 by using this second pair. The result of this teleportation will be that the 
nodes Aiand A3 will now share an entangled state. Of course, due to imperfections 
during the teleportation procedure, as well as the fact that the pairs used were 
not perfectly pure, the new entangled state will not be pure. But as long as its 
fidelity is larger than a certain F^in, it will be possible to purify it to a value close 
to an Fmax (where Fmin = 1/2 and Fmax = 1 for an ideal purification protocol). 
Thus, the crucial point is that, on the one hand, the operations that are performed 
cannot be too noisy since otherwise they could decrease the fidelity below Fmin, 
which would make the process impossible; on the other hand, the distance between 
nodes has to be such that purification be possible. This limits the number of pairs 
one can connect before purification becomes impossible. We therefore connect a 
smaller number L N of pairs so that the resulting fidelity F^ stays above the 
threshold value for purification {Fl > Fmin) and purification is possible. 

The general strategy will be to design an alternating sequence of connection 
and (re-) purification procedures in such a way that the number of resources needed 
remains as small as possible, and in particular does not grow exponentially with 
N and thus with 1 . This is possible, in principle, using a nested purification pro- 
tocol [4]. 
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5 Quantum optical implementations 
5.1 Introduction 

This chapter discusses briefly various schemes for quantum information processing 
with trapped atoms and photons. Experimental realizations include laser cooled 
trapped ions, either in a linear trap [21, 22, 23] or in arrays of micro traps, and 
neutral atoms stored in far-olf-resonance traps [24] or optical lattices [25], Single 
atoms and ensembles of atoms can be stored in cavities, providing an interface 
between atoms and photons. 

All of the models discussed below share the feature that long lived internal 
atomic states, such as atomic hyperfine ground states or metastable states, serve as 
quantum memory to store the qubits. Photonic channels are used for transmitting 
quantum information between the nodes of a quantum network [26]. Furthermore, 
single qubit rotations can be performed by coupling the qubit states to laser light 
for an appropriate time period. We thus concentrate on discussing schemes for 
two-qubit gates and on the generation of entanglement between distant nodes of 
a quantum network as required for quantum communication. 

Schemes for two qubit gates can be classified in two categories. The first 
version relies on the concept of a quantum data bus: in this case the qubits are 
coupled to a collective auxiliary quantum mode, and entanglement of qubits is 
achieved by swapping qubits to excitations of the collective mode. Examples for 
such systems are the collective phonon modes in ion traps [27], and photons in 
cavity QED [28, 29]. The requirement of some of these schemes to initialize the 
data bus in a pure state can be relaxed by specific protocols for “hot gates” 
[30, 31]. The second concept for performing two-qubit gates deploys controllable 
internal- state dependent two-body interactions between atoms. Examples for this 
scheme are coherent cold collisions of atoms in optical traps and optical dipole- 
dipole interactions [32, 33]. A third example is the “fast” two-qubit gate based on 
large permanent dipole interactions between laser excited Rydberg atoms in static 
electric fields [34]. Besides these dynamical schemes for entanglement creation it 
is also possible to generate entanglement by purely geometrical means. We refer 
to [35] for proposals on holonomie quantum computing. 

The implementation of a quantum repeater protocol with a communication 
time growing polynomially with the transmission distance [36, 37] requires to gen- 
erate entanglement between distant qubits, store them for sufficiently long time 
and perform local collective operations on several of these qubits. The requirement 
of quantum memory is essential since all purification protocols are probabilistic. 
When entanglement purification is performed separately for each segment of a 
channel, quantum memory can be used to keep the segment state if the purifi- 
cation succeeds and to repeat the purification for the segments only when the 
previous attempt fails. This is essentially important for polynomial scaling prop- 
erties of the communication efficiency since without quantum memory we have to 
require that the purifications for all the segments succeed at the same time; the 
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Figure 1: Ion trap quantum computer (schematic). 



probability of such an event decreases exponentially with the channel length. 

In Sec. 5.2 we present two examples of quantum computing proposals for 
trapped ions. This is followed by Sec. 5.3 which deals with an example for a two- 
qubit gate for neutral atoms based on cold coherent collisions. Finally, in Sec. 5.4 
we discuss the implementation of a quantum repeater by using atomic ensembles. 

5.2 Quantum computing in ion traps 

In this section we show how to implement two-qubit gates in a linear ion trap 
via a quantum data bus and discuss how to realize a two-qubit gate deploying 
Coulomb interaction in arrays of micro traps. 



5.2.1 Ion trap quantum computer ’95 

As first proposed in Ref. [27], N cold ions interacting with laser light and moving in 
a linear trap provide a realistic physical system to implement a quantum computer. 
The distinctive features of this system are: (i) it allows the implementation of a 
complete set of quantum gates between any set of (not necessarily neighboring) 
ions; (ii) decoherence is comparatively small, and (hi) the final readout can be 
performed with essentially unit efficiency [38]. 

Fig. 1 illustrates the basic setup. The qubits are represented by the long- 
lived internal states of the ions, with \g)j = |0)j representing the ground state, 
and |ro)j = |l)j a metastable excited state ( j = 1, • • • ,N). (In addition, we assume 
that there is a second metastable excited state |ri) which serves below the role 
of an auxiliary state.) The implementation of a two-qubit gate between two (or 
more) arbitrary ions in the trap is accomplished by exciting the collective quantized 
motion of the ions with lasers, i.e. the collective center of mass phonon mode plays 
the role of a quantum data bus. For this we assume that the collective phonon 
modes have been cooled to the ground state [23]. 
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A laser addressing the j-th ion is tuned to the lower motional sideband of, 
for example, the center-of-mass mode with creation and annihilation operators 
and a, respectively. In the interaction picture we find the Hamiltonian [27] 

H,,, = [K)j{g\ae-^^ + b),(r,|ate'^] . (3) 

Here Q is the Rabi frequency, (j) the laser phase, and r] is the Lamb-Dicke parameter. 
The subscript g = 0, 1 refers to the transition excited by the laser, which depends 
on the laser polarization. It is easy to prove that the unitary time evolution 
for a laser beam kept on for a certain time t = kn/ {Q.r]/y/N) keeps the state \g)j |0) 
unaltered, whereas 

|p)i|l> — > cos(/c7r/2)|£;)j|l) -*e"^sin(fc7r/2)|r,)j|0), 

k)j|0> — *■ cos(A:7r/2)|r,)j|0) - ie”"^sin(fc7r/2)|c?)j|l), 

where [0) (|1)) denotes a state of the CM mode with no (one) phonon. 

Let us now show how a two-bit gate can be performed using this interaction. 
We consider the following three-step process (see Fig. 2): (i) A tt laser pulse with 
polarization g 0 and 0 = 0 excites the m-th ion. The evolution corresponding 
to this step is given by (Fig. 2a). (ii) The laser directed on the 

n-th ion is then turned on for a time of a 27r-pulse with polarization g = 1 
and 0 = 0. The corresponding evolution operator changes the sign of the 
state |g)n|l) (without affecting the others) via a rotation through the auxiliary 
state |ei)yi|0) (Fig. 2b). (iii) Same as (i). Thus, the unitary operation for the 
whole process is /7m, n = effect of this interaction is to change 

the sign of the state only when both ions are initially excited and thus realizes 
a phase gate |ci)|e 2 ) ^ (— 1)^^^^ (ei ^2 = 0,1) which together with single 
qubit rotations becomes equivalent to a controlled-NOT. Note that the state of 
the CM mode is restored to the vacuum state |0) after the process. This proposal 
for the implementation of quantum computing with trapped ions has provided the 
stimulus for a significant body of experimental and theoretical work over the last 
few years [21, 22, 23, 38, 31, 30, 35]. 

5.2.2 Ion trap quantimi computer 2000 

We now describe a version of an ion trap computer where entanglement is achieved 
by designing an internal- state dependent two-body interaction between the ions 
[39, 40]. In comparison to the proposal described in Sec. 5.2.1 this scheme has 
the advantage of being conceptually simpler (e.g. there is no zero temperature 
requirement), and obviously scalable. The model assumes that ions are stored in 
an array of microtraps (Fig. 3) and long lived internal states of the ions serve as 
carriers of the qubits. 

A set of N ions is confined in independent harmonic potential wells separated 
by some constant distance d, where d is large enough so that: (i) the Coulomb 
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Figure 2: The two-qubit quantum gate, a) First step according to (i): the qubit 
of the first atom is swapped to the photonic data bus with a 7r-pulse on the lower 
motional sideband, b) Second step (ii): the state |^, 1) acquires a minus sign due 
to a 27T-rotation via the auxiliary atomic level |ri) on the lower motional sideband. 





Figure 3: Ions stored in an array of microtraps. By addressing two adjacent ions 
with an external field the ion wave packet is displaced conditional to its internal 
state. 
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repulsion is not able to excite the vibrational state of the ions; (ii) the ions can 
be individually addressed. Two-qubit gates between two neighboring ions can be 
performed by slightly displacing them for a short time T if they are in a particular 
internal state, say |1). In that case, and provided the ions come back to their 
original motional state after being pushed, the Coulomb interaction will provide 
the internal wave function (quantum register) with different phases depending 
on the internal states of the ions. Choosing the time appropriately, the complete 
process will give rise to the two-qubit gate |ci)lc 2 ) ^ . 

The term responsible for entangling the atoms, giving rise to a conditional 
phase shift, can be simply interpreted as arising from the energy shifts due to the 
Coulomb interactions of atoms accumulated on different trajectories according to 
their internal states, 

Z z z z 1 , (4) 

d-\-X 2 —x\ d-\-X 2 d — xi dj ’ 

where xi ,2 are the conditional displacements of the wave functions (cf. Fig. 3). The 
four terms in Eq. (4) are due to atoms in |l}i|l) 2 , |l)i|0)2, l0)i|l)2 and |0)i|0)2, 
respectively. The expression Eq. (4) depends only on mean displacement of the 
atomic wavepacket and thus is insensitive to the temperature (the width of the 
wave packet) which will appear only in the problem in higher orders in Xi, 2 /d 1 
or in cases of non-adiabaticity. A detailed theory of this proposal including an 
analysis of imperfections has been given by Calarco et ai [40] . 

5.3 CoUisional interactions between neutral atoms 

Experimental advances in manipulating Bose-Einstein condensates (BEC) and in 
particular the loading of a BEC into an optical lattice demonstrating a quan- 
tum phase transition from a BEC into a Mott insulator [25] allow unprecedented 
experimental control over the center of mass motion of single neutral atoms. In 
particular it enables to engineer controlled coherent atomic collisions between the 
atoms which provide the basic mechanism to entangle neutral atoms [32] and im- 
plement two-qubit gates between qubits stored in long lived internal states of these 
atoms. 

5.3.1 Entanglement via coherent ground state collisions 

In a field theoretic language the s-wave interactions between ultracold atoms cor- 
respond to Hamiltonians which are quart ic in the atomic field operators, analogous 
to Kerr nonlinearities between photons in quantum optics. By storing ultracold 
atoms in arrays of microscopic potentials provided, for example, by optical lattices 
these collisional interactions can be controlled via laser parameters [32]. Further- 
more, these nonlinear atom-atom interactions can be large [32], even for interac- 
tions between individual pairs of atoms, thus providing the necessary ingredients 
to implement quantum logic. 
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Figure 4: We collide a first atom in the internal state \a) with a second atom 
in state \b). In the collision the wave function accumulates a phase according to 
Eq. (6). 



Consider a situation where two atoms with electrons populating the internal 
states \a) and \b), respectively, are trapped in the ground states of two poten- 
tial wells (Fig. 4). Initially, these wells are centered at positions and 
sufficiently far apart (distance d = Xb — Xa) so that the particles do not interact. 
The positions of the potentials are moved along trajectories x^{t) and x^{t) so 
that the wave packets of the atoms overlap for certain time, until finally they are 
restored to the initial position at the final time. This situation is described by the 
Hamiltonian 



/3=a,b 






2m 



+ (: 






(t)) 






( 5 ) 



Here, and are position and momentum operators, (x®’^ — x^’^(t)) 
describe the displaced trap potentials and is the atom-atom interaction term. 
Ideally, we want to implement the transformation from before to after the collision, 

-x°-)'iPq{x^ - x’’) ( 6 ) 



where each atom remains in the ground state of its trapping potential and preserves 
its internal state. The phase 0 = </>" + will contain a contribution 

from the interaction (collision) and (trivial) single particle kinematic phases 
and (j)^. The transformation (6) can be realized in the adiabatic limit, whereby 
we move the potentials slowly on the scale given by the trap frequency, so that 
the atoms remain in their ground state. In this case the collisional phase shift is 
given by = J^^dtAE{t)/h, where AE{t) is the energy shift induced by the 
atom-atom interactions 

AE{t) = f dx\il)Q {x - Æ“(i)) p(i/>o {x - x'^it)) p, 

771 J 



( 7 ) 
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with ÜS the 5-wave scattering length. In addition we assume that |A£'(t)| <C hu 
with p the trap frequency so that no sloshing motion is excited. 

The assumption behind the colliding atoms by hand, as described above, is 
that different internal states of the atom see a different trapping non-dissipative 
potential. In practice, this can be achieved by trapping atoms in a far-off- resonant 
trap or optical lattice, where the lasers are tuned in such a way that the atomic 
states I a) and |6) couple differently to the excited atomic states, so that their AC 
Stark shifts differ. A specific laser configuration achieving this state dependent 
trapping has been analyzed in Ref. [32] for Alkali atoms, based on tuning the laser 
between the fine structure excited states. The trapping potentials can be moved 
by changing the laser parameters. Such trapping potentials could also be realized 
with magnetic and electric microtraps [41]. 

We note that loading of single atoms in laser traps has been achieved recently 
[24], and movable arrays of trapping potential have been created with arrays of 
microlenses [42]. Finally, by filling the lattice from a Bose condensate, and us- 
ing the ideas related to Mott transitions in optical lattices [43] it is possible to 
achieve uniform lattice occupation (“optical crystals”) as demonstrated recently 
by M. Greiner et al. [25]. Even specific atomic patterns, as well as the low temper- 
atures necessary for performing the experiments proposed above can be achieved 
by loading an optical lattice from a BEC. 

5.4 The quantum repeater - scedable long distance communication 

We summarize a scheme to realize quantum repeaters discussed above based on 
the use of atomic ensembles. The laser manipulation of the atomic ensembles, 
together with some simple linear optics devices and moderate efficiency single- 
photon detectors, provide the only resources required for long-distance quantum 
communication. Remarkably, the scheme is not only a significant simplification, 
in particular in comparison with the single-atom and high-Q cavity proposals 
[44, 29, 26], but also circumvents the realistic noise and imperfections, and at 
the same time keeps the overhead in the communication time increasing with the 
distance only polynomially. 

The realization of the quantum repeater with atomic ensembles relies on three 
steps: (i) entanglement generation, (ii) entanglement connection via swapping, 
and (iii) application in communication protocols, such as quantum teleportation, 
cryptography, and Bell inequality detection. We give below a simplified description, 
referring to reference [37] for a detailed discussion. 

5.4.1 Entanglement generation 

The key element in the realization of entanglement generation is single-photon 
interference at photodetectors, where atomic ensembles allow for the collective 
enhancement of the signal- to- noise ratio. We consider a sample of atoms prepared 
in the ground state |1) in a A configuration according to Fig. 5a. For excitation 
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Figure 5: (a) The relevant level structure of the atoms in the ensemble with |1), 
the ground state, |2) , the metastable state for storing a qubit, and |3) , the excited 
state. The transition |1) — >■ |3) is coupled by a laser, and the forward scattering 
Stokes light comes from the transition |3) ^ |2). (b) Setup for generating entangle- 
ment between the two atomic ensembles L and R. The ensembles are illuminated 
by the synchronized classical laser pulses, and the forward-scattering Stokes pulses 
are interfered at a 50%-50% beam splitter, with the outputs detected respectively 
by two single-photon detectors D1 and D2. If there is a click in D1 or D2, we 
successfully generated entanglement between the ensembles L and R. Otherwise, 
we apply a repumping pulse and repeat the process. 



with a weak and short laser pulse the signal mode a of the forward-scattered Stokes 
signal and the collective atomic mode S = (l/\/]Vâ) (2| are in the state 

\ 4 >) = | 0 a) | 0 p) + | 0 p) 4 . o (p^) _ ( 8 ) 

where pc ^ 1 denotes the (small) excitation probability, and |0a) and |0p) are 
respectively the atomic and optical vacuum states with |0a) = |1}^. 

To generate entanglement between two distant sites L and R (Fig. 5b) we 
excite the ensembles simultaneously, so that the system is described by the state 
\(f >) where and are given by Eq. (8). The forward scattered Stokes 
signal from both ensembles is combined at the beam splitter and a photodetector 
click in either D1 or D2 measures the combined radiation from two samples, 
or oLa_ with a± = (ül ± clr) /V^- Conditional on the detector click, we should 
apply a_(- or a_ to the whole state |</))^ (g) and the projected state of the 
ensembles L and R is nearly maximally entangled with the form 

l^)L= (^I±4)/^|0a)Ll0a)H- (9) 

The probability for getting a click is given by pc for each round, so we need repeat 
the process about 1 /pc times for a successful entanglement preparation. 
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Figure 6: Setup for the entanglement swapping. We have two pairs of ensembles 
L, Il and I 2 , R distributed at three sites L, I and R. Each of the ensemble-pairs 
L, Il and I 2 , R is prepared in a maximally entangled state by photodetection. 
The excitations in the collective modes of the ensembles R and I 2 are transferred 
simultaneously to the optical excitations by repumping pulses applied to the atomic 
transition |2) ^ |3), and the stimulated optical excitations, after a 50%-50% beam 
splitter, are detected by the single-photon detectors D1 and D2. If either D1 or D2 
clicks, the protocol is successful and an entangled state is established between the 
ensembles L and R with a doubled communication distance. Otherwise, we need 
to repeat the previous entanglement generation and swapping until the protocol 
finally succeeds. 



5.4.2 Entanglement connection through swapping 

We extend the quantum communication distance by entanglement swapping ac- 
cording to Fig. 6. Suppose that we start with two pairs of the entangled ensem- 
bles described by the state In ideal case, the setup shown 

in Fig. 6 measures the quantities corresponding to operators S]^S± with S± = 
{Sj^ zb S12) /V2. If one of the detectors registers one photon, we will prepare the 
ensembles L and R into another maximally entangled state. This method for con- 
necting entanglement can be cascaded to arbitrarily extend the communication 
distance. 

5.4.3 Applications: entanglement-based communication schemes 

After an effectively maximally entangled (EME) state has been established be- 
tween two distant sites, we would like to use it in the communication protocols, 
such as quantum teleportation, cryptography, and Bell inequality detection. In 
the following we will show how the EME states can be used to realize all these 
protocols with simple experimental configurations. 

Quantum cryptography and the Bell inequality detection are achieved with 
the setup shown by Fig. 7. The state of the two pairs of ensembles is expressed 
as (8) |^)L 2 i? 2 * register only the coincidences of the two-side detectors, 

so the protocol is successful only if there is a click on each side. Under this con- 
dition, the vacuum components in the EME states, together with the state com- 
ponents [vac) and I vac), where |vac) denotes the ensemble state 

|0a0a0a0a)£^^^^^2i?2’ have no contributions to the experimental results. Thus, for 




Vol. 4, 2003 Quantum Information Processing with Quantum Optics 



S777 




Figure 7: Schematic setup for the realization of quantum cryptography and Bell 
inequality detection. Two pairs of ensembles Li, Ri and L 2 , R 2 have been prepared 
in entangled states. The collective atomic excitations on each side are transferred 
to the optical excitations, which, respectively after a relative phase shift (fL or (pR 
and a 50%-50% beam splitter, are detected by the single-photon detectors 
and We look at the four possible coincidences of with 

which are functions of the phase difference pi — pr. Depending on the choice of 
Pl and pR, this setup can realize both the quantum cryptography and the Bell 
inequality detection. 



the measurement scheme shown by Fig. 5, the ensemble state L 2 R 2 

effectively equivalent to the following ‘polarization’ maximally entangled (PME) 
state (the terminology of ‘polarization’ comes from an analogy to the optical case) 

l^)pME “ 

One can check that in Fig. 5, the phase shift p\ (A = L or R) together with the 
corresponding beam splitter operation are equivalent to a single-bit rotation in 
the basis ||0)^ s |0a0a)^^^^ , |l)y^ s | 0 a 0 a)AiA 2 } with the rotation angle 
^ == ^a/2. Now, it is clear how to do quantum cryptography and Bell inequality 
detection since we have the PME state and we can perform the desired single- 
bit rotations in the corresponding basis. For instance, to distribute a quantum 
key between the two remote sides, we simply choose pA randomly from the set 
{0,7t/2} with an equal probability, and keep the measurement results (to be 0 if 
clicks, and 1 if clicks) on both sides as the shared secret key if the two 
sides become aware that they have chosen the same phase shift after the public 
declare. This is equivalent to the Ekert scheme. Similar arguments can be made for 
Bell inequality detection, and the established long-distance maximally entangled 
states can also be used for faithful transfer of unknown quantum states through 
probabilistic quantum teleportation, with the setup shown by Fig. 7. 



5.4.4 Noise, built-in entanglement purification, and scaling 
of the communication elfficiency 

A central feature of the above protocol is the fact that entanglement genera- 
tion, connection and application contains built-in entanglement purification which 
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makes the whole scheme resilient to the realistic noise and imperfections. As an ex- 
ample, in entanglement generation and entanglement swapping the dominant noise 
is the photon loss. This includes channel attenuation, the spontaneous emissions in 
the atomic ensembles in the non-forward direction, the coupling inefficiency of the 
Stokes signal, and the inefficiency of the single-photon detectors. This photon loss 
decreases the success probably for getting a detector click, but it has no influence 
on the resulting entangled state. 

The bottom line of the lengthy analysis of the influence of noise and imperfec- 
tions is that for a fixed given fidelity the communication time scales polynomially 
with the distance L, which we illustrate here by a simple example. Consider a total 
communication distance L of lOOLatt with Latt the attenuation length of the fiber, 
and a photo detection efficiency of rjs ~ 2/3. The communication time in this case 
is Ttot/Tcon ~ 10® with Tcon the connection time for each segment and with an 
optimal segment length Lq ~ 5.7Latt- This result is a dramatic improvement com- 
pared with the direct communication case, where the communication time Ttot 
increases with the distance L by the exponential law Ttot ~ For the 

same distance L ~ lOOLatt, one needs Ttot/Tcon ~ 10^^ for direct communication, 
which means that for this example the present scheme is 10^^ times more efficient. 



6 Conclusions 

We have reviewed some recent proposals for implementing two-qubit gates with 
quantum optical systems. We have given an example for an implementation relying 
on a quantum data bus for entangling two ions in a linear ion trap, shown how 
internal state dependent Coulomb interactions between two ions might entangle 
them, and discussed coherent atomic collisions between ultra cold neutral atoms as 
a basic mechanism to entangle two neutral atoms. We have also reviewed a proposal 
for a quantum repeater based on the use of atomic ensembles. This scheme allows 
quantum communication with a large fidelity and an overhead that grows only 
polynomially with the communication distance. 
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Exact Reformulation of the Bosonic Many-Body 
Problem in Terms of Stochastic Wave Functions; 
an Elementary Derivation 



lacopo Carusotto and Yvan Castin 



Abstract. Recently, a new method was developed to reformulate in an exact way 
the iV-body problem of interacting bosons in terms of the stochastic evolution of 
single particle wavefunctions, see I. Carusotto, Y. Castin, and J. Dalibard, Phys. 
Rev. A 63, 023606 (2001). The original derivation is however often considered as 
rather involved. For the specific scheme called ‘simple Fock scheme’ we present here 
a very elementary derivation of such an approach. 



1 Introduction 

Consider a gas of N indistinguishable bosonic spinless particles with binary inter- 
actions, a situation typically encountered in Bose condensed atomic gases [1]. The 
time evolution of the many-body state vector jt/)) of the gas is described by the 
Schrôdinger equation 

where 7i is the Hamiltonian of the system, including kinetic energy, trapping po- 
tential energy and interaction terms. In the general case of a large number of 
particles and a large number of modes of the atomic matter field, the Hilbert 
space of the system is far too large for a direct integration of (1) to be feasible on 
a computer. A possible way out, intensively explored in particular in the quantum 
optics community, is to reformulate the evolution equation in terms of the stochas- 
tic evolution of classical fields, which are much smaller objects. Historically, the 
first suggestion for such a reformulation is based on the Positive-P distribution 
[2, 3]. Other schemes have been proposed more recently, based on different repre- 
sentations of the many-body density operator [4, 5, 6]. Generalizations to fermionic 
systems are also possible [7, 8]. 

As these methods are stochastic in nature, their prediction for a finite number 
of realizations deviates from the exact result by a fluctuating, zero- mean quantity, 
called the statistical error. One of the challenges of such reformulations is to prove 
that the average of the absolute value of the statistical error remains finite at all 
times, so that an actual Monte Carlo simulation converges to the exact result in 
the limit of an infinite number of realizations. To our knowledge, a proof of such 
a convergence property for the multi-mode Hamiltonian problem defined above 
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was given only for the so-called simple Fock scheme in [4] and for its fermionic 
counterpart [8]. 

The goal of the present paper is to present a much simpler derivation of the 
simple Fock scheme than the original one in [4]: the main idea here is to work 
directly with state vectors \ip) rather than with the many-body density operator. 
The simple derivation also provides in a simple way the proof of the statistical 
convergence of the method. 

2 The model Hamiltonian 

A model for an ultracold trapped interacting Bose gas in D dimensions with short 
range interactions can be obtained in a second-quantization formalism by using 
the Hamiltonian 

H = J2AV ¥(r)ho^{r) + ^'^AV 4't(r)4'1'(r)4'(r)4'(r); (2) 

r r 

the spatial coordinate r runs on a discrete orthogonal lattice of M points with 

periodic boundary conditions; V is the total volume of the quantization box and 

2 

Ay = V/M is the volume of the unit cell of the lattice, ^ + ^ext is the 

one-body Hamiltonian in the external trapping potential ^7ext(r), 'm is the atomic 
mass and interactions are modeled by a two-body discrete delta potential with a 
coupling constant Qq. The field operators ^(r) satisfy the Bose commutation rela- 
tions [^^(r), ^^(r')] = ôr^r'/^y and can be expanded on plane waves according to 
^(r) = with k restricted to the first Brillouin zone of the recipro- 

cal lattice; in the plane wave basis, the kinetic energy term has the diagonal form 
Sk In order for the discrete model to correctly reproduce the under- 

lying continuous field theory, the grid spacing must be smaller than macroscopic 
length scales like the thermal de Broglie wavelength and the healing length. 

3 The simple Fock scheme 

Consider the N particle bosonic Hartree-Fock state defined as 

\N:4>) = ^âl^\0) (3) 

where the operator dj, creates a particle in the non-necessarily normalized single- 
particle wave function 0(r): 



ôj, = AF (/>(r)4't(r). 

r 



(4) 



The challenge is here to find an equation of motion for 0(r) in order for the Hartree- 
Fock ansatz to give an exact solution to the full many-body dynamics, i.e. we aim 
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to obtain a dynamics for </>(r) such that 

7 fit 

d\N : <!>) = -n\N : 4>) (5) 

in a sense to be defined. In the following we proceed to evaluate both sides of (5). 

The action of the Hamiltonian (2) on |A^ ; (j)) can be worked out by splitting 
the field operator in its longitudinal and orthogonal components with respect to (j)\ 

«'(r) = -^â^ + ^±(r) (6) 

II'PII 

where the squared norm of (j) is 



= im ( 7 ) 

with the following scalar product 

=^AV,^*(r)V>(r). (8) 

r 



Prom this splitting of the field operator it follows that 

4'(r)|Ar:<^} = y/N4>{r)\N - 1 : (P) (9) 

¥{r)\N:cP) = ./N + l^\N+l-.<P) + ^{(r)\N :4>). (10) 

Inserting these expressions into the Schrôdinger equation for the Hartree-Fock 
state and collecting the terms containing the different powers of we finally 
obtain 



n\N:<p)=VN AV x(r) 4'+ (r)^ |V - 1 : <^) 



VN{N - 1 ) 



J2^V9oHr)4>{rW±{r)'î>{{r) \N - 2 : <f>) (11) 



with 



X(r) 



ho + 



9o{N - 1 ) 

ll<Af 



9o{N - 1) 

2 Uf 






<^(r). 



(12) 



On the other hand, the variation of a Hartree-Fock state following a variation 
of (j)(r) can be calculated by replacing (j){r) by </>(r) + d(f>{r) in (3) and expanding 
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with the binomial formula. Up to second order in the variation d0(r), this leads 
to 



\N : (j) + d(f>) = \N : 4>) + VN AV #(r) (r)^ |AT - 1 : ^) 

^ ' \ 
^AV"2d(/)(r)#(r')^t(r)4'+(r') \N - 2 : cj)) + . . . (13) 



VN{N - 1 ) 



. r,r' 



In the usual Hartree-Fock mean-field approximation, (f) has a deterministic 
evolution with d(f) oc dt^ so that the last term on the right-hand side of (13) 
is negligible in the limit dt ^ 0 and therefore the last term on the right-hand 
side of (11) cannot be accounted for. The best that one can do in the mean- field 
approximation is to set 

#WImf = (14) 

thus recovering a nonlinear Schrôdinger equation for (j) of the same form as the 
usual Gross-Pitaevskii equation [1]. 

The main idea to recover the exact evolution is to include in d(f) a term dB 
proportional to y/dï: 

d(f>{r) = ^x(r) + dB{r) (15) 

SO that the quadratic term in d(f) in (13) is no longer negligible. The price to pay 
for this is the apparition of an extra term proportional to y/dï in (13) which has 
no counterpart in (11). The trick is to make dB stochastic with a vanishing mean 
value. Then the average variation of the Hartree-Fock ansatz exactly matches the 
one required by the many-body Schrodinger equation: 

d\N:<j>) = ^n\N:(j)) (16) 

in 

provided that the noise term has the following correlation function 

dB{r)dB{T') = [|^<5,.r-0(r)0(r')] (17) 

where the projector Qr = 1 — |(/))(</)|/ ||0||^ projects orthogonally to 0. The zero 
mean property of the noise makes the terms proportional to y/di vanish in the 
averaging over noise. The terms of order three or more in d(j) in the expansion 
(13) are negligible as compared to dt in the limit dt ^ 0. In mathematical terms, 
this procedure corresponds to assuming that the evolution of (j) is governed by an 
Ito stochastic differential equation [10]. Fig.l gives a geometrical interpretation 
of the Monte Carlo sampling of the many-body state vector lip) by Hartree-Fock 
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states. The exact many-body state vector at any time t > 0 |'0(t)) is equal to the 
statistical average of Hartree-Fock ansatz: 

\tp{t)) = {\N : 4>{t))) (18) 

provided that this is the case at the initial time t = 0. 

ImCu I 



* 



¥ 






X ^ 



lv(t)> 



* 



* 






|N:Wt)> 



Re 



Figure 1: Schematic view of the Monte Carlo sampling of the exact many-body 
state vector at time t (cross) by a collection of random Hartree-Fock states 
\N : (j)k) (stars). In the limit of an infinite number of realizations, the center of 
mass of the distribution of the stars exactly coincides with the cross. In the N- 
body Hilbert space, u is an arbitrary unit vector and Cu is the component along u 
of 1-0) and of the |7V : 



In practice, a noise with the correlation function (17) can be obtained as 



dB{r) = 




4>{r) + c.c.) ; 

k>0 



(19) 



this expression was already discussed in [4] and is easily generalized to finite range 
interaction potentials. The index k is restricted to a half space and to the first 
Brillouin zone. The ^(k)’s are random angles uniformly distributed in [0,27 t]. An 
alternative choice can be 



/ go 



dB{r) = 



ihAV 



Qr [(p{T)d^{T)] 



( 20 ) 
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with the d^{T) being independent zero- mean real random variables with 

d^{r) d^(r') = ^r,r' dt. 

In both cases 

( 21 ) 

We can then calculate the evolution of the squared norm ||0||^ and check that it 
remains finite at all times: 

d\\4>f ^ Uf, (22) 

the absence of terms linear in dB being due to the orthogonality of dB with respect 
to (f). This leads to 

||^f(^)< ||<^f(0)e^, (23) 

SO that (j) remains finite at all times. The present stochastic wave function refor- 
mulation of the bosonic rnany-body problem is exactly the same as the so-called 
simple scheme with Fock states introduced in [4] from a different and more gen- 
eral point of view. Among the schemes based on a Hartree-Fock ansatz for the 
many-body density matrix of the form 

a{t) = \N:(l>,}{N:(f>2\ (24) 

with 01,2 evolving according to stochastic differential equations, the simple Fock 
scheme was shown to be the one which minimizes the growth rate of the statistical 
variance of a around the exact many-body density matrix p{t)\ 

Var[(T] = (||p(i) - a{t)f) = (Tr [a^t)a{t)] ) - Tr [p(i)2] , (25) 

where (...) represents the average over all stochastic realizations. The last term 
in the above equation, the so-called purity of p, is constant for Hamiltonian evo- 
lutions. If the stochastic wave functions 0i and 02 are statistically independent at 
the initial time ^ = 0, they remain so at any later time since their evolutions are 
independent. The mean squared norm of a is then 

(Tr [(T'f(^)cr(^)] ) = (26) 

where 

Ait) = {N:mN:4>it)) = Uf'' ■ (27) 

A geometrical interpretation of A can be put forward in the spirit of fig.l: the 
mean value of A is related to the mean value of the squared distance in Hilbert 
space between the Monte Carlo ansatz and the exact many-body state vector 

(A)-|||^)f^ = (|||7V:0>-|^)f^). 



(28) 




Vol. 4, 2003 Bosonic Many-Body Problem in Terms of Stochastic Wave Functions S789 

For the present Fock scheme, it follows from (23) that 

(A)(t) < (A)(0) (29) 

this upper bound was already discussed in [4]. In the model of this paper, the 
number of modes of the bosonic field is finite, so that the inequality (29) guarantees 
that the simulation can predict the expectation value of any observable with a finite 
statistical dispersion. The Monte Carlo statistical variance AO^ on the quantum 
expectation value Tr [pO] of any observable O is indeed limited from above by 

AO^ = (lTr[o-0] 1^) - Tr[pO]2 < Tr[0^] (Tr[crVl) - Tv[pO]^ (30) 

where the traces are taken in the A-particle subspace and the trace Tr[0^] is a 
finite quantity for a finite number of modes. 

This formalism can be extended to the imaginary-time evolution [11], which 
allows one to sample in an exact way the thermal equilibrium state at a given 
temperature T : 

p = (31) 



This requires solving the imaginary-time evolution equation 

( 32 ) 

for a “time” r going from 0 to f3 = IftsT. The use of the projector onto the N 
particle subspace Vn as the initial condition allows one to calculate the physical 
quantities in the canonical ensemble, i.e. for a fixed number N of particles. As 
we shall prove in full detail in the next section, the projector operator Vn has a 
simple expression in terms of Hartree-Fock states. 

The correct imaginary-time evolution of the Hartree-Fock state \N : cf)) is 
recovered if the stochastic wave function 0(r) evolves according to 



#(r) = - y 



j, , 9o{N — 1) I j,/ >,i2 

\\n 



with a noise term dB{r) equal to 



9o{N - 1) 

2 Uf 



|0(r')|' 

r' 



</)(r) + dB{r) 
(33) 



dB{r) = i 




0(r)^(e^(‘^-"+^(‘^))+c.c.) . 
k>0 



(34) 



We have recently applied this method to study the probability distribution of the 
number of condensate particles in a ID trapped Bose gas for temperatures both 
above and below the transition temperature [12]. 
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4 Proof of (over)-completeness of the Hartree-Fock states 

In the previous section we have shown how the A^-boson time-dependent problem 
can be exactly reformulated in terms of the stochastic evolution of single parti- 
cle wave functions; in particular, the thermal equilibrium state in the canonical 
ensemble can be obtained by means of an imaginary-time evolution with the pro- 
jector Vn as the initial state. Here, we shall prove that the projector Vn bas a 
simple expression in terms of Hartree-Fock states and that any state vector can 
be expanded onto a set of Hartree-Fock states. This ensures that the identity (18) 
can be satisfied at time t = 0 for any initial many-body state vector |'0(O)). 

It is a well-known fact of quantum optics [13] that the Glauber coherent 
states form an overcomplete basis of the Hilbert space of an harmonic oscillator. 
In the general multi-mode case, this property can be expressed as 

1 = ^ / I^</>|Glaub ; (/>)(Glaub : 0| (35) 

where the Glauber coherent state is defined as usual as 

IGlaub ; 4>) = e-H'^ll'/2eE,Ai/,A(r)'î'br)|o). (36) 

and 1 is the identity on the bosonic subspace of totally symmetric wave functions. 
The integration in (35) is performed over the 2M dimensional space of complex- 
valued functions defined on the M-point lattice: 

dRe0(r) dlm0(r) . (37) 

r 

Using the well-known formula 

PwlGlaub : <P) = ; 0) 

to project onto the N particle subspace, an explicit expression for Piv in terms of 
Hartree-Fock states can be obtained 

Vn — Vn^Vn = J Ptv I G laub : (/>)(Glaub : = 

C Jv^\N :^){N -N\. (39) 

The constant C is equal to 

jmw ( 40 ) 

The normalized wave function 0(r) is defined as <^(r) = </>(r)/ \\(f)\\ and the inte- 
gration over is performed on the unit sphere \\^\\^ = 1 according to the measure 
D0 defined as 

= d\m 



(41) 
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As discussed in [11], the expression (39) is the starting point for an exact numerical 
calculation of the thermodynamical equilibrium properties of the interacting Bose 
gas. 

Completeness of the set of Hartree-Fock states is an immediate consequence 
of the result (39): any state vector belonging to the N particle subspace can 
in fact be expanded over Hartree-Fock states according to 

1^) = Vn\-4j) = C Jvi {N : ^1^) \N : 0). (42) 

As required for practical implementation of the Fock scheme, the weight factor can 
be made real and positive for all the 0 just by reabsorbing the phase of {N : ^\ip) 
into that of the wave function (p: 



IV») = C jv^ {N : $\ip) \N : r{^) cj)) 



(43) 



with the phase factor r(0) defined according to 






{N : 01V.) 
(N : ^1^) 



Since the scalar product of two Hartree-Fock states is equal to 



(44) 



{N : <t>2\N : <j>i) = {4>2\h)'' , (45) 

this family of states is not an orthonormal basis, but rather an overcomplete set. 

5 Conclusion 

We have shown that the simple Fock scheme can be rederived in a very elementary 
way using the Schrôdinger equation for the state vector rather than the Liouville 
equation for the many-body density operator. Such a derivation is inspired from 
the derivation for fermions used in [8] . We have also given a proof of completeness 
of the Hartree-Fock state ansatz more detailed than in [4] . 
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Phase Fluctuations and Coherence 
of an Almost Ideal Condensate 



L.P. Pitaevskii and S. Stringari 



Abstract. The quantum description of a Bose-Einstein condensates in double-well 
traps connected by a Josephson junction is discussed. The quantization in terms 
of the number and phase canonically-conjugated operators gives correct energy 
levels, but results in finite phase fluctuations in the absence of interaction. This 
seems to contradict the perfect coherence of a system of non-interacting particles. 
The problem can be solved by calculating the observables that characterize the 
coherence properties of a system, as, for example, the momentum distribution. It 
occurs that the visibility of interference fringes depends both on the fluctuations 
of the phase and the number of atoms. The visibility is calculated in the limit of 
small interaction and the results exhibit the correct behavior. 



Bose-Einstein condensate in a double-well potential 

After the experimental realization of Bose-Einstein condensation in trapped atomic 
gases, the phenomenon undergoes active investigations both from theoretical and 
experimental points of view. Most of the experiments can be described in terms 
of relatively simple mean-field theory, based on the non-linear Gross-Pitaevskii 
equation. In the dilute gases deviations from the mean-field picture are, generally 
speaking, small. The deviations, however, can be measured and such measure- 
ments will give new insight in the phenomenon. One of the promising possibilities 
would be the observation of fluctuations in the relative phase of weakly coupled 
condensates. 

Let us consider a Bose-Einstein condensate of N atoms confined by two wells 
separated in the 2;-direction. The system can be characterized by the number of 
atoms in each well, Na and Aj,, and the corresponding phases of the condensates, 
and $5. The classical, mean-field dynamics of the system can be formulated in 
terms of canonical equations for the following canonical-conjugated variables: the 
phase difference ^ and the quantity hk^ where k = (Na — N^) /2 is the 

particle “disbalance” . 

We assume that the condensates are macroscopic {Na^N^ ^ 1 ). The energy 
of the system has form of the well-known “Josephson Hamiltonian” 



Hj = {l/2)Eck^-Ejcos^, 



( 1 ) 
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where the parameter Ej accounts for the tunneling between wells. The parameter 



Ec = 2 



f 



Na = N/2 



(2) 



where fia i^a) is the chemical potential, describes the atom- atom interaction (see, 
instance, [1, 2]). 

In the classical picture the ground state corresponds to the values ^ = k = 0. 
In order to describe quantum fluctuations, one has to introduce quantization rules. 
To this purpose, one can replace the conjugated variables $ and hk with operators 
satisfying the commutation relations [$, hk] = ih. Hence one has 

[^:k\=i ■ k = -i~. ( 3 ) 

The system can be described by the Hamiltonian 
acting in the space of periodic functions ^ with period 27 t. 



Relation to observations 

Now the quantities $ and k fluctuate even in the ground state. For a symmetric 
configuration one has for average values (sin $) = {k) = 0. The fluctuations satisfy 
the uncertainty relation [3] 

(Asin^ ^ (cos$)^ . (5) 

The average (cos^) defines an observable quantity - the momentum distri- 
bution. Let as assume that the condensates in two wells have phase difference 
Then the momentum distribution is 

n (p) = 2 [1 -h cos {jpzd/h -h $)] na(p), (6) 

where d is the distance between wells. By averaging with respect to the phase Çf>, 
one finds 

(n(p)) = 2 [1 + {cos^) cos {pzd/h) - {sin^) sin {pzd/h)]ria{p) (7) 
and, because (sin^) = 0, 

(n (p)) = 2 [1 + H cos (pzd/h)] ria{p). (8) 

The parameter V — (cos $) coincides with the visibility of the interference fringes 
in the momentum distribution. If the fluctuations are small, then 

y 1 - (^2) /2. 



( 9 ) 
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The momentum distribution n (p) is an observable quantity. It can be defined 
in the experiment where the interference is produced by two condensates, which 
expand and overlap after releasing the confined traps [4] . A direct measurement of 
n (p) in the double-well geometry is, in principle, feasible using two-phonon Bragg 
spectroscopy (see [5]). Notice that the average in Eqs. (7)-(9) means average with 
respect to different runs of the experiment. 

In the weak interaction limit, Ec Ej, phase fluctuations in the ground 
state are small: 




According to this equation ^ 0 when Ec 0. However, the Hamiltonian 

(4) itself must be corrected at small Ec- Taking into account the dependence of 
the Josephson energy Ej on the number difference k, one obtains: 

Hj=EcY~y{^^- cos4>, (11) 

where Ô is the splitting of the ground state energy level due to tunneling. Notice 
that the parameter Ec in this conditions does not depend on the number of atoms. 
Expanding with respect to ^ and k gives the oscillator-like Hamiltonian 



Hj 



NÔ NS 






k^ 



( 12 ) 



The eigenfrequency of the oscillator is [6] cj j = y/S {S -h NEc/2))/h and the fluc- 
tuations in the ground state are 

($2) ^ ^ + nEc/ 2SŸ^^ , (Ak^) = (13) 

In absence of interaction Ec = 0 and 

<$2) = l,(Afc2) = iV/4. (14) 

This result seems to contradict the common sense. The ground state of an ideal 
gas is completely coherent and the visibility of the interference fringes must be 
perfect. This paradox is, however, a result of the semiclassical calculation of the 
visibility. In an almost ideal condensate the visibility depends not only on 
but also on and demands more exact calculations. 



Quantum calculation of the visibility 

To give the first principle derivation of the visibility factor, we shall express the 
average momentum distribution in the second quantization representation: 

(n(p)> = (4't(p)#(p)). 



(15) 
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The ^-operator for the double-well problem can be written as 

^(p) = '/'a(p)â + «!>6(p)fe, (16) 

where 4>a{p) and ^6(p) are the wave functions for the two wells, <(>a(p) = 

06 (p), both normalized to 1. Then, simple calculations yield an equation of the 
form (8) for the average momentum distribution, with the visibility factor given 
by 

V=^{a^b) = ^{b^a). (17) 

It is convenient to calculate this quantity in the Fock representation, where wave- 
functions are labeled by the quantum numbers k [7] . Let us write the ground state 
wave function as 

|), = ^Cfc|/c), (18) 

k 

where the functions \k) and the coefficients Ck can be taken real. Then one has 
the matrix elements 



b^a\k) = [{N/2 + k) {N/2 -k + 1)]^/^ |fc - 1) 



and the visibility 






1/2 

CkCk-\-l • 



(19) 



( 20 ) 



In the limit of large N 1 one can expand this expression with respect to k. For 
example. 






(21) 



By retaining only terms up to the order of 1/A^, we obtain the main result of this 
paper: 





' + I 



N2 



( 22 ) 



or 

(23) 



where = f k^C^dk and = - f Ck^Ckdk. 

This equation shows that only if terms of the order of (k‘^) /N"^ and 1/A^ are 
negligible, the identification (9) is correct. Conversely, in the ideal gas case the 
fluctuations are given by (14) and V = 1, as it must be. In the general case of 
small but finite interaction one gets from (13): 



1 + NEc/AS 
{l + NEc/2ÔŸ^^' 



( 24 ) 
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If Ec » S/N equation (24) coincides with result (9)-(10). In the opposite limit 
Ec — > 0 the visibility tends to 1 according to the law: 




(25) 



To conclude, we developed a consistent method for the calculation of the 
visibility of interference fringes in the momentum distribution of a double-well 
condensate on basis of the Josephson Hamiltonian. The transition to the limiting 
case of an ideal gas is traced. 
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Extracting Entropy from Quantum Computers 
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Abstract. Quantum error correction (QEC) and fault tolerant quantum computing 
are introduced. The basic theory of an important class of quantum error correcting 
codes, the stabilizer codes, is given. Fault tolerance is described very briefly. The 
whole fault-tolerant correction process can usefully be described in thermodynamic 
language, as a special form of heat engine which extracts entropy from a collection 
of qubits without fully measuring their state. QEC is useful both for stabilizing 
quantum computers and for detecting small many-body correlated effects which 
would otherwise be swamped by (uncorrelated) noise. 

This paper briefly introduces some of the concepts and methods of quantum error 
correction (QEC) [1, 2, 3, 4] and fault tolerant quantum computing [5, 6, 7, 8, 9, 
10, 11, 12, 13, 14, 15, 16, 17, 18]. These form part of the basic theory of quantum 
information, and are now sufficiently well established that the main principles 
are covered in text book introductions to the subject [19, 20, 21]. This paper is 
therefore mainly a ‘taster’ for interested physicists unfamiliar with the material, 
but I will include some ideas which though natural to the subject, have not been 
previously discussed. 

The main theme of QEC and fault tolerant quantum computing is to under- 
stand how quantum information can be stored, transmitted, and experimentally 
manipulated in a robust way. For example, we may transmit quantum systems 
such as photons down a noisy channel, and desire that the receiver can with high 
reliability reconstruct the quantum state sent by the transmitter. Alternatively, 
we may wish to manipulate a large quantum system so as to perform information 
processing with high reliability, even though the operations and the constituent 
parts of the system are subject to random noise. This is of great practical interest 
for achieving quantum computing in the laboratory, and it also underlies a new 
and as yet little understood area of physics concerning the thermodynamics of 
complex entangled quantum systems. 

The challenge of achieving precise manipulation of quantum information has 
inspired much ingenuity, and many established methods of experimental physics, 
such as adiabatic passage, geometric phases, spin echo and their generalizations can 
be useful. These provide an improvement in the precision of some driven evolution 
by a given factor at a cost in speed, for example a slow-down of the evolution by 
the same factor. Such methods may play a useful role in a quantum computer, but 
they cannot provide all the stability required, for two reasons. First the slow-down 
is unacceptable when large quantum algorithms are contemplated, and secondly 
it is doubtful whether they will in practice achieve the relative precision of order 
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IfKQ which is needed to allow a successful computation involving Q elementary 
steps on K logical qubits, when KQ reaches values >10^ which are needed for 
computations large enough that a quantum computer could out-perform the best 
available classical computer. 

Quantum error correction (QEC) may allow a precision <C 10"^ per logical 
operation to be attained in quantum computers. In order for this to be possi- 
ble, QEC must be applied in a fault- tolerant manner, that is, the QEC process is 
constructed so that it removes more noise than it generates when it is itself im- 
perfect. One surprising and powerful result is the threshold. This is the result that 
arbitrarily long quantum computations can be robust, under various reasonable 
assumptions, once the noise per quantum gate and per qubit during the duration 
of a gate is below a threshold value which does not depend on the size KQ of 
the computation [7, 8, 11, 9, 10, 12]. The value of this threshold depends on con- 
siderations such as the degree to which operations can proceed in parallel in the 
computing device, and the ease with which quantum information can flow around 
the computer. For a device which allows a high degree of parallelism and good 
information flow it can reach ~ 10“^ [18]. That is to say, even though every oper- 
ation and every qubit in a quantum computer is subject to random relaxation and 
decoherence (e.g. by spontaneous emission of photons, thermal fluctuations, etc.) 
at the level of around one tenth of one percent, the methods of QEC can cause 
the stored quantum information to be protected to such a degree that it suffers 
decoherence only to the level 10“^^ or less. 

1 A quantum computer is not a Schrodinger’s Cat; 
entangled states are robust 

Since large-scale quantum computing requires the coherent control of a highly 
complex quantum system, it has been compared in various places to the thought- 
experiment of Schrodinger’s Cat. This is a way of stating that a large working 
quantum computer is extremely hard to build. However, this comparison is par- 
tially misleading, because the differences between a quantum computer and a cat 
are more significant than their similarities. A quantum computer (of very great 
computing power) would need only of order one thousand two-state systems, or ten 
thousand to one million with error-correcting ability included, arranged in such a 
way that elementary operations of the system are reversible to good approximation 
(to the level ~ 10“^ or 10“^). By contrast, a cat has some 10^^ or more atoms and 
an extremely complex structure: its large-scale behaviour is thermodynamically 
irreversible. 

The paradox of the Schrôdinger Cat experiment is essentially to do with rec- 
onciling the irreversible nature of a physical phenomenon such as the death of a 
cat with the reversible equations of quantum evolution. A working quantum com- 
puter of one thousand to one million qubits would demonstrate coherent reversible 
behaviour in a system much more complicated than any we are currently able to 




Vol. 4, 2003 Extracting Entropy from Quantum Computers 



S801 



control, but it would not imply that the death of a cat is reversible any more 
than the Young’s Slits experiment does, so it would have no impact on the logic 
of Schrôdinger ’s cat paradox. 

It is important to realise that entangled states of multiple systems are not es- 
pecially fragile as the number of systems increases. For example, if a certain atomic 
state has decay rate F then the 7V-atom entangled state ( 1 000 •••0)-h|lll*-*l)) / 
has decay rate of order NT. The loss of fidelity of such an entangled state, after 
some small time, is of the same order as the loss of fidelity of a product state such as 
|111 • • • 1), therefore the presence or absence of entanglement does not significantly 
affect the decoherence rate of such a system. Rather, the use of entanglement al- 
lows quantum computation to be stable. To store N qubits of information, 2^ 
orthogonal quantum states must be used. If this were attempted without using 
entanglement, for example by using one or a few degrees of freedom of a single 
system, such as a harmonic oscillator, then the system would be much less robust. 
For, if most of these states have different energy (i.e. if the amount of degeneracy 
does not grow exponentially with N) then the order of magnitude of the energy 
difference between [000 • • • 0) and |111 • • • 1) is 0(2^). In this case the decoherence 
rate of states such as (|000 • • *0) -|- jlll • • • l))/v^ would increase exponentially 
with N. If the degeneracy were sufficient to avoid this, then the 2^ states would 
need to be distinguished by some other parameter, and the system would show 
exponential sensitivity to the value of this parameter. In either case quantum 
computing would be ruled out by an exponential cost. The use of entangled states 
circumvents this sensitivity. 

2 Classical and qucintum error correcting codes 

Let us introduce a few of the concepts of classical error correction. A classical 
bit-string of n bits can be regarded as a row vector in an n-dimensional space 
called Hamming space. We will perform all arithmetic modulo 2 (to be precise, 
in a binary field). A classical linear code C is a collection of vectors such that 
u + V ^ Cy u,v ^ C. Such codes have size 2^ for some integer k < n. Each linear 
code is a linear vector-space (a sub-space of Hamming space) . The inner product 
u ‘ V = uv^ is calculated in the standard way for vectors, and owing to the use 
of binary arithmetic has a value either zero or one. Every linear code C has an 
{n — k) X n check matrix H such that Hu^ = 0 V k G C. The k x n generator 
matrix G is defined by HG^ = 0. The rows of G form a basis for the vector-space 
which is the code. 

If C has check matrix then the code generated by H is the dual of C, 
denoted For example, {000,111} is the dual of {000,011,101,110}. A code 
may contain its dual. It is obvious from the definitions that C C ïî and only if 
= 0 . 

I will now introduce the most important class of quantum error correcting 
codes, called stabilizer codes. 
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Consider the Pauli matrices X = cTx, Z = cjz and Y — XZ — —i(Jy, com- 
bined with the identity matrix /. This set has the properties that members either 
commute (e.g. XI — IX) or anticommute (e.g. XZ = —ZX); all members square 
to 1, (X^ = = Z^ = I^ = I) and it forms a complete basis for 2 x 2 matrices. 

We will be interested in tensor products of Pauli matrices. 

The set of tensor-products of n Pauli operators is a group V. 

A state 1^) is said to be stabilized by operator M if it is an eigenstate of M 
with eigenvalue 1, i.e. M \(f)) = \<f>). The set of operators in V which stabilize a given 
10) is called the stabilizer of |0). For example, |0) has stabilizer Z, /; |00) + |11) has 
stabilizer II, XX, YY, ZZ. A sub-space of Hilbert space can also have a stabilizer, 
for example the space spanned by {|00) , |11)} is stabilized by II, ZZ. Let n be 
the number of qubits, so the whole Hilbert space has dimension 2^. If the sub- 
space has dimension 2^ then the stabilizer has 2^“^ operators, and this group (the 
stabilizer) can be generated by n~ k operators. For example, the stabilizer II, ZZ 
is generated by ZZ; the stabilizer II, XX, YY, ZZ can be generated by XX, ZZ. 

A stabilizer S is an abelian sub-group of V. The stabilized space is a quantum 
error correcting code. 

3 Quantum error correction by syndrome extraction 

Since the Pauli matrices form a complete basis for 2 x 2 matrices, and because 
non- unit ary processes can be modeled by entanglement with further degrees of 
freedom (which we will call simply ‘the environment’), it follows that any process 
whatsoever affecting a set of n qubits can be written 

l<^) ® IV’>env Y ® (1) 

i 

where |0) is the state of the qubits, l-'^env denotes states of the environment, 
and where the operators are tensor products of Pauli operators. Each quantum 
error correcting code can correct a finite set of Ej. 

There are various ways to take advantage of the structure of the code in order 
to perform the error correction. The easiest to describe is the method of measuring 
syndromes. This involves applying tailored measurements to the set of qubits, so 
that the members of the stabilizer are measured (these are quantum mechanical 
observables so a process exists which measures them) but no other information is 
extracted (whether to our measurement devices or to the rest of the world). To 
make this idea more concrete, and to show one way to accomplish it in practice, 
it is useful to provide a complete specification for a network of operations which 
accomplishes the measurement. This can consist of a set of n — /c ancilliary qubits 
prepared in the state |0) + |1), followed by a well-defined set of controlled-gates 
between the ancilla and the original bits, followed by measurement of the ancilla. 
The details are described elsewhere [4, 19, 20, 21]. 
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One central concept here is that, since the quantum error correcting code is 
the stabilized sub-space, measuring all the members of the stabilizer has strictly no 
affect on any state in this sub-space, therefore the measurement does not disturb 
the stored quantum information. Errors which have acted on the data can move 
the system state out of the sub-space, and if this has happened then the syndrome 
measurement will either randomly collapse the state back into the protected sub- 
space, or it will reveal a measured observable whose eigenvalue is not equal to 1, 
and this will show what type of error has occurred. 

To explain this more precisely, consider the noisy state (1). The result of 
measuring the set of observables which form the stabilizer will be to collapse the 
state to an eigenstate of the measured observables: 

m ® - {È, \4>)) ® ( 2 ) 

i 

where to keep the notation uncluttered we have assumed only a single term from 
the sum survives; this assumption will be dropped in a moment. The measurement 
also yields the set of measured eigenvalues, this is called the error syndrome. If it is 
possible to deduce the error Ej from its syndrome Sj, then the error is correctable, 
since then it is possible to use the syndrome information to select which operator 
should be applied to the qubits to return their state to \(f)). Actually, a slightly 
lesser requirement is enough: if a corrective operator Cj can be identified from 
the syndrome, such that CjEj G S, then the error is correctable (since then the 
combined effect of Ej and Cj is equal to that of the identity operator for any state 
in the protected sub-space). 

If several different errors have the same syndrome, then several terms can 
appear on the right hand side of (2). In this case, after the corrective operation 
Cj , some of the error operators may be converted into the identity, and the others 
may not be. In general the state is left with some uncorrected errors. The power 
of the whole technique arises from the fact that codes can be constructed so that 
the uncorrect able errors are the least likely ones to occur. 

It remains to show which errors are correctable. For a given quantum error 
correcting code, a given set of errors {E^} is a correctable set, if, for all members Ei, 
Ej^ of the set, either EiEj anticommutes with a member of <S, or EiEj G S. Proof: 
suppose È 1 Ê 2 M = —MÈ 1 È 2 for some M G S. It follows that either {{Èi.M} = 0 
and [Ê 2 ,M] = 0) or ({E 2 ,M} = 0 and [Èi,M] = 0), where {•••} denotes the 
anticommutator. In either case, errors Ei and È 2 will have different syndromes, 
because upon measuring M, the eigenvalue -hi will be obtained if the error operator 
commutes with M, but —1 if it anticommutes (the easy proof is left to the reader). 
Therefore the definition of the correctable set which was given will guarantee that 
all its members have different syndromes, except for the special case EiEj G S. 
However, since the Pauli operators square to 1, the corrective operator Ci = Ei] 
if Ei and Ej do not have different syndromes, but they do satisfy the requirement 
EiEj G «S, then it is sufficient to apply corrective operator Ei when the error is 
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either of Éi or Èj, as was remarked in the paragraph after equation (2). This deals 
with the special case. 

3.1 CSS codes 

Within the class of stabilizer codes, an important sub-class is the Calderbank, 
Shor, Steane codes or CSS codes [2, 3, 4]. Such a code is constructed from a 
pair of classical linear codes Ci, C 2 which have the property Ci C C 2 . The check 
matrices iJi, i /2 of C\ and C 2 together produce the stabilizer of the quantum code, 
as follows. The generator of the stabilizer has a set of pure X operators (i.e. tensor 
products of X and the identity) formed from the rows of Hi and a set of pure Z 
operators formed from the rows of H 2 . The dual property of the classical codes 
ensures that the quantum stabilizer is an abelian group. For details the reader is 
referred to [22, 23, 24]. 

3.2 Summary 

QEC has been introduced above as a set of mathematical concepts, emphasizing 
the role of the Pauli group, the stabilizer, and dual pairs of classical binary error 
correcting codes. The impact of noise on a physical system has been written down 
in an abstract way (equation (1)), but so far there has been no need to analyze 
the noise of any particular physical example. This emphasizes that quantum error 
correction is a very general set of techniques which can apply to systems irrespec- 
tive of the type of error process they are subject to, and which also can be used 
for purposes other than correction of errors. 

QEC causes the state of a set of qubits to be returned to a given sub-space of 
the total Hilbert space. Since the sub-space has 2^ dimensions, it is an ‘encoded’ 
representation of k qubits. 

It is important to note that powerful quantum error correcting codes can 
be found in practice. For example, there exist codes such that, for a set of n 
qubits, any error which affects any subset of qubits of size up to t qubits is in 
the correctable set. Such codes are called t— error correcting. For such codes, only 
errors which affect large numbers of qubits at once are not correctable, and for 
many physical systems such ‘many-body’ effects have a probability which falls 
exponentially with the size t of the set. As long as t can be large enough for 
given values of the other parameters k and n, then this means that quantum error 
correction offers an exponential suppression of infidelity for only a polynomial cost 
in resources (number of qubits and operations). But t can be large, because it can 
be shown that classes of good codes exist. These have the property that the ratio 
t/n and also the rate k/n can both remain above a finite lower bound as n ^ 00 . 
The combination of these results allows a sufficiently powerful suppression of noise 
that it may allow quantum computing of non-trivial algorithms to be a realistic 
physical possibility rather than a subject of purely abstract interest. 
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4 Fault tolerance 

So far I have outlined some of the mathematical structure of quantum error correct- 
ing codes. However, to use them in practice for quantum computing, two further 
concepts must be introduced. First, we need to understand the impact of imper- 
fection in the error correction process itself. If, e.g., measurement of the syndrome 
is subject to noise, then the wrong syndrome may be obtained, and the ‘corrective’ 
operation would in fact introduce further errors. The second issue is that we need 
to be able to manipulate the protected quantum information, not merely correct it. 

Error correction is called ‘fault tolerant’ if overall it results in a net sup- 
pression rather than increase in errors, even when all the operations involved are 
themselves imperfect. Let F be the fidelity of the quantum computer’s state at 
some point in a computation, and let p give the loss of fidelity associated with a 
single operation, such as a controlled- not gate. In the absence of QEC, the fidelity 
will decrease as F ^ {l—p)F after a single computational step. With fault-tolerant 
QEC, using a t-error correcting code and assuming the noise is stochastic and un- 
correlated across different qubits, the decrease is modified to F ^ (1 — 
where m is a number of order 1000 which arises from the number of operations in 
the network needed to measure the syndromes. 

Fault tolerant QEC proceeds by using suitable encoding, such as a large 
CSS code, with careful construction of the network of operations used to extract 
syndromes. In particular, we need to track the routes by which errors can propagate 
from one qubit to another when a quantum gate acts between them. Space does not 
permit a full discussion of this here, the reader is referred to [19, 20, 5, 8, 18]. An 
important point is that different types of error propagate differently. For example, 
an X error on the control bit of a cont rolled-gate such as “controlled-NOT” will 
propagate to the target bit, but a Z error will not {^Xi 2 Xi = X 1 X 2 ^X \2 while 
^Xi 2 Zi = Zi ^Xi 2 , where ^X \2 is a controlled-NOT from bit 1 to bit 2). A Z error 
meanwhile propagates the other way, i.e. from target to control. The main trick 
is to prepare ancilla qubits in partially corrected states, which have undergone an 
X- but not Z- error correction, for example, and then use them as control-bits 
in a set of controlled-NOT operations with the data qubits whose full correction 
is desired. The Z- errors propagate from the data to the ancilla, and subsequent 
meaisurement of the ancilla yields the syndrome. This does cause extra noise in 
the data, but only to a degree proportional to the size of the network (number of 
qubits and operations), and the network is efficient. At sufficiently low noise levels 
the syndrome is more likely to be correct than not, and in this situation a simple 
repetition of the whole syndrome extraction will yield a group of syndromes such 
that the probability the majority of them are correct approaches 1 exponentially 
with the size of the group. Overall, then, while the whole process of syndrome 
extraction increases the noise in the data to a polynomial degree, it permits an 
error correction which then suppresses the errors to an exponential degree. This 
completes the correction of Z errors in the data. The correction of X errors then 
proceeds similarly, and the whole process is carried out repeatedly [18]. 
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5 Thermodynamics of QEC 

A useful overall insight into QEC is afforded by comparing it to a heat engine 
(figure 1). The quantum computer is a set of physical qubits (a quantum register) 
such as nuclear spins, with controlling machinery such as electrodes and laser 
beams. The environment and the controlling machinery together act as a hot 
thermal reservoir. The quantum register is warm, and is continuously heated by the 
uncontrolled heat flow from the hot reservoir. It is cooled essentially by connecting 
it to the prepared ancilla qubits which are cold. However, this connection is very 
clever, in two respects. First it is such that only the random heat can flow to 
the ancilla, not the information about the logical state of the register. The ancilla 
learns which sub-space the register is in (i.e. the encoded Hilbert space or one of 
its co-spaces), but not where it is within that sub-space. This is only possible if 
the ancilla is prepared in a special entangled state (a quantum codeword). 




Figure 1: Thermodynamics of QEC. The diagram is illustrative only. It shows how 
QEC can be understood in part in terms of heat flow. Ancilla bits are prepared in 
low-entropy states, they absorb the entropy from the data bits and lose it to the 
environment during measurements or re- préparât ions. The correcting network is a 
heat engine which only interacts with the noise, not the stored quantum informa- 
tion. This is possible through the properties of QEC codes and subtle features in 
the way the coupling between quantum computer and ancilla is introduced, and 
in the type of cooling of the ancilla which is needed - see text. 

Now, we cannot prepare the required ancilla state except by using noisy 
machinery, and there is a danger of an infinite regress here. The infinite regress is 
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avoided because the ancilla only needs to be cold in one ‘direction’, e.g. it must 
avoid X errors but may have Z errors. This is the second ‘clever’ feature. It means 
the cooling of the ancilla does not require a full QEC of the ancilla, but only a 
partial one which can be accomplished by allowing it to heat in one ‘direction’ 
while it is cooled in the other. 

The connection between thermodynamics and QEC is not just an illustrative 
one, but also can be seen in a more quantitative way in a new type of phase 
transition which may be found in a quantum computer stabilized by QEC [25, 
26]. This is an order-disorder transition which occurs at a critical value of the 
error rate when the computer is being continuously corrected by fault-tolerant 
QEC. It has been suggested that this transition establishes a well-defined boundary 
between quantum and classical regimes, which allows the long-standing quantum 
measurement or interpretation problem to be formulated in a quantitative way [26]. 

6 Detection of correlated physical phenomena 

I have emphasized that the ideas of methods of QEC are interesting whether 
or not they can be applied successfully to stabilizing quantum computers. One 
other area of interest is in the detection of correlated physical phenomena, such 
as many-body effects. Suppose a collection of qubits is subject to a very weak 
collective phenomenon. For example, there could be unexpected many-body corre- 
lated couplings to the surrounding electromagnetic radiation field (nominally in its 
vacuum state) [27]. For computing, the effect would be regarded as an undesired 
noise which we wish simply to avoid. It might even prevent QEC from stabilizing a 
quantum computer. However we can regard QEC instead as a technique to reveal 
such effects. If we apply QEC using a Terror correcting quantum code, we sup- 
press all errors acting on up to t qubits. This acts as a filter which gets rid of the 
noise which would normally dominate the dynamics, and reveals any remaining 
correlated eflfect which acts on f + 1 or more qubits at once. This permits a very 
sensitive detection of weak correlated effects if they exist. 
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Applications of Ideas from Random Matrix Theory 
to Step Distributions on “Misoriented” Surfaces 

T.L. Einstein 



Abstract. 

Arising as a fluctuation phenomenon, the equilibrium distribution of meandering 
steps with mean separation {t) on a “tilted” surface can be fruitfully analyzed using 
results from RMT. The set of step configurations in 2D can be mapped onto the 
world lines of spinless fermions in 1+lD using the Calogero-Sutherland model. The 
strength of the (“instantaneous”, inverse-square) elastic repulsion between steps, 
in dimensionless form, is /3(/3 — 2)/4. The distribution of spacings s{€) between 
neighboring steps (analogous to the normalized spacings of energy levels) is well 
described by a “generalized" Wigner surmise: p/3(0, s) ~ as^ exp(— 6s^). The value 
of j3 is taken to best fit the data; typically 2 < /3 < 10. The procedure is superior to 
conventional Gaussian and mean-field approaches, and progress is being made on 
formal justification. Furthermore, the theoretically simpler step-step distribution 
function can be measured and analyzed based on exact results. Formal results and 
applications to experiments on metals and semiconductors are summarized, along 
with open questions, [conference abstract] 



1 Introductory Synopsis 

For over a decade we have studied the correlation functions of steps on vicinal 
surfaces of crystals, that is to say crystals that are intentionally misoriented from 
high-symmetry directions. Of special experimental interest is the distribution of 
separations of adjacent steps, corresponding to p/ 3 ( 0 ,s) of random matrix theory 
[1]. Traditionally, surface scientists have described it by a Gaussian, with a con- 
troversy arising about the relationship between the width of the Gaussian and 
the strength of the repulsion between steps. This controversy is resolved, and a 
better representation of the step distribution obtained, by characterizing the prob- 
lem with the Calogero-Sutherland models and using the simple expression of the 
Wigner surmise, with [seemingly for the first time] general values of ^ in the range 
of 2 to 10. This range is rarely considered by random-matrix theorists. The step- 
step correlation function resulting from the Calogero-Sutherland models, for which 
exact but numerically intractable solutions exist, can also be applied to data. 

2 Background 

On vicinal crystals there are a sequence of terraces oriented in the high- symmetry 
direction and separated by steps typically one atomic layer high. If the spacing 
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between adjacent steps (the “terrace width”) is denoted i, then the mean spac- 
ing {£) is 1/tan (/), where 0 is the angle by which the surface is tilted from the 
high-symmetry direction. For theoretical modeling it is convenient to think of the 
crystal as being a simple cubic lattice. (The extension to realistic symmetries is 
not difficult but muddies the discussion.) By [“Maryland”] convention, the steps 
run on average in the y direction, so that the step spacings £ are in the x direction. 
(The high-symmetry terrace plane is then z.) For metals it is a decent approxima- 
tion to take the total energy of the system to be proportional to the number of 
nearest neighbor bonds. (For semiconductors corner energies can play a role, but 
these are insignificant for the properties discussed in this paper.) Then the ground- 
state configuration is a set of perfectly straight steps (uniformly spaced because of 
repulsions we shall discuss shortly). At low temperatures, the predominant excita- 
tion are kinks in the steps, each costing energy e. At higher temperatures, atoms 
and vacancies appear on the terraces (each with energy 4e in the simple model), 
but these are neglected here. The resulting model is called the terrace-step-kink 
model (TSK) or terrace-ledge-kink model in the literature. 

In this low-temperature picture, the number of steps is fixed (by the miscut 
angle in an experiment and by screw periodic boundary conditions in a numeri- 
cal simulation). Thus, the steps never start or stop, and they all have the same 
orientation (say up): there are no anti-steps (or down-steps). It is convenient and 
fruitful to imagine the y direction as time-like and to view the configurations of 
steps as the worldlines of a collection of a single kind of particle in one spatial 
dimension (viz. x) evolving in time. For physical reasons, the steps cannot cross 
each other (since that would involve atoms suspended above the terrace held with 
just a couple lateral bonds). Assuming also that the steps cannot merge to form 
double- height steps, we can view the evolving ID particles as spinless fermions. 
(Equivalently, in ID, they can be treated as hard bosons, a viewpoint exploited in 
a study of step bunching [2j. Moreover, the system is highly reminiscent of soliton 
lines in 2D incommensurate crystals [3].) 

The obvious next question is how these spinless fermions interact. A typical 
way to characterize the distribution of steps is by their terrace-width distribu- 
tion (TWD), i.e. the probability of finding the next step a distance £ away. This 
probability distribution is denoted P(s), where s = £/ {£) denotes the step spacing 
normalized by the [only] characteristic length in the x direction. By construction, 
P{s) is normalized and has unit mean. (It ultimately corresponds to p(0, s) of 
random-matrix theory.) For the ground state, P{s) is essentially a delta function 
at s=l. If the steps are imagined as straight (“uncooked spaghetti”) and deposited 
randomly with probability then P{s) becomes exp(-s). In fact, however, at 

finite temperature the steps do meander, leading to well-known entropie (or steric) 
repulsion due to the underweighting in the partition function of steps on the verge 
of crossing. This effect is particularly noticeable as s approaches 0: P{s) now van- 
ishes with power-law behavior rather than growing to unity. For s 1 P(s) must 
decay faster than exp(-s) to preserve the unit mean; analogies with random walk- 
ers suggest exp(-6s^) as the form of the tail [4]. There are many arguments to show 




Vol. 4, 2003 Applications of Ideas from Random Matrix Theory 



S813 






1-' . {*7>P-' V’ } j H 

'V-^-r'V^ /"V Vyfyt 

tP ■" f f^'^y 




‘ >i 

p‘ /' 



Figure 1: Illustration of a vicinal generated by a Monte Carlo simulation of the 
terrace-step-kink (TSK) model, showing that the elementary thermal excitation 
is the kink, a unit excursion of a step in the x direction. (Adapted from Fig. 1 
of Ref. 27) For experimental STM (scanning tunneling microscopy) images of a 
vicinal surface; see, e.g.. Fig. 1 of Ref. 12, Fig. 1 of Ref. 15, and Fig. 1 of Ref. 34. 
In the Gruber- Mullins approximation, only one step meanders, the others being 
straight along the y direction and uniformly spaced. (Cf. Fig. 9 of Ref. 12.) In 
the fermion picture, the y direction is viewed as time-like. In the step continuum 
model, the steps meander continuously rather than by discrete deviations (i.e., the 
kinks are smoothed). 



that the entropie repulsion per length is proportional to (/cbT)^// 3^^; here the step 
stiffness (3 = j3{6) -\- d?(3/d0‘^^ with [orientation-dependent] free energy per length 
conventionally called (3(6) in surface studies. 

In addition to the entropie repulsion, there generally is an elastic interaction, 
also having the inverse-square form due to repulsion between force dipoles 

intrinsic to the steps [5, 6]: Viewed from a continuum picture, the step involves 
a pair of sharp angles which could lower their energy by healing to form more 
obtuse angles. This process leads to a strain field around the step in which atoms 
in the terraces bounding the step (both above and below) tend to move away 
from the step. Atoms between two steps are then frustrated since they are pushed 
in opposite directions by the pair of steps. Thus, they cannot relax fully, leading 
effectively to a repulsion, with A proportional to the product of the step dipoles. 
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In the fermion picture, the strength of the elastic repulsion only enters in the 
dimensionless combination 

i = 0A/{kBT)\ (1) 

The elastic and entropie repulsions do not simply add together: As the strength A 
of the elastic interaction increases, the chance of steps coming near each other de- 
creases, thereby reducing the entropie repulsion. Hence, the dimensionless strength 
of the inverse square repulsion in essence [7] becomes [8, 9] 

i [l + Vl + 4i]" . (2) 

In situations where the surface has a metallic surface state [one which crosses 
the Fermi energy], there can also be an oscillatory [in sign] interaction between 
steps, albeit with a 1/^^ envelope [10, 11]. Such an interaction destroys many of the 
scaling properties, particularly that the form of P{s) is well defined, i.e. that plots 
of {£)P{£) vs. i/{^) are independent of {£). Further discussion of this digression is 
beyond the scope of this paper. 

Knowledge of the elastic strength A is crucial to adequate characterization 
of a stepped surface. It is one of just three parameters (the stiffness p being 
another) of the widely- applied step continuum model [12] and underlies the “2D 
pressure”, which determines surface morphology (e.g., whether steps “bunch”) 
and drives kinetic evolution. It is generally believed that the repulsions can be 
adequately approximated as acting between pairs of steps at the same value of 
y, i.e. instantaneously in the ID fermion representation. This approximation is 
clearly crucial for viability of the fermion approach. 

The simplest way to estimate P{s) was developed over a third of a century 
ago. In treating polymers in 2D, deGennes pointed that the energy of a step (or a 
polymer) could be described as the line integral over path-length ingrements times 
P{0). Expanding to lowest order, we get a constant (the straight step) plus what 
amounts to a kinetic energy proportional to p{dx/dyp, the mass-like stiffness 
times the ID velocity squared. 

In the mean-field (i.e. single active step with all others fixed at spacings 
{£)) approximation [13], Gruber and Mullins (GM) reduced the “free fermion” 
case (A=0) to the familiar elementary quantum mechanics problem of solving the 
Schrôdinger equation for a spinless fermion of mass oc in a hard-wall box of 
length 2{i). The probability density of the ground state (the squared ground-state 
wavefunction) is just Pgm-o{s) = sin^(7rs/2), for 0 < s < 2; remarkably, the 
associated ground-state energy is precisely the entropie repulsion. For A above 
about 3/2, the problem becomes that of a particle in a parabolic potential, i.e. 
a simple harmonic oscillator, giving Pgm oc exp[-(s - 1P/2wqj^], with wgm oc 
^-1/4 (^) |- 24 j These parametric results were dramatically verified for Si(lll): not 
only was the TWD measured at one misorientation well fit by a Gaussian, but the 
TWD at a second misorientation was well described by another Gaussian whose 
width was simply rescaled by the change in (^), with no refitting [15, 16]. 
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However, the Grenoble group pointed out a flaw in Gruber-Mullins approach 
in the limit of very strong A [17]. There the entropie repulsion becomes negligible 
and the steps can be imagined as meandering independently (albeit by a small 
amount). Then the variance of the TWD should not be just but twice that 

amount, since both steps bounding a terrace are meandering. (The actual increase 
is less than two- fold due to lingering anticorrelations.) Hence, a fit of the TWD 
width in the Gruber-Mullins approximation would underestimate Â by a factor of 
over 3 in the limit of very large A. Based on roughening theory, the Saclay group 
[18] concluded that the underestimate, for more general T, was somewhat smaller 
(about 2). 

Meanwhile, Ibach noted that the TWD of the free-fermion distribution could 
be far better approximated by an expression oc exp(— &s^), in essence the Wigner 
surmise for free fermions, than by Pgm-o{s) = sin^(7rs/2). How this comes about 
becomes evident by recasting the preceding model in terms of the celebrated [19] 
Calogero-Sutherland models [20, 21, 22]. 



3 Connection to Random Matrix Theory via 
Calogero-Sutherland Models 



In his study of the spacings of energy levels in nuclei, Wigner’s starting inspira- 
tion was to consider ensembles of dynamical systems governed by different Hamil- 
tonians with common symmetry properties [23]. The three generic Hamiltonian 
symmetries are orthogonal, unitary, and symplectic. The key ingredient is level 
repulsion: two states connected by a non-vanishing matrix elements repel each 
other by an amount determined by the symmetry of the Hamiltonian. While in- 
applicable to average quantities, the approach is appropriate for the fluctuation of 
a large number of energy levels. These fluctuations should become independent of 
the specifics of the level spectrum and the weight factors, and so should exhibit a 
universal form depending only on symmetry. This idea can also be derived from 
maximum-entropy arguments [23]. 

One can draw a correspondence between the fluctuations of energy spacings 
between adjacent energy levels in nuclei and the fluctuations of spatial separa- 
tions between adjacent fermions in ID systems. (In other chaotic systems, as well, 
there is a correspondence between energy and spatial spacings.) The Calogero- 
Sutherland model describes spinless fermions in ID which interact via an inverse 
square potential. Specifically, Sutherland’s version [21] of the Calogero Hamilto- 
nian [20] for the fluctuations of uniformly- spaced fermions on an infinite line is: 



ncs 




- Xi) 



-2+0,2 



J=1 



( 3 ) 



in the limits N oo and ü; ^ 0. (In Calogero’s original model [20], the last term 
((X is summed over particle separations [xj — x if" rather than deviations from 
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integer positions The ground-state wavefunction for this Hamiltonian is 

'î'o= n w 

l<i<j<N \ k=l / 

The ground- state density is recognized as a joint probability distribution func- 
tion from the theory of random matrices for Dyson’s Gaussian ensembles [24]. 

The Sutherland Hamiltonian [2 2] similarly describes spinless fermions on a 
circle of radius L (with L oc), having inverse-square interactions along chords: 



N 



, .P A 



V2 

j=i j 



L 2 



E 

l<j 



. 7t{Xj - Xi) 



(5) 



In this case the ground-state wavefunction has the Jastrow form 



$o = n 

i<j 



7r{xj - Xi) 



/3/2 



, Xj > X^ 



(6) 



With the definition Oi = 2'KXijL, the ground-state density written as 

A9j iOi 1 ^ 



^0 = n I®'”' “ 



(7) 



i<j 



This ground-state density is again a joint probability distribution function from 
the theory of random matrices, now for Dyson’s circular ensembles [25]. 

Prom Eqs. (3) and (5), the dimensionless interaction A can be identified as 
{P/2){P/2 — 1). Conversely, j3 is just the quantity inside the brackets in expression 
(2). These models can be solved exactly only for the special cases /3 = 1, 2, or 4, 
corresponding to orthogonal, unitary, or symplectic symmetry of the ensemble. In 
the literature of random matrix theory [1], the TWD P{s) is often denoted p(0, s), 
the probability of finding two levels separated by s with zero intervening levels. 
Note also that f3 was used in the previous section to denote the step free energy 
per length. (Presumably the letter was chosen to emphasis the similarity to but 
difference from the surface free energy per area 7 .) Dyson [25, 1] chose (3 to denote 
this exponent to indicate the analogy to an inverse temperature (of a Coulomb 
gas on a circle). While we follow this convention here to minimize the chance 
of confusion, in our applications to stepped surfaces the exponent is called g to 
minimize confusion for that audience. (The stiffness P is essentially unrelated to 
the exponent p\) 

Random matrix theory leads to exact solutions for the ground state properties 
of the three special cases, in particular for P{s). There are even prescriptions that 
allow one to generate numerical representations of arbitrary accuracy [26, 27]. 
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However, these expressions involve terms in a series and are not useful for fitting 
experimental data. Wigner surmised that P{s) has the simple form 

Pf3{s) = ( 8 ) 



for the special cases (3= 1, 2, and 4. The constants associated with normalization 
of P{s) and producing unit mean are: 



= 




and 




( 9 ) 



Prom Eq. (8) one can readily find analytic expressions for the moments and other 
measurable properties of P{s) [28]. 



P(s) 




Figure 2: P{s) vs. s = ^/(^) for fbo exact “free-fermion” , A=0 result (solid 
curve), the Gruber-Mullins approximation sin^(7rs/2) (long-short dashed curve), 
and the (3=2 Wigner surmise (Eq. (8)) result (dotted curve), barely distin- 
guishable from the exact result). Offset upward by 0.4 for clarity, an analogous 
plot of the exact result for A=2^ the Gruber-Mullins Gaussian approximation 
(24/7 t^)^/'^ exp(— v^24(s — 1)^) (long-short dashed curve), and the /3=4 Wigner 
surmise result (dotted curve). (Adapted from Fig. 1 of Ref. 31, Copyright (2001) 
with permission from Elsevier). 

The argument for Eq. (8) focuses on the Jacobean associated with a change 
of variables for the Gaussian-ensemble probability distribution function p{H) ~ 
exp[— 6A/'tr(?^^)] from the eigenenergies to the combinations of the matrix ele- 
ments. For an orthogonal ensemble {/3=1) with N=2^ the integrand has a Dirac 
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delta function with argument s — [{hu — ^ 22 )^ + 4 / 1 ^ 2 ] which vanishes for s=0 
only when the two [squared] independent variables do. Hence, P{s) oc s, cor- 
responding to a circular shell in parameter space and leading to j3=l. For uni- 
tary ensembles there is an additional independent parameter since h‘l 2 becomes 
{^ehi 2 )^ + (^m/ 112 )^. Hence, P{s) oc corresponding to a spherical shell in 
parameter space, i.e. (3=2. Exact for A/'=2, these arguments are still fine guides for 
large N [29]: As seen most clearly by explicit plots as in Haake’s text [30], -Pi(s), 
P 2 {s), and Pa{s) are excellent approximations of the exact results for orthogonal, 
unitary, and symplectic ensembles, respectively, and these simple expressions are 
routinely used when confronting experimental data [23, 30]. (The agreement is par- 
ticularly impressive for ^ 2 ( 5 ) and ^ 4 ( 5 ), which are germane to stepped surfaces.) 
Furthermore, Eq. (8) has a similar variance at very large j3 to that predicted by 
the Grenoble group, and more generally approaches the form of a Gaussian for j3 
not too small[31]. 

The essential innovation in our work is the conjecture that the generalized 
Wigner surmise expressed in Eq. (8) for arbitrary values of j3 provides a good 
approximation for these non-special values. For these general values, there is no 
recourse to formal justification from symmetry arguments. (Indeed, it remains 
mysterious whether there is some deep underlying reflection of the random-matrix 
symmetry and the corresponding Calogero- Sutherland models at the special values 
of (3.) The only way to test our hypothesis is to generate numerical data. This task 
has occupied us for some time. We have used both Monte Carlo and transfer 
matrix computations of TSK models [31, 32]. The details of these studies are 
beyond the scope of this paper, but the main result is that the generalized Wigner 
surmise does better than any of the earlier Gaussian approximations in accounting 
for the dependence of the TWD’s variance - the quantity typically measured by 
experimentalists - on the value of A. In particular, the numerical data confirms 
that the proportionality coefficient between the width w of the TWD and 
(or more accurately, between w and (3~^^‘^) is not constant but increases slowly 
with A. 

Obtaining the experimentally measurable variance of P{s) - essentially 
when the Gaussian approximation is viable - analytically from Eq. (8), we [33] 
can use a series expansion to derive an excellent estimate of A from : 



À 




2x 1 27 35 2 



(10) 



with all four terms needed to provide a good approximation over the full physical 
range of A, from near zero up to around 16, corresponding to (3 ranging from 2 up 
to about 8. The Gaussian methods described earlier essentially use just the first 
term of Eq. (10) and adjust the prefactor. 

We have successfully applied these ideas (both direct fits to Eq. (8) and use of 
Eq. (10)) to experimental data by several different groups [31], focussing primarily 
on extensive results for vicinal Cu (001) and (111) obtained by Giesen, Ibach, and 
collaborators at FZ-Jiilich [33, 34]. 
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4 Pair Correlation Functions 

While the TWD is usually the easiest quantity to measure, it is generally not 
the easiest to calculate since it is essentially a many-body correlation function 
due to the demand that there be no fermion between the two fermions separated 
by £ (or s). The pair correlation function h{s) (also called 1 — y( 5 ) in random- 
matrix literature[l]) indicates the probability of finding a fermion a normalized 
distance s from a reference fermion, regardless of whether there are any between 
them. For a “perfect staircase” this would just be a set of delta functions at 
positive integers. More generally, h{s) increases like for s <C 1 just like P{s). 
There are then a sequence of peaks at integer values of s with dips between them. 
The amplitude of the crests and troughs relative to the mean density of fermions 
decreases as s increases. For most purposes, but not here, it suffices to use the 
harmonic approximation [35, 36, 37] (which is essentially similar to the Grenoble 
approximation) : 

{t)hj 3 {s) = ^(Bm7r)^''^exp[-B„(s - m)^]; (11) 

7T^/^ 1 /o \ -/n \ 1 \7 + l 

— = 7 + log(27rm)-c.(27rm)=^Ç-l)^+ 

where 7 ~ 0.577 is Euler’s constant and ci is the cosine integral. Note that Bm 
is proportional to /3, so that peaks become sharper and higher with increasing 
repulsion. Also, Bm decreases slowly but monotonically with increasing m. While 
useful for many applications [35, 36, 37], this approximation for h^{s) proved 
inadequate for deducing (3 from data. 

Forrester [37] found an exact solution for even values of £3. This expression, 
which involves Selberg correlation integrals, takes nearly a page of text, so is not 
reproduced here. We do note that the ratio of the exact prefactor of in /^/^(s) to 
a /3 in P( 3 {s) is close to unity, lending further support to the utility of the generalized 
Wigner surmise expression [38]. 

Subsequently, Ha and Haldane [39, 40, 41] derived an exact solution for any 
rational value of [3. In principle, this result also contains information about the 
“dynamic” correlations between fermions (i.e. between steps at different values of 
y). Unfortunately, their expression is even longer than that of Forrester, and so 
also is not reproduced here. Moreover, it seems to be numerically intractable, so 
of little “practical” use. 

Fortunately, Gangardt and Kamenev [42] recently derived the following 
asymptotic expansion for h{s)\ 




( 12 ) 
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This formula turns out, remarkably, to be useful well below the asymptotic limit, 
and provides a good approximation - much better than the harmonic approxima- 
tion - for s > 1/2; it can be patched onto for s < 1/2. 

In confronting data with Eq. (13), one must beware the limited number of 
steps in experimental images. In our application [43], there were only about a half 
dozen steps per image. Hence, we included an ad hoc linear decay envelope to 
account for this limit on h{s) at large s. The system in question involved steps 
on Si(lll) at very high temperatures, necessitating a clever device to compensate 
for evaporation. For a variety of reasons, the difficulty of the measurement led 
to data that presented exceptional challenges for analysis. One problem was that 
steps sometimes disappeared from the image, leading to the concern that the 
TWD might appear wider than it in fact was (since sometimes second- neighbor 
spacings rather than adjacent ones might be counted). The investigation showed 
that electromigration from the current which heated the sample tended to keep 
the steps apart, leading to correlation functions indicative of a larger [effective] A 
than predicted from extrapolation of results at lower temperature. 



5 Concluding Comments 

We have seen that steps on misoriented surfaces can be represented by fluctuating 
spinless fermions in 1+lD. Thus, step-step correlation functions can be expressed 
in terms of the Calogero-Sutherland model and thereby related to random matrix 
theory. In particular, the terrace width distribution, identified as a joint probability 
distribution, can be well described by the generalized Wigner surmise, evidently 
not only for the special cases related to orthogonal, unitary and symplectic en- 
sembles but for arbitrary fermion repulsion strength. While perhaps reminiscent 
of some interpolation schemes, such a generalization has not, to my knowledge, 
been made before. Furthermore, most of these interpolation schemes have involved 
systems with mixed symmetries (typically orthogonal and unitary) , with values of 
(3 between 1 and 2 (and also between 0 and 1, where /3=0 indicates Poissonian 
[random “uncooked spaghetti”] behavior). For stepped surfaces the physically in- 
teresting range starts near [3=2 and goes past 4 up to 8 or even 10. This range has 
received minimal (if any) theoretical attention. 

While the discovery of exact solutions is always intellectually exciting and 
captivating, they are often of little use in confronting experiments - physical or 
numerical - unless the formulas are numerically tractable. Thus, even when a 
problem is formally solved, neither theoreticians or funding agencies should view 
it as completed until the formalism can be rendered in a way that allows the 
extraction of numbers, even if approximately. 

A major open question is the “deeper meaning” of the generalized Wigner 
surmise of Eq. (8), since there is no evident symmetry to argue for the simple 
expression. Howard Richards has taken the lead in trying to derive this result 
from an effective Hamiltonian [44] . This approach should lead to the development 
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of the form of P{s) for systems with higher-order terms in the repulsion potential 
and even potentials with an oscillatory term such as when metallic surface states 
mediate the interaction. (In systems in which “non- instantaneous” interactions 
(in directions other than x are significant, the usefulness of the mapping to ID 
fermions is likely to break down.) It will be interesting to see whether expressions 
similar to the generalized Wigner surmise arise in other areas of physics. For 
example, while considering the probability distribution of stock market returns in 
the Heston model with stochastic volatility (in econophysics research) , Yakovenko 
showed that the probability distribution of the variance - which is derived from a 
Fokker-Planck equation, has the form of the generalized Wigner surmise [45]! (In 
this case the value of jS was 1.6, so in a different regime from stepped surfaces. 

While Eq. (8) assumes that s is continuous, experiments and numerical simu- 
lations involve discrete lattices. We have shown [31, 32, 33] that so long as is at 
least 4 lattice spacings, there are no significant complications due to discreteness 
and that assuming continuous s does not distort the analysis. In particular, the 
roughening transition to a facet that occur for discrete lattices [18, 47] does not 
happen in the range of physical parameters [32]. 

Another skirted issue concerns how many steps interact. While the entropie 
interaction ipso facto involves just adjacent steps, the Calogero-Sutherland models 
[20, 21, 22] and the Saclay approach [18] assume energetic interactions between 
all steps. The Gruber-Mullins [13, 14] and Grenoble [17, 28] approximations allow 
all steps or just adjacent ones to interact. Most Monte Carlo and transfer ma- 
trix simulations assume that just adjacent steps interact. In the Gruber-Mullins 
approximation, the curvature of the potential involves the inverse 4th power of 
the distance from neighboring steps, so that the effective strength of the repulsion 
increases by (^(4) = tt'^/OO 1.08 when interactions between all steps rather than 
just adjacent ones are included [31]; in the Grenoble approximation it increases by 
about 1.10. These changes are relatively small, but the good agreement is curious 
between the variance predicted by Eq. (8) and our Monte Carlo results with just 
adjacent steps interacting [32]. 

For surface scientists a fundamental objective is to show consistency between 
the value of A deduced from the TWD and that predicted from surface stress using 
Marchenko’s formula [6]. However, most experimental systems do no possess the 
elastic isotropy assumed by that classic result. Hence, this goal has been elusive. 

Another interesting problem is what happens when the mean direction of the 
steps (i.e. y) does not correspond to a high-symmetry direction of the surface. We 
have recently found that the stiffness computed from standard near-neighbor bond 
lattice models (Ising or SOS models) underestimates the experimentally observed 
stiffness by a factor of about 4 for the square- net face of copper [46]. The most 
likely explanations are significant long-range interactions or perhaps local relax- 
ations leading to 3-atom effects. Furthermore, it is by no means obvious that the 
assumption of only “instantaneous” interactions between steps is justifiable for 
substantial azimuthal misorientation. 
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Integrable Flows in c = 1 String Theory 

Ivan K. Rostov 



Abstract. In these notes we review the method to construct integrable deformations 
of the compactified c = 1 bosonic string theory by primary fields (momentum 
or winding modes), developed recently in collaboration with S. Alexandrov and 
V. Kazakov. The method is based on the formulation of the string theory as a 
matrix model. The flows generated by either momentum or winding modes (but 
not both) are integrable and satisfy the Toda lattice hierarchy^ . 



1 Introduction 

The c = 1 string theory (the theory of random surfaces embedded in 1 dimension) 
can be constructed as the collective theory for a 1-dimensional N x N hermitian 
matrix field [1]. The invariant sector of the matrix model is described by a 

non-relativistic quantum mechanics of free fermions in an upside-down quadratic 
potential. The mapping to free fermions allows to calculate virtually any quantity 
in the string theory in all orders in the genus expansion. 

The elementary excitations of the c = 1 string represent collective excita- 
tions of free fermions. The tree- level 5-matrix can be extracted by considering the 
propagation of “pulses” along the Fermi sea and their reflection off the “Liouville 
wall” [2]. 

The exact nonperturbative 5-matrix has been calculated by Moore et al [3]. 
Each 5-matrix element can be associated with a single fermionic loop with a 
number of external lines. One can then expect that the theory is also solvable in 
a nontrivial, time-dependent background generated by a finite tachyonic source. 
Dijkgraaf, Moore and Plesser [4] demonstrated that this is indeed the case when 
the allowed momenta form a lattice as in the case of the compactified Euclidean 
theory. In [4] it has been shown that the string theory compactified at any radius R 
possesses the integrable structure of the Toda lattice hierarchy [5] . The operators 
associated with the momentum modes in the string theory have been interpreted 
in [4] as Toda flows. 

The explicit construction of these Toda flows is an interesting and potentially 
important task, because this would allow us to explore the time-dependent string 
backgrounds. This problem was recently solved in [6]. The method used in [6] is 
conceptually similar to the method of orthogonal polynomials of Dyson- Mehta in 

^ Based on the talks presented at the James H. Simons Workshop on Random Matrix Theory 
Stony Brook, February 20 - 23, 2002, and the International Conference on Theoretical Physics 
Paris, UNESCO, July 22-27, 2002 
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the interpretation of M. Douglas [7] , which has been used to solve the c < 1 matrix 
models [8, 9]. The construction of [6] allows to evaluate the partition function of 
the string theory in presence of finite perturbation by momentum modes. The 
simplest case of such a perturbation is the Sine-Liouville string theory^. 

Besides the momentum modes, the compact ified string theory contains a 
second type of excitations associated with nontrivial windings around the target 
circle. In the world- sheet description, the string theory represents a compact ified 
gaussian field coupled to 2d quantum gravity. Then the momentum and winding 
modes are the electric and magnetic operators for this gaussian field. It is known 
[11, 12] that the winding modes propagate in the non-singlet sector of the c = 

1 matrix model, which is no more equivalent to a system of fermions. Explicit 
realization of the winding modes can be given by gauging the compactified matrix 
model. Then the winding modes are realized as the Polyakov loops winding around 
the Euclidean time interval [13]. 

The electric and magnetic operators are exchanged by the usual electric- 
magnetic duality R 1/R called also T-duality. If the momentum modes of the 
compactified string theory are described by the Toda integrable fiows, then by T- 
duality the same is true also for the winding modes. A direct proof of is presented 
in [13], where it is also shown that the the grand-canonical partition function of 
the matrix model perturbed by only winding modes is a r- function of the Toda 
lattice hierarchy. Applying the T-duality backwards, we conclude that the same is 
true for the partition function of the theory perturbed by only momentum modes. 

A special case represents the theory is compactified at the self-dual radius R = 
1 , it is equivalent to a topological theory that computes the Euler characteristic of 
the moduli space of Riemann surfaces [14]. When R = 1 and only in this case, the 
partition function of the string theory has alternative realization as a Kontsevich- 
type model [4, 15]. 

In this notes we summarize the results concerning the Toda integrable struc- 
ture of the compactified c = 1 string theory obtained in refs. [13, 16, 17, 6]. We will 
follow mostly the last work [6] , but will present the construction in the framework 
of the Euclidean compactified theory, while [6] discussed the theory in Minkowski 
space. 

2 The matrix model for the c = 1 string theory revisited 

2.1 Partition function of the Euclidean theory compactified 
on a circle with radius R 

The c = 1 string theory (see Appendix A) describes the critical behavior of the 
large- A" matrix quantum mechanics (see the review [1] and the references therein). 
The critical point is associated with the maximum of the matrix potential. In 

^It is conjectured [10] that such a perturbation can lead to a target space geometry with a 
horizon. 
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the scaling limit, which is dominated by dense planar graphs, the potential can 
be approximated by a quadratic one (with the wrong sign), which is stabilized 
by imposing a cut-off wall far from the top. The relevant piece of the matrix 
Hamiltonian is thus 

Ho = \tt(P^-M% ( 2 . 1 ) 

where P = —id/dM and Mij is an TV x hermitian matrix variable. The cosmo- 
logical constant ji is introduced as a “chemical potential” coupled to the size of 
the matrix N, which should be considered as a dynamical variable. 

We will consider the Euclidean theory with periodic Euclidean time x = —it. 
The time interval 

0Rf3 = 2 ttR (2.2) 

can be also interpreted as inverse temperature. The matrix model describes the 
c = 1 string only when j3 > (3 kt = 47t. At /3 = Pkt the winding modes associated 
with the N{N — 1) angular degrees of freedom becomes important and produce a 
Berezinski-Kosterlitz-Thouless-like phase transition to a c = 0 string theory [11]. 
The angular degrees of freedom can be interpreted as the winding modes of the 
string, which represent strings winding several times around the target circle . 

A sensible matrix model for /3 < Pkt can be constructed by introducing an 
additional SU{N) gauge field A(x), which projects to the sector free of winding 
modes [12]. The partition function of the compactified system is 



z{^i) = ^ e-I^^ZN, (2.3) 

A^>0 

where Zj^ is given by a functional integral with respect to the one-dimensional 
matrix hermitian fields M{x) and A(x) with periodic boundary conditions 

Zn = j VPVMVAe~^^°. (2.4) 

periodic 

The action functional is 

rP 

So = Tt / (îPVaM - Ho)dx, (2.5) 

Jo 

denotes the covariant time derivative 

VaM = da:M -i[A,M]. (2.6) 

The gauge field can be also used to study perturbations by winding modes, with 
appropriately tuned coupling constants, so that the long range order is not de- 
stroyed. It was shown in [13] that the winding modes couple to the moments of 
the SU (N) magnetic flux associated with the gauge field A. 
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2.2 Chiral quantization 

The analysis of the matrix model simplifies considerably if it is formulated in terms 
of the chiral variables [6] 






M±P 



(2.7) 



representing N x N hermitian matrices. The Hamiltonian in the new variables is 



Ho = -^Tr{X+X.+X.X+), 



( 2 . 8 ) 



where the matrix operators X± obey the canonical commutation relation 



[{X+)),{X-)f] = -i5tS^. (2.9) 

The partition function for fixed N is now given by a path integral with respect to 
the one-dimensional hermitian matrix fields X^{x)^ X_(x) and A{x)^ satisfying 
periodic boundary conditions. The action (2.5) reads, in terms of the new fields, 

So = Tr (^iX+VaX- - ^X+X^^ dx. (2.10) 

The partition function (2.10) depends on the gauge field only through the global 
holonomy factor, given by the unitary matrix 

r= Te^ ( 2 . 11 ) 

This can be seen by imposing the gauge A = 0, which can be done at the expense 
of replacing the periodic boundary condition X±(^) = X±(0) by a SL/ (A/")-twisted 
one 

X+(/3) = X+(0), X.(j3) = n-^X.(O) Q. (2.12) 



2.3 Gauge-invariant collective excitations: momentum and winding modes 

The momentum modes Vn/j^ of the matrix field are gauge- invariant operators with 
time evolution Vn/R(t) = e~'^^^^Vn/R(0). Here n can be any positive or negative 
integer. The spectrum of momenta is determined by the condition that the time 
dependence is periodic in the imaginary direction with period 27tR. The operators 
satisfying these conditions are 



Ki/R = 



^±znx/R TrX±(x)^^^ = 



Tr 



if n > 0 
if n < 0. 



(2.13) 



This form of the left and right momentum modes was first suggested to our 
knowledge by Jevicki [18]. 
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As it was shown in [13], the winding modes VnR are associated with the 
SU{N) magnetic flux through the compactified spacetime 

VnR = TvÜ^ (2.14) 

where Çl is the U{N) holonomy (2.11). From the world sheet point of view the 
winding operator V^r creates a puncture with a Kosterlitz-Thouless vortex, i.e. a 
line of discontinuity 2tï R n starting at the puncture. 

2.4 The partition function as a three-matrix integral 

In the A — 0 gauge the path integral with respect to the fields X±{x) is gaussian 
with the determinant of the quadratic form equal to one. Therefore it is reduced to 
the integral with respect to the initial values X± — A'i(O) of the action (2.10) eval- 
uated along the classical trajectories, which satisfy the twisted periodic boundary 
condition (2.12). Therefore the canonical partition function of the matrix model 
can be reformulated as an ordinary matrix integral with respect to the two hermi- 
tian matrices A_t- and A_, and the unitary matrix 

Zn = j DX+DX-dn ^2.1^) 

where we denoted 

Ç (2.16) 

A general perturbations by momentum modes can be introduced by changing 
the homogeneous measures DX± to 

[DX±] = DX± (2.17) 

where we introduced the matrix potentials 

U±{X±) =RY, (2-18) 

n>0 

Similarly, a general perturbation by winding modes can be introduced by changing 
the invariant measure DVt on the group U{N) to 

[£»Q] = DO. (2.19) 

where we introduced the following matrix potential 

i/(n) = fi". (2.20) 

n^O 

In case of a generic perturbation with both momentum and winding modes, 
the three-matrix model (2.15) is not integrable. However, it can be solved exactly 
in case of an arbitrary perturbation only by momentum or only by winding modes. 
In the following we will discuss in details these two integrable cases. In both cases it 
is possible to integrate with respect to the angles and the matrix partition function 
reduces to an eigenvalue integral. 
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2.5 Integration with respect to the angles 

Let us consider the case of a perturbation with only momentum modes. Then the 
gauge field plays the role of a Lagrange multiplier for the condition = 

0 and the two matrices can be simultaneously diagonalized. Applying twice the 
Harish-Chandra-Itzykson-Zuber formula we can reduce the matrix integral to an 
integral over the eigenvalues , . . . , of the two hermitian matrices matrices 
X±. We write, after rescaling the integration variables, 

ZN{t)= J Ul^xt][dx]:] det(e«74) det (2.21) 

-oo 

where the measures are defined by 

[dx±] = dx± (2.22) 

Then the grand canonical partition function can be written as a Fredholm deter- 
minant 

z(/i,l) = Det(l + e“'^^K+K_) (2.23) 

where 

[K+f]{x-) = J[dx+]e^-^-~f{x+) 

\K-f]{x+) = J[dx-]e-^‘>^^--f{x-). (2.24) 

3 The partition function of the non-perturbed theory 

3.1 Eigenfunctions of the non-perturbed Hamiltonian 

In the absence of perturbation, the partition function is given by the Fredholm 
determinant 

Z(m) = Det(l + (31) 

and can be interpreted as the grand canonical finite-temperature partition function 
for a system of non-interacting fermions in the inverse gaussian potential. The 
Fredholm determinant (3.1) can be computed once we know a complete set of 
eigenfunctions for the one-particle Hamiltonian 

Ho = -^(x'VxL 4- xLx+). (3.2) 

The canonically conjugated operators x+ and xL 

x'Vx'L — xl-x'^- — —i 



(3.3) 
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are represented in the space of functions of as 

x\ = x^- = 

and in the space of functions of x_ as 

x'- = x_, x\ = —idx _ . 



Since the inverse gaussian potential is bottomless, the spectrum of the Hamil- 
tonian is continuous 

Hüil)l{x±) = ^i{x±dx^ + l/2)il;l{x±) = Eipl{x±) {E &R). (3.4) 

The eigenfunctions in the Xi -represent at ion, which we denote by Dirac brackets 

’>P±{x±) = {E\x±) 

are given by 

{E\x±) = e=Fi*'Ao ±*^-5 (3 5 ) 

where the phase factor (j)o = (j){){E) will be determined in a moment^. The solutions 
(3.5) have a branch point at = 0 and are defined unambiguously only on the 
positive real axis. For each energy E there are two solutions, since there are two 
ways to define the analytic continuation to the negative axis. The wave functions 
relevant for our problem have large negative energies and are supported, up to 
exponentially small terms due to tunneling phenomena, by the positive axis^. 
Therefore we define the analytic continuation to negative axis as 

{E\ — x+) = (^E\e~'^^ x^) = {E\x^) (a:+ > 0) 

{E\-x-) = {E\È^x^) {E\x^) (x_ > 0). (3.6) 

The functions (3.5) with E real form two orthonormal sets 

oo 

J dx±{E\x±){x±\E') = S{E - E'). (3.7) 

— OO 

We will impose also the bi-orthogonality condition 

oo 

II ^^^^iE\x+)e^^^^-{x^\E')=5{E~E'), (3.8) 



^The eigenfunctions (3.5) play in the chiral quantization the same role as the parabolic cylinder 
functions in the standard quantization, based on the original Hamiltonian (2.1). 

^This corresponds to the theory of type I of [3]. 
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which fixes the phase (/>o 










(3.9) 



The phase defined by (3.9) has in fact a small imaginary part, which we neglect^. 
This is because the theory restricted to the Hilbert space spanned on these states 
is strictly speaking not unitary due to the tunneling phenomena across the top 
of the potential. In the following we will systematically neglect the exponentially 
small terms With this accuracy we can write the completeness condition 



/ dE{x+\E){E\x-) = (3.10) 

J V 27T 



where the right-hand side is the kernel of the inverse Fourier transformation re- 
lating the X-^ and x_ representations. For real energy the phase (/>o is real (up to 
non-perturbative terms), but we will analytically continue it to the whole complex 
plane. In this case the reality condition means 

= ^o(^). (3.11) 



3.2 Cut-off prescription, density of states and free energy 

To find the density of states, we need to introduce a cutoff A such that A >> /u. 
This can be done by putting a completely refiecting wall at distance = 

V^2A from the origin. Since the wall is completely refiecting, there is no fiow of 
momentum through it, = 0. Thus the cutoff wall is equivalent to the 

following boundary condition at = X- = VÂ 

^:f(\/Â) - t/^^(VX) (3.12) 

on the wavefunctions (3.5). This condition is satisfied for a discrete set of energies 
En ( n G Z) defined by 

(j)o{En) - En logA -h 27m = 0. (3.13) 

From (3.13) we can find the density of the energy levels in the confined system 

logA 1 d^E) 

^ 27t 27t dE 

Now we can calculate free energy R) = logZ(/i, R) as 
T{fi,R)= [ c!£;p(£:)log [l + e-«^+®)] 

J — OO ^ 

^The pure phase factor would be ^ + 1/2) = yjl + e'^Egt4'o(E) 



(3.14) 

(3.15) 




Vol. 4, 2003 Intégrable Flows in c = 1 String Theory 



S833 



with the density (3.14). Integrating by parts in (3.15) and dropping out the A- 
dependent piece, we get 

= ~ I dME)\og (l + e-^(^+^)) = -R 

(3.16) 

We close the contour of integration in the upper half plane and take the integral 
as a sum of residues. This gives for the free energy 

R=-i • (3.17) 

r = n +| >0 

Using the explicit form (3.9) of we can represent the free energy as a sum over 
pairs of positive half-integers 

oo 

T{fi,R)— ^ log (/i + ir + is/i?) (3.18) 

T’,s'>l/2 

which is explicitly invariant under the T-duality R 1/ R, ^ Rfi. From here it 

follows that the free energy satisfies the functional equation 

4sin (^^/2i^) sin (^^/2) .F(/i, i?) = —\ogfi. (3.19) 

Prom (3.17) we can express the phase factor in terms of the free energy 

00 = -2sin {d^/2R)J^{/i, R). (3.20) 

3.3 Calculation of the free energy using the ^-representation 
of the operators x± 

Let us now give an alternative, algebraic derivation of the free energy of the non- 
perturbed theory. For this purpose we will write the canonical commutation rela- 
tion (3.3) for the matrix elements of the operators x^± in ^^-representation. 

The action of the operators x± to the wave functions is equivalent to a shift 
of the energy by an imaginary unit (we consider the wave functions as analytic 
functions of E) and a multiplication by a phase factor: 

{E\x±\x±) = (3 21 ) 

As a consequence, the operators are represented by the finite difference oper- 
ators acting in the F^-space 

^ ( 3 _ 22 ) 

The Heisenberg relation (3.3) is equivalent to the condition 

me + \i) - ME = - log(-E) 
which is equivalent to the functional constraint (3.19). 



(3.23) 
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4 Integrable perturbations by momentum modes 

4.1 One-particle eigenfunctions and the density of states 
of deformed system 

Now we will consider perturbations generated by momentum modes VnjR- As we 
argued in sect. 2.4, this can be achieved by deforming the integration measures 
dx± to 

[dx±] = dx±eyi^{±iU±{x±)) , U±{x±) = R'^^t±kX^J.^ . (4.1) 

k>i 

In this section we will show that such a deformation is exactly solvable, being gen- 
erated by a system of commuting flows Hn associated with the coupling constants 
The associated integrable structure is that of a constrained Toda Lattice hi- 
erarchy. The method is very similar to the standard Lax formalism of Toda theory, 
but we will not assume that the reader is familiar with this subject. It is based on 
the possibility to describe the perturbation by vertex operators as deformations of 
the L’-representation of the canonical commutation relation. 

The partition function of the perturbed system is given by the Fredholm 
determinant (2.23), where the integration kernels (2.23) and (2.24) are defined with 
the deformed measures (4.1). The deformed kernel is diagonalized by a complete 
orthonormal system of wave functions^ ^±(^±) 

j dæ±i£'f(x±)vl/|(x±) = 6(E - E') (4.2) 

which satisfy the following three conditions: 

1) they are eigenfunctions of the evolution operator relating the points x = 0 and 
X = p 

g-/3Ho^| = , (4.3) 

2) they behave at infinity as 

^|(x±) ~ (4.4) 

3) they satisfy the bi-orthogonality condition (3.8) 

II (x+) = S{E - E'). (4.5) 

The new eigenfunctions are not necessarily eigenvalues of the Hamiltonian Hq 
itself. Indeed, the evolution operator acts trivially on any entire function of 

1 / 7 ? 

x^ . In particular, the condition (4.4) is compatible with the condition (4.3). 

® As before, we understand the completeness in a weak sense, i.e. up to non-perturbative terms 
due to tunnelings phenomena, which we have neglected. 
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Once the basis of the perturbed wave functions is found, the logarithm of 
the Fredholm determinant can be calculated as the integral (3.15), where the new 
density of states is obtained from the phase (l){E) in the asymptotics (4.4) 



p{E) = 



1 d(f> 
2 tt dE 



(4.6) 



Therefore the phase (l>{E) contains all the information about the perturbed system. 
It is related to the free energy by 



T = -i ^ 4>{ir/R- p) . (4.7) 

r>l/2 

(/)(— /x) = 2sm{hd^/2R)E{/j,^ R). (4.8) 



4.2 Dressing operators 

Now we proceed to the actual calculation of the phase (f). The method is a general- 
ization of the algebraic method we have used in sect. 3.3. First we remark that the 
perturbed wave functions are given by the rhs of (4.4) up to factors of the form 



W±{x±) = exp 





(4.9) 



which satisfy the second condition (4.3) and tend to 1 when x± ^ oo. The un- 
known coefficients Vn) and the phase 0 are determined functions of E and the 
couplings tji by the third condition (4.5). 

It follows from (3.22) that the deformed wave function 

(x±) = (4.10) 

can be obtained from the bare wave functions (3.5) by acting with two finite- 
difference operators in the Ê'-space VV+ and VV_ 

«-f(x±) = (£:|e±5*<#-ovv±|x±). (4.11) 



The explicit form of the dressing operators >V± is obtained by replacing x± 
(4.10). The dressing operators are unitary 

w+vv]. = w_wl = 1 (4.12) 

since they relate two orthonormal systems of functions. Further, the bi-orthogon- 
ality condition (4.5) is equivalent to the identity 

Wle'^°VV+ = 1. (4.13) 

The identity (4.13) means that the product >V_VV^^ does not depend on the 
perturbation, which is a general property of all Toda lattice systems [5]. 
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4.3 Lax operators and string equation 

We have seen that the partition function is expressed in terms of the phase (p{—/j,). 
Therefore we assume that the energy is at the Fermi level E = —fi, and consider 
all variables as functions of instead of E. Let us denote by u the operator 

Û = (4.14) 

shifting the variable fi by i. The operators a) and // satisfy the Heisenberg- Weyl 
commutation relation 



[w,n]=Ûj, [w \^]=-c2) ^ (4.15) 

Now let us consider the representation of these commutation relations in the 
perturbed theory. The dressing operators W± are now exponents of series in u 
with //-dependent coefficients 

^ ri> 1 ^ n>l ^ 

The operators 

L+ = L_ = W-LJ-^WZ\ 

M+ = W+//>V+\ M_ = W_//Wr^ (4,17) 

known as Lax and Orlov- Schulman operators satisfy the same commutation rela- 
tions as the operators uj and 

[L+,M+] = zL+, - -zL_. (4.18) 

The Lax operators L± represent the canonical coordinates in the basis of 
perturbed wave functions, 

(£;|e±i*'^°>V±L±|a;±) = {£:|e±i*‘^°W±a;±|x±} (4.19) 

while the Orlov-Shulman operators M± represent Hamiltonian Hq = — l(a;+af_ + 
xLx%). Therefore the L and M operators are related also by 

M+=M- = \{L+L- + L-L+). (4.20) 

The last identity is not satisfied automatically in the Toda system and represent an 
additional constraint analogous to the string equations in the minimal models of 
2D quantum gravity. The relation (4.20) is proved by inserting the identity (4.13) 
in the bare relation for E = — //, satisfied by the operators (3.22). The string 
equation can be also written as the Heisenberg commutation relation between the 
two Lax operators 

[L+,L_l = -i 

which also follows directly from (3.3). 



(4.21) 
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The operators M± can be expanded as infinite series of the L-operators. 
Indeed, as they act to the dressed wave functions as 

(£;| W± M±|x±) = ±i(x±d,^ - l/2)4'f(x±) 

= i'^kt±k M v±fc j ^'±(a:±)- (4.22) 

\k>l k>l J 

we can write 

= + M + (4.23) 

k>l k>l 

In order to exploit the Lax equations (4.18) and the string equations (4.20) 
we need the explicit form of the two Lax operators. It follows from (4.16) that L± 
can be represented as series of the form 



L+ =e Û; + ük 



n/R I 



k>l 



L_ (2;-^ + ^a_ 



k ^ 



-l+n/R I -Z0/2 



k>l 



(4.24) 



4.4 Integrable flows 

Let us identify the integrable flows associated with the coupling constants tn> Prom 
the definition (4.17) we have 



dt^L± = [Hn,L±i 



(4.25) 



where the operators Hn are related to the dressing operators as 

Hn = (0t„W+)W;i = (0t„W_)WZ^ (4.26) 



The two representations of the flows Hn are equivalent by virtue of the relations 
(4.12) and (4.13). A more explicit expression in terms of the Lax operators is 
derived by the following standard argument. Let us consider the case n > 0. Prom 
the explicit form of the dressing operators it is clear that Hn = + 

negative powers of The variation of tn will change only the coefficients of the 
expansions (4.24) of the Lax operators, preserving their general form. But it is clear 
that if the expansion of Hn contained negative powers of its commutator 

with L- would create extra powers Therefore 



H±n 



(L 



n/R 

± 






n > 0, 



(4.27) 
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where the symbol ( ) > means the positive (negative) parts of the series in the shift 
operator and ( )o means the constant part. By a similar argument one shows 
that the Lax equations (4.25) are equivalent to the zero-curvature conditions 

dt^Hn - dt^Hm - [Hm.Hn] = 0. (4.28) 



Equations (4.25) and (4.25) imply that the perturbed theory possesses the 
Toda lattice integrable structure. The Toda structure implies an infinite hierarchy 
of PDE’s for the coefficients Vn of the dressing operators, the first of which is the 
Toda equation for the phase (/>(/z) = (j){E = —fi) 



._d ^ 

^ dti dt-i 



M = 









(4.29) 



The uniqueness of the solution is assured by appropriate boundary conditions [13], 
which are equivalent to the constraint (4.21). 



4.5 Representation in terms of a bosonic field 

The momentum modes can be described as the oscillator modes of a bosonic field 
(p{x-^,x-) = (f^{x^) -h (p-{x-). The bosonization formula is 






E=—iu, — if2 



(æ±) = Z-l -2 



(4.30) 



where Z is the partition function and 

<p±{x±) = +R'^tkx'^^ + +/ilogx± - 



fc>l 



fe>l 



(4.31) 



Then by (4.22) the operators M± are represented by the currents x±d±ip 

Mi^'|(x±)|E=_^_,/2 = .2“' x±d±ip ■ Z. (4.32) 



4.6 The dispersionless (quasiclassical) limit 

Let us reintroduce the Planck constant by replacing fi }i/h and consider the 
quasiclassical limit /I — > 0. In this limit the integrable structure described above 
reduces to the dispersionless Toda hierarchy [19, 20, 5], where the operators /i and 
Cj can be considered as a pair of classical canonical variables with Poisson bracket 

= (4.33) 

Similarly, all operators become c-functions of these variables. The Lax operators 
can be identified with the classical phase space coordinates x±^ which satisfy 



{æ+,a:_} = 1. 



(4.34) 
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The two functions x±{uj^E) define the classical phase-space trajectories as func- 
tions of the proper time variable r = logo;. 

The shape of the Fermi sea is determined by the classical trajectory corre- 
sponding to the Fermi level E = —fi. In the non-perturbed system the classical 
trajectory is 

X-{uj) = ^fjl (4.35) 

and the Fermi sea has a hyperbolic shape 

x^x- = jji. (4.36) 

In the perturbed theory the classical trajectories are of the form 

X± = L±{uj,^) (4.37) 

where the functions L± are of the form 



L±(w,|u) = l + '^a±kil^) 



ÜJ 



rp/e/iî 



k>l 



(4.38) 



The flows Hn become Hamiltonians for the evolution along the ‘times’ The 
unitary operators W± becomes a pair of canonical transformations between the 
variables a; , /x and L±,M±. Their generating functions are given by the expectation 
values S± = Z~^ ^±(^±) ■ Z of the chiral components of the bosonic field (j) 

S± = ±RY,t±k + fi\ogx± (4.39) 

k>l k>l ^ 

where Vk = dEjdtk- The differential of the function S± is 

dS± = M±d\ogx± + logo; dfi -h Hndtn- (4.40) 

n^O 



If we consider the coordinate o; as a function of either or x - , then 

^ _ ^d^s+{x+) _ ( 4 . 41 ) 



The classical string equation 



= M+ = M_ (4.42) 

can be written, using the expansion (4.23) of M±, as 

x+x^ = 'V ktk x^_l^ + p + V xzj"^^ 

k>\ *>1 

= kt-k x^i^ + . 

fc>l k>l 



X^X- 



(4.43) 
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The first of these expansions is convergent for sufficiently large and the second 
one for sufficiently large x_. Comparing the two equations one can extract the 
form of the Fermi surface. Technically this is done as follows [17]. First, note that 
if all t±k with k > n vanish, the sum in (4.38) can be restricted to A: < n. Then it 
is enough to substitute the expressions (4.38) in the profile equations (4.43) and 
compare the coefficients in front of 

The two expansions (4.43) can be combined into a single equation 

x-x- - ^ ktkHk{uj) = II (4.44) 

where 

/r±fc(c.) = [L’i\u;)]> + {k > 0). (4.45) 

The left hand side can be interpreted as the Hamiltonian for the perturbed system. 
It defines the profile of the perturbed Fermi sea, which is a deformation of the 
hyperbole (4.36). 

4.7 One-point correlators in the dispersionless limit 

It follows from logü; = d^S±{x±) that the /^-derivative of the one- point correlators 

(4.46) 

is given by the contour integrals 

where the closed contour of integration in the variable goes along the arc 

between uj = and uj — (Note that the integrand is expanded in Laurent 

series in 

4.8 Example: sine-Gordon field coupled to 2D gravity 

The simplest nontrivial string theory with time-dependent background is the sine- 
Gordon theory coupled to gravity known also as Sine-Liouville theory. It is ob- 
tained by perturbing with the lowest couplings t\ and t_i. In this case 

x± = -h a±icj^i). (4.48) 

and (4.43) give 

fi ~ ~ = 1, 

= tzfi 



(4.49) 
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The first equation is an algebraic equation for the the susceptibility 

Mo = dlT = -Rd^<p. 

It was first found (for the T-dual theory) in [13]. This algebraic equation resumes 
the perturbative expansion found in [21]. The equation (4.48) with a± given in 
(4.49) was first found in [16] by integrating the Hirota equations for the Toda 
hierarchy with the boundary condition given by the non-perturbed free energy 
(3.18). 



5 Integrable perturbations by winding modes 

5.1 The partition function as integral over the U (N) group 

Let us consider a perturbation of the c = 1 string theory by only winding modes. 
The relevant variable in this case is the gauge field A{x) and the matrix model 
(2.4) can be viewed as a two-dimensional gauge theory defined on the disc having 
as a boundary the compactified Euclidean time interval. The only nontrivial degree 
of freedom in such a theory is the holonomy factor around the circle (2.11). The 
canonical partition function is given by the U (AZ')-integral 

^ f m] 

^ y |g-i/2 n®7-çi/2 /xü| ^ 

where I is the unit N x N matrix. The integrand depends only on the eigenvalues 
zi^ . . . , zn of the unitary matrix Q and the partition function becomes 



ZN{in) = - q 




(5.2) 



where the integration goes along the unit circle |>2;| == 1 with measure 



[dz] = dz 



This representation of the partition function was studied (for the non-perturbed 
theory) by Bulatov and Kazakov [12]. Note that the absolute value can be aban- 
doned, since the integrand is homogeneous. 

The grand canonical partition function can be written as the absolute value 
of a Fredholm determinant 

= + (5.4) 

where the Fredholm kernel is defined by the contour integral 



(Kf)iz) 



[dz] f{z') 

2iri q^t'^z — q~^^‘^z' 



(5.5) 




S842 



Ivan K. Rostov Ann. Henri Poincaré 



The integral has poles when Zi = qzj and should be evaluated by adopting a 
prescription for surrounding the poles. The prescription used in [12] is to add a 
small imaginary part to R so that \q\ < 1. In the case of real g 6 [0, 1) the partition 
function was studied by M. Gaudin [22], 



5.2 Evaluation of the partition function of the non-perturbed theory 

Boulatov and Kazakov showed in [12] that the prescription for the contour inte- 
gration gives for the Fredholm determinant (5.4) the same result as (3.18), up to 
non-perturbative terms (9(e“^^). Here we recall their calculation. The integration 
kernel (5.5) acts to the monomials z'^ as 



Kz^ 



qn+\z^ ifn>0 
0 if n < 0 



(5.6) 



and the Fredholm determinant (5.4) reads 

r>0 

The free energy then is equal to an infinite sum 

( 1 

rw = i: bed + - E * „ 

n>0 m>l ^ ^ 



The last sum is the result can be written as the integral 




4 sinh{yR) sinh(y) * 



(5.8) 



along a contour C circling around the poles y = in, n > 0 only. If |g| < 1, then C 
goes from — cxd to 0 and then up the imaginary axis to zoo. 

Note that if we close the contour around the poles of sinhTri^g, then the result 
will be the sum over the residues k = inR {n > 0) which can be written as the 
partition function for the dual radius R = 1/R. 



5.3 Toda integrable structure 

The Fredholm determinant (5.4) with non-homogeneous measure (5.3) can be rep- 
resented as the Fock expectation value in a theory of chiral fermions defined on the 
unit circle [13]. The deformations by winding modes are introduced as Bogolyubov 
transformations of the left and right fermionic vacua and the partition function 
was identified as a r-function of the Toda Lattice hierarchy. 




Vol. 4, 2003 Intégrable Flows in c = 1 String Theory 



S843 



As a consequence, the free energy satisfies an infinite hierarchy of PDE with 
the Toda ‘times’ tn- The first one is the Toda equation dual to the equation (4.29) 

i ^ ^ 6(u) = (5 o) 

dtidt-i^ ^ ^ 

where 

0(/i) = 2sin(a^/2)J’(/i,iî). (5.10) 

Using the scaling relation given in the Appendix, one can reduce (5.9) to an ordi- 
nary differential equation, which should be solved with initial condition (5.8). In 
the quasi-classical (genus zero) limit, this differential equation can be integrated 
to an algebraic equation for the string susceptibility uq = 

= 1. (5.11) 

The one- and two-point correlators were calculated by Alexandrov and Kazakov 
from the higher equations of the Toda hierarchy [16]. These results were later 
confirmed in [17] using the Lax formalism of the constrained Toda system. 



5.4 Lax operators and string equation 

We will consider the potential U (z) in the measure (5.3) as the value on the unit 
circle zz = 1 of the potential 

Ù{z,z) = + L„z") (5.12) 

n>0 

defined in the whole complex plane. Let us assume that the spectral variables 2 ; 
and 2 are represented (in some sense) by a pair of Lax operators of the form 



1 £ 2 *^^ 



L = e LJ I 1 

\ ^>1 / 



k>l 



(5.13) 



where the phase (f) is related to the free energy by 

4>(fi) = + i/2) - T{n - i/2)]. (5.14) 

In order to find the constraint satisfied by L and L, we consider the non- 
perturbed theory, for which the expressions in the parentheses are equal to one. 
The functional equation (3.19) means that the bare phase satisfies 



+ i/R) -(I)q{ii- i/R) = -ilogfi. 



(5.15) 
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In the case of no perturbation, this is equivalent to the algebraic relations 
and 

= (5.17) 

The second identity is invariant with respect to the dressing procedure and there- 
fore are satisfied by the the general Lax operators (5.13). This is the string equation 
for the constrained Toda system, 

5.5 Quasiclassical limit and relation to the conformal map problem 

In the dispersionless limit (i ^ oo the two Lax operators define a smooth closed 
curve 7 in the complex plane, whose equation is written in a parametric form as 

2 : = L(cj), ^ = L{cj) (|a;| = 1). (5.18) 

For example, in the case when only ti and t_i are nonzero, the explicit form of 
the curve 7 is 

2 == uj {1 Lü~^)^ 

z = (1 + o;)^. (5.19) 

Assuming that the couplings in and t-n are complex conjugate, the map 
(5.18) can be extended to a conformal map from the exterior of the unit disk 
u < 1 to a connected domain T> with the topology of a disk containing the point 
^ = 00 and bounded by the curve 7 . The couplings (n ^ 0) and /j, can be 
thought of as a set coordinates in the space of closed curves. The relation between 
the conformal maps and the dispersionless Toda hierarchy was studied recently in 
a series of papers [23, 24, 25]. 

6 Conclusion 

In these notes we explained how the the perturbations of the compactified c = 1 
string theory by momentum or winding modes are described by integrable defor- 
mations of a gauged matrix model on a circle. The momentum and winding modes 
are associated with the collective excitations of the matter and gauge fields. The 
deformed system is described by a constrained Toda Lattice hierarchy. The Lax for- 
malism for the Toda lattice hierarchy allows to calculate explicitly the free energy 
and the correlation functions of the electric or magnetic operators for any genus. 
In particular, the partition function is a r- function of the Toda lattice hierarchy. 

The integrability takes place only in the grand canonical ensemble, in which 
the size is a dynamical variable and the string interaction constant is controlled 
by the chemical potential fi. The partition function is a Fredholm determinant. 
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and not a usual determinant as in the case of the open matrix chains for the c < 1 
string theories. 

It is not likely that the integrability is conserved for perturbations with both 
moment and winding modes. Nevertheless, the calculation of the correlation func- 
tions of the momentum modes in presence of a perturbation modes (or vice-versa) 
seem to be performable albeit very difficult. This calculation might help to check 
the hypothesis (related to the FZZ conjecture [10]) that a strong perturbation by 
winding modes can lead to a curved background with Euclidean horizon [13]. 

Another problem, also related to the [10] conjecture, is to find the limit 
12 1 tit -I 0 of the matrix model, which is relevant to the sine-Liouville string 
theory. This limit seems to be subtle because at small ^ the non-perturbative 
effects can enter into the game. In any case, the j 2 ^ 0 limit of the correlators 
of the matrix model does not seem to reproduce the results obtained in the Sine- 
Liouville theory [10, 26]. 

Finally, it would be very interesting to understand the origin of the integra- 
bility from the point of view of the world-sheet string theory. 
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A Momentum and winding modes in the 
c — 1 Euclidean string theory 

In this section we will recall briefly the world-sheet description of the c = 1 Eu- 
clidean string theory with compact target space. The elementary excitations in this 
theory are closed surfaces, or string world sheets, embedded in a circle of radius R. 
An embedded surface is defined by metric tensor gab{o') = 1,2) and the po- 
sition x{ct) in the target space as functions of the local coordinates a = 

The free energy of the string theory is given by the functional integral over all 
connected surfaces 



= J VgabVx (A.l) 

with a weight given by the the Polyakov action 

S{9ab,x)=^ I (fa^/ àelg[g^^ daxdhx -h 47 T/u + log (A. 2) 

world sheet 

where R is the local curvature associated with the metric gab’ The parameter g 
is the cosmological constant, coupled to the area of the world sheet, and h is the 
string coupling constant, which is associated with the processes of splitting and 
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joining of closed strings. By the Euler theorem the global curvature is related to 
the genus h of the world sheet as 

~ j (fa^/àëïg = 2-2h (A.3) 

and one can write the free energy as a series 

= (A.4) 

h>0 



In the conformal gauge Çab = the conformal factor 0 becomes a 

dynamical field due to the conformal anomaly, and the world-sheet action becomes 
essentially a c = 1 conformal field theory coupled to a Liouville field 

«S = — J (fcr [{dxY -h {d(j)Y + -h (log h -h Q(j)) H- ghosts] . (A. 5) 

world sheet 



The background charge Q and the exponent b of the Liouville field are determined 
by the requirement that the total conformal anomaly vanishes 



^matter T ^Liouville ^ghosts — 1 T (1 + 6Q ) 26 — 0 (A. 6) 

and that the perturbation due to the cosmological term is marginal 



b{2Q - b) 
4 



- 1 . 



(A.7) 



These two conditions give 

Q = -2, b = -2. (A.8) 

(The value Q = 2 does not lead to a sensible classical limit.) The invariance of 
the action with respect to shifts (f) ^ (j) (f)Q implies that the free energy depends 
on fi and h through the dimensionless combination which is the statement of 
the double scaling limit in the c = 1 string theory. Therefore we are free to choose 
h—\ and write the genus expansion (A.4) as an expansion in 1///^. 

The primary operators associated with the matter field x(a) are the vertex 
operators Ve{(j) ~ e~*e 2 :(cr) Kosterlitz-Thouless vortices t^(o-). The oper- 

ator Vm{o') is associated with a discontinuity 27rm of the field x around the point a 
on the world sheet. We call e and m electric and magnetic charges, in analogy with 
the Coulomb gas on the plane. The electric and magnetic charges should satisfy 
the Dirac condition em = 27tx integer. Prom the point of view of the compactified 
string theory observables, the electric and magnetic charges are the momentum 
and winding numbers, correspondingly. The spectrum of the electric and magnetic 
charges for a periodic target space x -h 2nR = x is 



e = n/R, m = nR (n € Z). 



(A.9) 
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If we write the position field x as a sum of a holomorphic and antiholomor- 
phic parts, x = xr -\- then the Kosterlitz-Thouless vortices are described by 
the vertex operators for the dual field x = xr — xl^ whose target space is the circle 
of radius l/R. When integrated over the world sheet, these operators should be 
accompanied by nontrivial Liouville factors compensating their anomalous dimen- 
sions. The integrated operators have the form 

~ y (A.IO) 

The Liouville exponents are determined by the condition that the integrands are 
densities. 

We are interested in deformations of the string theory obtained by allowing 
electric charges e = n/R with fugacities tn and magnetic charges m = nR with 
fugacities This is achieved by adding to the action (A. 5) the perturbation term 

55 = J2(^nV„/R + inVnR). (A.ll) 

n^O 



The translational invariance of the functional measure V(f> yields the following 
Word identity for the free energy 



-2h 



dJ^ 

dh 



2/i 



dfi 



+ (A.12) 



nT^O 



n.7^0 



This means that the couplings tn, tn and the string coupling h scale with respect 
to the cosmological coupling // 

(A.13) 



The T-duality symmetry of the original theory 

a: X, R—^l/R, fi^ji/R (A. 14) 

holds for the perturbed theory if one also exchanges the couplings as tn ^ tn (up 
to a rescaling by an i^-dependent factor) . 

In the perturbation (A.ll) we should retain only the relevant charges \e\ < 
2/R and \m\ < 2R. However, the correlation functions of a finite number of irrel- 
evant operators are perfectly meaningful. The interesting phase of the deformed 
theory, which can be thought of as Coulomb gas coupled to quantum gravity, is 
the plasma phase, where the fugacities of the charges are tuned so that the Debye 
length is of the order of the size of the 2D universe. 
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New Applications of Non-Hermitian Random Matrices 

A. Zabrodin 



Abstract. We discuss recently discovered links of the statistical models of normal 
random matrices to some important physical problems of pattern formation and 
to the quantum Hall effect. Specifically, the large N limit of the normal matrix 
model with a general statistical weight describes dynamics of the interface be- 
tween two incompressible fluids with different viscosities in a thin plane cell (the 
Saffman- Taylor problem). The latter appears to be mathematically equivalent to 
the growth of semiclassical 2D electronic droplets in a strong uniform magnetic field 
with localized magnetic impurities (fluxes), as the number of electrons increases. 

The equivalence is most easily seen by relating both problems to the matrix model. 

1 Introduction 

The subject of the theory of random matrices is a random matrix M distributed 
with some probability measure d/i(M). Typically, one is interested in the distribu- 
tion of eigenvalues and correlations between them as size of the matrix, TV, tends 
to infinity. 

The range of physical applications of this theory is enormously vast, with the 
role played by the matrix M being very different. In complex systems or systems 
with disorder M turns out to be a good substitute for the Hamiltonian or transfer 
matrix. Physical characteristics of the system are obtained via averaging over one 
or another ensemble of large matrices. Most extensively employed and well-studied 
are ensembles of hermitian matrices. In more recent applications to statistical 
models on random lattices and to string theory the key element is a set of graphs 
in the diagrammatic expansion of random matrix integrals while M has no physical 
meaning by itself. For different aspects of random matrix theory and related topics 
see e.g. [1, 2, 3]. 

Complex non-hermitian random matrices are employed in physics too. (A list 
of the relevant physical problems and corresponding references can be found in, 
e.g., [4].) New applications we are going to discuss are related to the distribution 
of their eigenvalues. To be specific, we consider the model of normal random ma- 
trices, i.e., such that M commutes with its hermitian conjugate, though similar 
results may hold for other ensembles. Eigenvalues of normal matrices are in gen- 
eral complex numbers. When TV becomes large, they densely fill a domain in the 
complex plane, the support of eigenvalues, with the mean density outside it being 
zero. The shape of this domain is determined by the probability measure and by 
the size of the matrix. As TV increases, the domain grows (see Fig. 1). The growth 
law is our main concern in this paper. 
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uU) - grad ip 



Figure 1: Growth of the support of eigenvalues 



For simplicity we assume that the support of eigenvalues, D, is a connected 
domain. Let the size of the matrix grow linearly in time t: N t. Then the support 
of eigenvalues grows in such a way that the normal velocity of the boundary is 

v{z) = grad (p{z ) , 2 : G dD (1) 

where (p is a function such that 

^ Aip{z) = Q zeC\D , 

< (p(z) ~ log \z\ z ^ 00 , (2) 

= 0 z E dD . 

Here A = = 4d^d z is the Laplace operator. We employ the complex 

notation z = x+iy. So, the dynamics of the boundary is governed by the function p 
which is harmonic in the exterior of D with a source at infinity, and vanishes on the 
boundary. The solution of the boundary problem (2) is unique: p(z) = \og\w{z)\, 
where w{z) is the conformal map from the exterior of the domain D onto the 
exterior of the unit circle such that 00 is mapped to 00 . Such a map exists by 
virtue of the Riemann mapping theorem. 

This dynamics is realized in various physical problems, classical and quan- 
tum. First, it is pertinent to viscous flows and other pattern formation processes 
when the normal velocity of the moving front is proportional to the gradient of 
a harmonic field (see, e.g., [5]). This mechanism is known as Darcy’s law. The 
droplet of eigenvalues behaves like an incompressible fluid with negligible viscosity 
(say water) surrounded by a viscous fluid (say oil), two fluids being confined in 
a thin plane gap (the Hele-Shaw cell). Oil is withdrawn at infinity at a constant 
rate while water is injected. In this context, the function p is identified with the 
pressure P in the viscous fluid with the opposite sign: p = —P. In water, the 
pressure can be set to 0. The condition that P = 0 on the interface amounts to 
neglecting the surface tension effects. This idealization is good until the curvature 
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of the interface becomes large. When the surface tension is small enough, the dy- 
namics becomes unstable. The moving interface develops many fingers, they split 
into new ones, and in a sufficiently long time the water droplet looks like a fractal. 
In the literature, this phenomenon is referred to as the Saffman-Taylor fingering. 
This growth process was linked to the matrix model in [6]. 

As was recently pointed out in [7] , the same growth law applies to semiclassi- 
cal dynamics of an electronic droplet confined in a plane on the lowest Landay level 
of a strong magnetic field. This suggests applications to the Quantum Hall effect. 
It turns out that the shape of the electronic droplet is sensitive to magnetic fluxes 
localized well away from it. As one changes degeneracy of the level to increase the 
number of electrons in the droplet, its shape evolves in accordance with the Darcy 
law, thus showing up fingering instabilities. This phenomenon is purely quantum. 
Like the Aharonov-Bohm effect, it is caused by quantum interference. The func- 
tion if in this case has no obvious physical interpretation. The characteristic scale 
of this phenomenon is less than that of the Saffman-Taylor fingering by a factor 
of 10^. Remarkably, the matrix model provides a unified mathematical treatment 
of the both phenomena. 

From mathematical side, it is also worth noting that calculation of certain 
expectation values and correlation functions of normal random matrices provides 
a constructive proof of some important mathematical statements in the inverse 
potential problem and the Dirichlet boundary problem proved in a different way 
in [8]. 

At last, the normal matrix model is known to be integrable. Its partition 
function is a tau- function of an integrable hierarchy of partial differential equations. 
Although we do not discuss integrability matters in this paper, let us point out 
that the above physical problems thus possess a hidden integrable structure. 

2 Partition function of normal random matrices 

A matrix M is called normal if [M, M^] =0, so that M and can be simultane- 
ously diagonalized. The model of normal random matrices was introduced in [9]. 
The integrable structure of the model is quite similar to that of the more widely 
known model of two hermitian random matrices [10] but physical interpretation 
is very different. 

The partition function is 



(3) 

J normal 

where /i is a parameter and F is a real- valued function. The measure d/j.Q is in- 
duced from the standard fiat metric on the space of all complex matrices. To 
introduce coordinates in the subspace of normal matrices, one makes use of the 
decomposition M = UZU^ofa normal matrix M, where U is a unitary matrix and 
Z = diag (zi, . . . , z/v) is the diagonal matrix of eigenvalues of the M. The measure 
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is then given by 



d/io(M) = 



A^IVol {U{N)) 



N 

1Aat(2)P 

2=1 



where <i/x^/(AT) is the Haar measure on the unitary group U{N), and Aat( 2 :) = 
Y[^yj{zi — Zj) is the Vandermonde determinant. The partition function is, therefore, 
written as the following integral over eigenvalues: 



Zn — 




( 4 ) 



(For notational simplicity we shall write V(z) instead of V{z,z).) This quantity 
has two important interpretations. 



One of them is the Coulomb gas picture [11]. Writing Zn ~ 
(Pzj where 

£= y^loglzi-2j| +h~^'^V{Zi) 




i^j i 

' ' ' 

2D Coulomb energy potential 

we see that is the partition function of the 2D Coulomb gas in the external 
potential. Another one is the Quantum Hall picture suggested in [7j. 



Quantum Hall picture. Consider spin-| electrons on the plane in a non-uniform 
magnetic field B. The Pauli hamiltonian is 

" = ^ 

where A is the vector potential. If the magnetic field is uniform, the spectrum 
consists of equidistant Landau levels, each level being highly degenerate. The low- 
est level is very special. Due to a hidden supersymmetry of the problem, it can 
be found exactly and remains highly degenerate even for arbitrary non-uniform 
field B [12]. The energy of this level equals 0 while the degeneracy equals the 
integer part of the total magnetic flux ^ = f B{z)d?z in units of the flux quantum 
$0 = 27T?i (we set e = c = 1). One-particle states on the lowest level can be found 
explicitly. In the gauge = ^dyV, Ay = —\dxV they are 

^n{z) = Pn{z) exp (-^^^ 

where B{z) = — |AV'( 2 :). Here = z'^ [terms of lower degree] are holomorphic 
polynomials of any degree which is less than the degeneracy of the level [12]. 
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Neglecting interactions between electrons, the wave function of N particles 
on the lowest level is the Jastrow determinant: n det'ipni^rn), and so 

= ( 6 ) 

coincides with the statistical weight of normal random matrices expressed through 
eigenvalues. The partition function (4) is, in this context, the normalization fac- 
tor of the N'-particle wave function: / Ili The mean density of 

electrons coincides with the expectation value of the density of eigenvalues in the 
matrix model: 

. N-l 

H d% = - M)) . 

i=l 

Similarly, multiparticle correlation functions are identified with multipoint corre- 
lation functions of densities. 

All the above relations are exact at any finite N. As h becomes small and N 
large, one approaches a semiclassical regime. However, the semiclassical properties 
of the system are quite unusual. On the one hand, the density distribution acquires 
a well-defined edge, and one can speak about a well localized electronic droplet 
which behaves like an incompressible fluid. On the other hand, in this specific 
semiclassical regime, quantum effects are by no means negligible. Quite the reverse, 
they become rather strong if not dominant. In fact there is no surprise here because 
the semiclassical limit we are speaking about is not the usual one which would 
require excitations of higher energy levels. In our “semiclassical” limit all particles 
occupy the lowest level, so the droplet as a whole remains a quantum object. 
Amusingly enough, it is this limit where one makes contact with the purely classical 
Saffman- Taylor fingering. In the next section, we analyze the corresponding large 
N limit of the matrix integral. 

3 The semiclassical (large N) limit 

The large N limit we are interested in is iV 0, 0 with hN finite and fixed. 

The expansion in N~^ is then the same as the expansion in h. 

To elaborate the limit, we represent the energy 8 (5) in the form 

J J log k — ^ J V{z)p{z)(fiz 

where p{z) = ~ Zi) is the density of eigenvalues. In the limit, one treats 

p{z) as a continuous function normalized as J p{z)d?z = N. As h 0, both terms 
in (5) are of order , and the saddle point method can be applied to perform the 
integral. 
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The saddle point condition is 08/Sp{z) = 0 which yields the integral equation 
for the mean density: 



h 






(7) 



The meaning of this equation is especially clear in the Coulomb gas interpretation. 
It states that each charge is in the equilibrium. Indeed, consider a charge at the 
point 2 . The first term in the equation is the Coulomb force caused by other charges 
in the gets while the second term is the external force. The equation just tells that 
they compensate each other. Clearly, it makes sense to impose the equilibrium 
condition only in the domain where the charges are actually present, i.e., in the 
support of eigenvalues. 

So, the equation should be satisfied in a domain D where p ^ 0. Here we 
assume that D is a connected domain. For example, in the potential V = —\z\‘^ 
the eigenvalues uniformly fill the disk of radius ^/fiN . Small perturbations of the 
potential slightly disturb the circular shape. 

It appears that in case of normal matrices the above integral equation is 
much easier to solve than the similar equation for distribution of eigenvalues of 
hermitian matrices. Indeed, on applying dz to both sides of eq. (7) we obtain 
p{z) = —^^AV (z) in D, and p(z) = 0 in C\D. The domain D itself is determined 
by the condition 

( 8 ) 

Z — z' 

which can be derived from (7) with the help of the Cauchy integral formula. The 
condition means, in other words, that D is such that the function dzV on its 
boundary is the boundary value of an analytic function in C\D. 

An important particular case is V equal to — plus a harmonic function 
which we parametrize by its Taylor coefficients tk at the origin: V = —\z\‘^-{- 
27le Then the density is constant (equal to -^) inside D and zero outside. 

The area of D is equal to hN. The shape of D is determined by the conditions 




1 

Trk 




z ^(fz = tk 



which easily follow from (8), so that irkik are harmonic moments of the domain 
complementary to Z). In case of quadratic potential the domain is an ellipse [13]. 

The integrated version of eq. (7) allows one to find the leading contribution 
to the free energy, which is given by the value of at the saddle point. Let us 
denote the latter by Fq, then 






1 1 



AV{z')dPzdPz' 



(9) 



This is the tau- function of curves introduced in [6]. The leading asymptotics of 
the partition function as h ^ 0 is therefore 
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Ôn(z) 



Figure 2: Normal displacement of the contour. 



Small variations of the potential and the growth law. If one varies the potential, 
V V ÔV, and size of the matrix, N ^ N ÔN, the support of eigenvalues 
slightly changes its shape and area. Let us examine how it goes. It is natural to 
characterize an infinitesimal change of the boundary by its normal displacement 
5n{z) at each point z, so that 6n{z) is a continuous function on the boundary 
curve (see Fig. 2). 

First we vary the potential at constant N. The shape of the support of eigen- 
values is determined by eq. (8). Its variation can be written as 



/ 



d,>8V{z')dz' 
z — z' 




ISV [z')6n{z') 
z — z^ 



\dz'\ =0 . 



( 10 ) 



It is natural to employ the ansatz AV {z)Sn{z) = dnh{z) where h is yet unknown 
function in the exterior of D such that h = 0 on the boundary, and dn means 
its normal derivative, with the normal vector pointing outward. This ansatz is 
suggested by an easy transform of the second integral into a Cauchy integral. 
Combining the two terms, we get f )) = 0 for z G B. This implies 

that the function dz(SV + h) is analytic in the exterior of D, i.e., that the function 
ÔV h is harmonic in there. Hence h{z) = SV^^{z) — 6V where, given a function 
/, we use the notation for the harmonic continuation of this function from the 
boundary to the exterior of D, Therefore, 



6n(z) 



d^{6V^{z)-SV{z)) 

AV{z) 



( 11 ) 



Similarly, to find how the domain D grows at constant V as N increases, we 
use eq. (8) again. This time the first term in (10) is zero. Noting that 



SN = - 



1 

AttÎï 



/ 



ôn{z)AV{z)\dz\ 
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it is easy to see that the solution is 

Mz) = - ( 12 ) 

where w{z) is the conformal map from C\D onto the exterior of the unit circle 
such that oo is sent to oo. Since log |ic| = 0 on the boundary, the normal derivative 
can be substituted by gradient. Therefore, at AV = const we get the Darcy law 
(1) (cf.[6]). 

Semiclassical electronic droplet in the presence of magnetic impurities. Let us 

apply the above results to the semiclassical behaviour of an electronic droplet in 
a strong magnetic field. The notion of the Quantum Hall droplet [14] implies that 
the electronic liquid is incompressible, i.e., all states at the lowest energy level 
are occupied. Therefore, we want the degeneracy of the level to be equal to N. 
This can be achieved in different ways. One of them is to assume the following 
arrangement. Let a strong uniform magnetic field Bq > 0 he applied in a large 
disk of radius Rq. The disk is surrounded by a large annulus Rq < \z\ < Ri with 
a magnetic field Bi < 0 such that the total magnetic fiux through the system is 
A^^o* Tfie magnetic field outside the largest disk \z\ < Ri vanishes. The disk is 
connected through a tunnel barrier to a large capacitor that maintains a small 
positive chemical potential slightly above the zero energy. If Bq is strong enough, 
the gap is large, and the higher levels can be neglected. In this arrangement, the 
circular droplet of N electrons is trapped at the center. Its radius is much less 
than The function V{z) for \z\ < Rq is V{z) = —^Bq\z\‘^. 

Now let us apply a non-uniform magnetic field ÔB somewhere inside the disk 
\z\ < Rq but well away from the droplet. Suppose that the nonuniform magnetic 
field does not change the total flux: f SBcPz = 0. The potential V{z) inside and 
around the droplet is modified as 

V{z) = ~ ~ J logk - z'\6B{z')(fiz' . (13) 

The second term is harmonic inside and around the droplet. One may have in mind 
thin solenoids carrying magnetic flux (“magnetic impurities”). In the case of point- 
like magnetic fluxes Qi at points a^, we have V{z) = —^Bq\z\‘^ + qi log \ z — üi\. 

In the presence of the fluxes, the shape of the droplet is no longer circular 
(Fig. 3) although the magnetic field inside the droplet and not far from it remains 
uniform and is not changed at all. In this respect this phenomenon is similar to 
the Aharonov-Bohm effect. The response of the droplet to an infinitesimal change 
of the magnetic field ÔB is described by eq. (11) in which 

SV^{z) - SV(z) = - f G{z,z')SB{z')(fz' . 

^ Jc\D 

Here G{z^ z') is the Green function of the Dirichlet boundary problem in C \ D 
normalized in such a way that G{z^ z') log \z — z'\ as z ^ z'. In fact this formula 
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a) b) 

Figure 3: a) The electronic droplet in the uniform magnetic held; b) The electronic 
droplet in the uniform magnetic held in the peresence of point-like huxes at the 
points marked by (g). 



holds for arbitrary SB, not necessarily vanishing inside the droplet. In particular, 
for small point-like fluxes Sqi at some points we have SV = Sqi log \ z — ai\, 
SB = -7T ^ . SqiS^"^^ {z - a^), and 

SV‘'{z)-6V{z) = -^G{z,ai)Sq, . 

i 

If ai is inside, G{z,üi) is set to be zero. The sum, therefore, goes over outside 
fluxes only. The fluxes inside the droplet, if any, appear to be completely screened 
and do not have any influence on its shape. 

When Bi adiabatically increases, with Bq and SB flxed, the droplet grows 
because the degeneracy of the lowest level is enlarged and new electrons enter 
the system. The growth is described by eq. (12) with AV(z) = —2Bo which is 
equivalent to the Darcy law. 

4 Conclusion 

We have analyzed the large N limit of the model of normal random matrices. It has 
been argued that as N increases, the growth of the support of complex eigenvalues 
simulates important physical phenomena: 

• Interface dynamics in viscous flows (the Saffman- Taylor Angering) in the zero 
surface tension limit 

• Semiclassical behaviour of 2D electronic droplets in the Quantum Hall regime 
The former is purely classical while the latter is purely quantum. 
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The relation to the matrix model may help to suggest a way to regularize 
singularities which usually occur in the zero surface tension limit and to obtain 
an analytically tractable formulation of the Saffman- Taylor problem with surface 
tension. 
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Thermal Aspects in Quantum Field Theory 

Jacques Bros* 



Abstract. Thermal (or “KMS”) states as well as ground states are characterized 
by analyticity properties in the (complexified) time variable. Such a characteriza- 
tion is applied to the quantum field theoretical systems on Minkowski, de Sitter 
and anti-de Sitter spacetimes. Privileged theories (or “vacua”) can be defined on 
the basis of general principles which ensure “maximal” analyticity properties of 
the correlation functions. In such theories, there exists an observer-dependent ther- 
mal interpretation of the “vacuum” which is due to the (complex) geometry. In 
Minkowski spacetime, the (non-privileged) thermal quantum field theories at ar- 
bitrary temperature are investigated for their particle aspect at asymptotic times. 
This aspect is encoded in the corresponding two-point functions through a certain 
“damping factor”, which is shown to depend on the dynamics of the interacting 
fields and suggests a possible substitute to the usual pole-particle concept in the 
thermal case. 



The first part of this talk (Sec 1-3) will be devoted to the following question: how 
does one determine the properties of stability of quantum states for relativistic 
systems which are described by local quantum fields, either in the Minkowskian 
background of flat spacetime, or more generally in certain curved spacetimes of 
simple type considered as given backgrounds for the quantum systems?. There is 
a general result of Pusz and Woronowicz [8] on quantum systems, according to 
which the class of quantum states satisfying an appropriate criterion of stability 
(called “passivity”) can be partitioned into two subclasses, namely, on the one 
hand the “ground states” and on the other hand the “thermal equilibrium states” 
or “KMS states” ; both cases are characterized by specific analyticity properties in 
the time- variable of the correlation functions of all pairs of local observables of the 
system. When one deals with a relativistic system in which the local observables 
are described in terms of quantum fields, it turns out that the “ground-state” or 
“thermal-state” interpretation of a stable state of the theory will in general de- 
pend on the motion of the local observer. While this phenomenon already occurs 
in flat spacetime with the Unruh effect [9], it seems to acquire a more general 
validity in curved spacetime by appearing as the manifestation of a certain type 
of “temporal curvature” of the world lines which is felt as a thermal effect by the 
corresponding observer. In models of spacetime equipped with a maximal symme- 
try group such as Minkowski, de Sitter and anti-de Sitter spacetimes, a specially 

* The results presented in the first part of this talk have been obtained in joint works with 
Henri Epstein, IHES- Bures-sur-Yvette (France), and Ugo MoscheUa, Universitk di Insubria, Como 
(Italy) [1-4] . Those presented in the second part have been obtained in joint works with Detlev 
Buchholz, Universitat Gottingen (Germany )[b-7]. 
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favourable circumstance occurs. In fact, one can then define classes of privileged 
theories (Sec 3) whose stability properties are encoded in appropriate global an- 
alyticity properties of the correlation functions with respect to the complexified 
spacetime variables. In particular, the two-point functions of the fields have “max- 
imal analyticity properties” which can be expressed through a Kàllèn-Lehmann- 
type integral representation. In such theories, there exists a stable state invariant 
under the full symmetry group which may legitimately be called a “generalized 
vacuum”, the word “vacuum” being used here in the mathematical sense of the 
GNS-construction [10], as explained below. The global analyticity properties play 
the role of a “generalized relativistic spectral condition” , although such a term may 
be misleading: in fact, it is shown that for all these theories, there exists a complete 
observer- dependent interpretation of the “vacuum” either as a ground state or as 
a thermal equilibrium state (the former being never satisfied in the de Sitter case) : 
this interpretation rigorously results from the combination of analyticity with the 
(complex) geometry of the spacetime manifold. 

The second part of this talk (see Sec 4) deals with the description of particles 
created by a quantum field in a thermal background in Minkowski spacetime. We 
shall in particular propose an answer to the following question: what becomes of the 
asymptotic particle aspect of quantum fields and of the characterization of particles 
in terms of poles of the two-point functions, if one combines the basic principles 
of relativistic quantum field theory with the general KMS condition expressed in 
the Lorentz frame of the thermal bath? We shall summarize here the results of 
[7], in which it is shown how the asymptotic particle aspect of the theory can be 
determined in a specific way by the dynamics of the interacting fields through a 
certain damping factor occurring in a Kallen-Lehmann-type representation of the 
two-point function. 

1 Analyticity in complex time as a criterion for stability: 
ground states and KMS-states 

Imaginary time formalism is often presented in the physical literature as being 
dictated by considerations of convenience: convenience of the use of the Euclidean 
metric in place of the Minkowskian metric of real space, convenience of the com- 
pactness of the integration interval [0, P] (/3 being the inverse temperature) in the 
study of quantum statistical systems. 

Here we first wish to emphasize that the analyticity properties of correlation 
functions with respect to a complexified time variable are actually the necessary 
manifestation of a basic principle of stability for the states of the quantum system 
under consideration [8]. 

Let A be the algebra of all local observables A of a given quantum system. 
We adopt the Heisenberg picture, in which the evolution of the system is described 
by a group of “time automorphisms” at acting on these observables: A at{A) 
{ao{A) = A). Each stable state cu generates a Hilbertian representation of the 
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algebra of observables of the system, in which it appears as a set of expectation 
values id (A) =< A satisfying the condition that uj{at{A)) =< A{t) >uj is in- 
dependent of t. Here we have identified the abstract algebraic notation in which 
the state uj is seen as the action cj{A) of a positive linear functional on the al- 
gebra of observables, with the usual Hilbert space notation < A >^j in which A 
(resp. A{t)) is meant as the operator representative of the observable A (resp. 
at (A)). The fact that any state u (including the thermal equilibrium states) can 
be represented as a distinguished vector of a certain Hilbert space Huj, which 
is spanned by the action of the whole algebra of observables on that particular 
vector corresponds to a standard mathematical construction, called the Gelfand- 
Naimark-Segal (GNS) construction (see e.g. [10] and references therein); the vector 
is then called a “GNS- vacuum” (see also the remark below). In the Hilbert 
space Tico, the automorphisms at are represented by unitary operators 6G(t) such 
that A{t) = Uuj{t)AUuj{t)~^ . 

We stress the conceptual importance of the “passivity’’ criterion of stable 
states introduced in [8] in terms of the reaction of the system in these states to ex- 
ternal perturbations. According to [8] this criterion then allows one to distinguish 
two types of stable states cd, namely the “ground states” which satisfy a condition 
of energy boundedness from below or spectral condition^ and the “thermal equilib- 
rium states” which satisfy the so-called KMS-condition [11]. It is remarkable that 
these two types of states are characterized by specific analyticity properties with 
respect to the time variable t of all the two-point correlation functions of the form 
WAsit) = (jj{at{A)B) =< A{t) B and = u{Bat{A)) =< B A{t) 

(A, B) denoting any pair of local observables of the system. 

In both cases, there exists for each pair (A, B) two analytic functions yV^Q{t)^ 
defined respectively below and above the real axis in the complex t— plane, such 
that for all real values of t, Wab(^) = lim (t — zry) and = \imW^^{t-\- 

ir]) for T] tending to zero with positive values. However the following difference 
holds: 

a) Ground states: WÂb(^) respectively holomorphic in the 

half-planes ^mt < 0 and ^mt > 0. 

b) Thermal equilibrium states of temperature 

respectively holomorphic in the strips = {t; —fd < ^mt < 0} and = 
{t; 0 > > 0}, and moreover the following KMS periodicity condition 

holds: WAB(t) = W^^(t-\-i0) (implying also W'^^{t) = and for 

every J in = W^^{t - iP)). 

Let the commutator functions CAB{i) =< [A{t), B] >u= WAB{i) — 
introduced (for each pair (A, H)). Then in the Fourier conjugate variable of t, 
namely the energy variable E, the previous properties can be equivalently ex- 
pressed as follows in terms of the Fourier transforms Wab{B)^ VF^^(jF), Cab{E) 

of WAB{t),W'^B{i) ^nd CAsit): 
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a) Ground states: 

Wab{E) = 0{E) Cab{E), W'ab{E) = -0{-E) Cab{E); 

b) Thermal equilibrium states of temperature 

Cab{E), W'ab{E) = Cab{E). 



The two cases a) and b) correspond to two different splitting procedures for 
Cab{E)^ the former being a sharp support splitting (expressing energy positiv- 
ity), while the latter is a smooth support splitting with exponential tails specified 
by the Bose-Einstein factor [1 - 

Remark The KMS condition of the case b) was originally derived for the case of 
statistical systems in a box, by using the standard Hilbertian formalism in terms 
of the Hamiltonian H and of the density matrix for representing the action 

of the thermal state cj, namely 



co0{A) = 



Tre p A 



The validity of the same analytic structure for the general case of infinite systems 
was then established in [11]: in the new GNS Hilbert space Tiuj associated with 
LUp, this state is no longer represented as a mixture (or density matrix) but as a 
vector state, namely the GNS- vacuum >^. 



2 Stability and analyticity in time variables 
for relativistic quantum fields 

We adopt the viewpoint of a general field theory described in terms of one (or sev- 
eral) Wightman-type quantum field ^(x). The basic local observables of the theory 
are then linear combinations of field monomials of the form J dxi . . .dxn ^{xi) 

. . . ^(xn) f{xi , . . . , Xn), such integrals being understood as the action of operator- 
valued distributions on smooth test-functions / with compact support. 

If the background spacetime is Minkowski spacetime and t denotes the time 
coordinate along a time-axis with unit vector e, the general stability criterion 
described in Sec.l then leads us to consider all the pairs of correlation functions 
or “Wightman functions” (understood in the sense of distributions) of the form 



Wmn{t) =< ^{Xi $(Xm + te)^{Xm+i) . . . ^{Xm+n) >co 



and 

^mn(t) =< ^(a;m+i) • • • #(a:Tn+„)$(a:i +te)... $(a;„ + te) 

where the state (characterized mathematically as a GNS-vacuum for the rep- 
resentation of the field algebra, as explained above) is stable, namely invariant 
under the translations of time along the e— axis. 
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The fact that this state is a ground state or a KMS-state is then specified by 
the corresponding analyticity property with respect to the complexified variable t 
of the previous pairs of Wightman functions, as described in Sec.l. 

More generally, one can consider families [7] of world- lines 7 parametrized 
by a proper-time parameter t and which are orbits of a certain (one-parameter) 
isometry group T of the underlying spacetime X, Denoting by x = the 

parametrization of each trajectory 7 in [7] and putting for simplicity ^j(t) = 
one is led similarly to characterize the stability properties of a given 
field theory on X with respect to the family of orbits [7] of the ‘‘evolution group’’ 
r by the analyticity properties of the pairs of Wightman functions 

Wmn(t) =< . . .$7™(0^7„.+ i( 0) • • -^7™+n(0) >u; 

and 

=< ( 0 ) . . . ( 0 )$.,, (t)... it) 

in the corresponding complexified variable t. 

We emphasize that the “ground state” (or “zero-temperature”) character or 
the “thermal” character (at temperature T = P~^) of the state >uj, corresponding 
respectively to the analyticity of Wmn{t) and W^^(t) in half-planes or in periodic 
KMS-strips (with width P) of the t— plane, is now observer dependent, since rela- 
tive to the family of world-lines [7]. Such a family may indeed cover possibly only 
a part of the spacetime X, which corresponds to the existence of an “horizon” 
for the corresponding observers. For these observers, the corresponding energy in- 
terpretation of the state is done in the Fourier conjugate variable E of t, which 
is relative to the “evolution group” F considered and generally does not have a 
global meaning with respect to X. 

3 Privileged Quantum field theories in Minkowski, 
de Sitter and anti-de Sitter spacetimes 

Of course, it is always possible to complexify the proper-time variable t of any given 
world- line of any spacetime manifold X, and if families of world- lines associated 
with an isometry group F of X exist, which is not always the case, one can introduce 
the previous notions of stability relatively to F with arbitrary temperature (positive 
or equal to zero): these notions rely on a partial complexification in one variable 
of the manifold X. 

Such complexification of time variables becomes particularly interesting if 
the manifold X is analytic, namely if one can speak of global complexifications of 
X in all the variables; in such a case, the complexification of the proper- time of a 
world-line 7 may define a corresponding “complex world-line” as a curve in a 
complex manifold X^^^ which is a global complexification of X. 

We shall consider three simple models of spacetime X which admit a global 
complexification X^^^: Minkowski, de Sitter and anti-de Sitter spacetimes. They 
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also have in common to admit a global symmetry group G, which extends to 
a complex symmetry group of For Minkowski, this is the (restricted) 
Poincaré group acting on For de Sitter, which can be represented as the 

one-sheeted hyperboloid with equation = Xq — xf — x| — Xg — X4 = —R^ 
embedded in the Minkowskian space R^, the symmetry group G is S'Oo(l, 4 ). 
For anti-de Sitter, which can be represented by the quadric with equation x^ = 
Xq — xf — x| — X3 + X4 = embedded in R^, the symmetry group G is the 
pseudo-orthogonal group SOq{ 2 , 3 ). R can be called the radius of the correspond- 
ing de Sitter or anti-de Sitter spacetime. These three spacetimes admit classes of 
privileged quantum field theories (QFT) , and corresponding GNS- vacua, whose 
definition (given below) will benefit from the global symmetry group and from the 
global complexified structure of these spacetimes. As a matter of fact, the correla- 
tion functions of such privileged QFT will satisfy properties of covariance under G 
and analyticity domains which are the analogs of those expressing the relativistic 
spectral condition in the Minkowski case. 

Then for the spacetimes X which we consider, there are various “evolution 
isometry groups” F, which all appear as subgroups of the global symmetry group G. 
Each one-parameter group F admits a complexified group F^^^ , which is a subgroup 
of G^^^\ and whose orbits will be complex world-lines 7^^^ in the corresponding 
manifold X^^'\ The following typical situations will occur. 

3.1 Two simple types of complexified orbits 7 ^^^^ 
for the ‘‘evolution groups” 

i) “straightlines” or “circles”: the complex orbits 7*^^^ are (topologically) iso- 
morphic to Ct or to 2^^ 

ii) “hyperbolae” : the complex orbits 7^^^ are (topologically) isomorphic to 2^ipz 
(In the latter, t always denotes the proper- time variable) 

The simplest example is provided by the complexified Minkowski spacetime 
= in which one can distinguish: 

a) All the orbits of (complexified) time-translation groups Fe^\ e being any unit 
timelike vector in the forward cone namely all straightlines of the type 5 : z = 
te b {b being any real vector); they correspond to all possible uniform motions 
in the kinematics of special relativity. 

b) Classes of orbits of complexified one-parameter Lorentz subgroups, such as 
the group Fq'JJ = 50 o(l,l)^^^ of pure Lorentz transformations in the coordi- 
nates (zq,zi); these orbits are the complexified timelike hyperbolae : zq = 
psinh^, zi = pcosh^, with p,Z2,zs constant and real. Each of these complex 
curves contains two disjoint real world-lines, which are the corresponding two real 
branches hp, h'^ of the complex hyperbola h^pK They are respectively represented 
in the t— plane by the real axis (mod. 27 rip) and the parallel to the real axis 
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passing at the point Trip but with negative orientation (mod. 27rip); hp and h'p 
are interpreted as world-lines of uniformly accelerated motions with acceleration 

1 (c) 

a = The complex curve hp contains a “circle” zq = ip sin-, z\ = pcos^, 
(obtained for t = ir and contained in the imaginary- time Euclidean spacetime 
of this circle will introduce a 27rip— periodicity , which the corresponding 

uniformly accelerated observer living on hp (or on h'^) may interpret in a thermal 
way (namely as a KMS-type periodicity with the temperature T = ^) provided 
its local field observables are in a GNS-vacuum state satisfying the corresponding 
global analyticity property in the complex manifolds h^p\ As seen below, this will 
indeed be satisfied by all the privileged QFT’s on and correspond to what is 
known as the “Unruh effect” [9], but one must not confuse the geometrical period- 
icity of the complex world lines h^p"^ with the corresponding KMS property of the 
QFT’s which will benefit from that periodicity. 

In de Sitter spacetime the situation is as follows. All the isometry groups 
r which admit families of time-like orbits interpreted either as inertial or uni- 
formly accelerated motions are subgroups of pure Lorentz transformations in G = 
50o(l,4). All the corresponding orbits are world-lines of type ii), i.e. hyperbo- 
lae; for such a given group, they are obtained by taking all the sections of X 
by a corresponding family of parallel (timelike) two-planes in the Minkowskian 
space R^, and only one of these sections, namely the “meridian hyperbola” in 
the two-plane containing the origin, corresponds to a pair of inertial motions (or 
geodesics). To be specific, it is sufficient to consider the complexified orbits of 
the group Fq^| = 500(1, 1)^^^ of pure Lorentz transformations in the coordinates 
(similar to those of Minkowski space) of the form: : zq = psinh z\ = 

pcosh^, ( 22 , 2 : 3 , 2 : 4 ) = — p^)ie, with e and p constant and real, = 1 and 

9 < p < R. On the corresponding real world-lines hp^e^ ffie acceleration is 




In anti-de Sitter spacetime the situation is more diversified. The isometry 
groups r which admit families of time-like orbits interpreted either as inertial or 
uniformly accelerated motions are of three types, and the corresponding orbits 
are sections of X by two-planes of which present the three possible shapes of 
conical sections. 

a) All the subgroups which are conjugates with respect to 50 q( 2, 3) of the rotation 
group SOq2 in the coordinates ( 20 , 2 : 4 ) give rise to complexified orbits of type i), 
isomorphic to fhe corresponding world-lines are in fact circles (or ellipses), 

which are interpreted as uniformly accelerated motions with acceleration a such 
that 0 < a < ^ , the only geodesics being the “meridian circles” such as 2 g -f- 2 | = 
7 ^ 2 ^ = 22 = 23 = 0. It is to be noted there that the replacement of the anti- 

de Sitter spacetime by its universal covering X suppresses the “unphysical time- 
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periodicity” of the previous circular world-lines: in fact, it replaces them by their 
universal covering R, and correspondingly it replaces the complexified orbits 
by Cf 

b) All the horocyclic subgroups which are conjugates with respect to SOq{ 2,3) of 
the subgroup acting in the two-planes parallel to 2:0+2:! = 0, 2:2 = 2:3 = 0 give rise to 
complexified orbits of type i) , isomorphic to Ct : the corresponding world- lines are 
in fact parabolae, interpreted as uniformly accelerated motions with acceleration 
equal to a = 

c) All the subgroups which are conjugates with respect to SOq{ 2, 3) of the Lorentz 

subgroup G = SOq{ 1, 1) in the coordinates (2:0, 2:1) give rise to complexified orbits 
of type ii); for this typical subgroup, these are the hyperbolae of the form : 
zq = p sinh ^ , zi = p cosh ^ , (2:2 , 2:3 , 2:4 ) := + 7^^ ) i e, with e and p constant and 

real, = I and p > 0. The corresponding world-lines are (as in the 

Minkowski and de Sitter cases) branches of hyperbolae h'^ now interpreted 

as uniformly accelerated motions with acceleration equal to a = ^ • 



3.2 Definition of the privileged scalar quantum field theories 

In Minkowski spacetime, the “privileged quantum field theories” are supposed to 
satisfy the Wightman axioms[12] (or generalized versions of them, such as those of 
the Jaffe fields[13] including the possibility of wilder short-distance singularities). 
In de Sitter and anti-de Sitter spacetime, it is possible to introduce similar classes 
of QFT’s [1,3,4], which allows us to give here a brief unified presentation for the 
three spacetimes; we use the general notations X and for anyone of them. 
We restrict ourselves for simplicity to the case of a single scalar field. 

A) Postulates on the held ^{x) 

A basic postulate which is of general nature in QFT is the quantum formu- 
lation of Einstein causality^ called local commutativity: the commutator [4>(xi), 
4>(a:2)] is supposed to vanish (as an operator- valued distribution) in the region of 
A X A in which x\ and X2 are spacelike separated. In all three cases, this region 
can be defined by the condition (xi — X2)^ < 0; for the dS and AdS cases, the 
latter is understood in terms of the quadratic form of the corresponding ambient 
space (the quadric A being defined by x^ = ±R^ in that space) . 

For a scalar field, the covariance assumption can be written for all three cases 
as: U {g)^{x)U {g)~^ = where U{g) denotes a unitary representation of the 

global symmetry group G in the Hilbert space of states. 

Massive free fields satisfy the Klein-Gordon equation {Dx + /i^)4> = 0, where 
Dx denotes the Laplace-Beltrami-type operator on the corresponding spacetime 
A. For the dS and AdS cases, this is the trace on A of the G— invariant Laplacian 
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(or D’ Alembert ian) in the corresponding ambient space R^. Additional specifica- 
tions on the range of the mass variable yu are given below. 

B) Postulates on the vacuum state Q 

The vacuum state ft is supposed to be invariant under the group G . 

The n— point Wightman functions Wn{xi, . . . ,Xn) =< ^( 3 : 1 ) . . . > 

of the field ^ are supposed to satisfy specific global analyticity properties in the 
complexified spacetime variables, which can be called generalized spectral condi- 
tion. This condition requires that for n > 2, each distribution Wn is the boundary 
value of a holomorphic function Wn{zi , . . . , Zn), defined in a certain ^‘tuboid do- 
main” Tn. We now give a precise definition of the latter for the three cases, together 
with the corresponding interpretation 

a) Minkowski spacetime M 4 : We postulate the usual relativistic spectral condition, 

which is equivalent [12] to the analyticity of the function Wn{zi , . . . , Zn) in the tube 
Tn{M 4 ) defined by the conditions ^ 7712:^+1 — ^mzj 6 1 < j < n — 1. 

b) de Sitter spacetime: The causal structure in X is inherited from that of the 

ambient Minkowskian space R^, namely the light-cone emerging from a point x in 
X is the intersection of the Minkowskian light-cone emerging from x in R^ with 
X. Correspondingly, one also shows that there exists complex “tuboid domains” 
bordering the whole real spacetime X, which are of the form T+ = T~^ and 

T~ = T~ nX^^\ where and T~ are respectively the “forward” and “backward 
tubes” in the complexified Minkowskian space C^, defined by the condition ^mz G 

or —^mz G . The following generalized spectral condition is then suggested 
[1,3]: for each n, the function Wn{z\^ . . . ^Zn) is analytic in a domain 7^, which is 
the intersection by X^"^^ x • • • x XC) of the five-dimensional Minkowskian tube 
domain Tn{M^). 

c) anti-de Sitter spacetime: A genuine spectral condition can be formulated in 
terms of all isometry groups F of rotation type (listed under a) in the paragraph 
of subsection 3-1 devoted to AdS), whose complex orbits can be seen to generate a 
union of tuboid domains T~^ and T~ in X^^\ defined by the condition (^mz)^ > 0. 
The corresponding formulation of the generalized spectral condition in terms of 
analyticity domains of the functions Wn{z\^ . . . ^ z^) is then also feasible [4], the 
domains 7^ of x • • • x being then defined in terms of the tuboids T+ 
and T~ . 

3.3 Two-point functions and thermal properties 
of the privileged field theories 

By exploiting the generalized spectral condition together with the covariance of 
the fields and of the vacuum state, one can show that the two-point functions 
^ 2 ( 21 , 2 : 2 ) of all privileged QFT’s enjoy a maximal analytic structure. What is 
meant here is that the following properties, known as old basic results for the 
Minkowski case [12] are also satisfied for the dS and AdS cases [1-4]. 
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i) Analyticity in a maximal cut-domain D of x which is invariant under 

D is the set of all pairs {z\,Z2) such that the G invariant variable C = 
(zi — z^)^ belongs to the cut-plane IT = C \ R"*’. In all cases, the restriction of the 
cut C > 0 to the reals is the set of all pairs (xi, X2) which are timelike-separated on 
X. Concerning the AdS case, this result actually holds for the two-point functions 
of QFT’s on the covering of AdS, while for those on the “true” AdS itself, the cut 
in the C^— plane reduces to the interval o<C< AB?, In all cases, each two-point 
function W2 (2:1, 2^2) of a scalar field is thus represented by a function of a single 
complex variable w{C,) holomorphic in the cut-plane II. 

ii) Privileged free fields: The two-point functions of free fields (with relevant mass 
ranges) are completely determined so as to satisfy the previous maximal analyticity 
property. They are shown to be proportional to Legendre-type functions P\ or Q\ 
of the —invariant) scalar product ^ ^ (linearly related to (). In the latter, 
the subscript A determines the squared mass of the corresponding free field. 
While the first-kind functions P\ are obtained for the dS case, the second-kind 
functions Q\ are obtained for the covering of AdS, but only integral values of A 
occur for the “true” AdS (which corresponds to the above peculiarity of the cut 
in ( and is closely related to the real-time periodicity of such theories). 

iii) Kàllèn-Lehmann-type decompositions for general privileged QFT‘s. The pos- 
sibility of representing the two-point functions of general (interacting) privileged 
QFT’s on X as linear superpositions over a certain mass range (with a weight 
which is a positive measure) of free-field two-point functions necesitates a more 
refined analysis. For the dS case, one can distinguish two disjoint bases of free 
fields (i.e. of Legendre functions P\), whose sets of masses correspond respectively 
to the labelling of the principal series and of the complementary series of unitary 
irreducible representations of G. Fields whose two-point functions can be expanded 
on these two disjoint bases differ from each other by their asymptotic properties at 
large times on de Sitter spacetime. For the AdS case, there exists correspondingly 
a general decomposition in terms of functions Q\, which selects the relevant subset 
of integral values of A when one deals with QFT’s on the “true” AdS. 

Thermal properties 

For all privileged QFT’s on Minkowski, de Sitter and anti-de Sitter space- 
times, a KMS-condition is satisfied in all the families of world- lines contained in 
complex hyperbolae hp^^\ as described above in subsec 3 - 1 . The corresponding 
temperature, which is of purely geometric origin, is T = where p 

is the radius of the hyperbola. It is therefore related to the acceleration of the 
corresponding observers through the formulae specified in 3-1 for each case. This 
property , which can be called a generalized Unruh effect deserves the following 
additional comments. 

For the case of de Sitter spacetime, the thermal interpretation covers all 
possible timelike orbits of one-parameter subgroups of G, including those of the 
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inertial motions (namely the meridian hyperbolae of X), for which the associated 
temperature is minimal and equal to 2 ^- 

For the case of anti-de Sitter spacetime (or its covering), the thermal inter- 
pretation is limited to accelerated motions whose acceleration a is larger than 

with a temperature ^ ^ ^ • For the parabolic trajectories, which cor- 

respond to the acceleration a = ^ and to the zero temperature limit, it can be 
shown that the corresponding energy spectrum is bounded from below: the spec- 
tral condition, postulated for all the elliptic orbits thus extends to the parabolic 
ones. 



Concerning the proof of this KMS analytic structure in all privileged QFT’s, 
one must distinguish the case of free fields and generalized free fields, from the 
general case of interacting fields. In the former case, the field theory is completely 
determined by its two-point function. So it results from the property of maximal 
analyticity of the latter that all the coresponding n— point functions (which are in 
that case combinations of products of two-point functions) satisfy maximal analyt- 
icity and geometrical periodicity in all sections of by the complex curves . 
Therefore all KMS-conditions of the type described in Sec 2 are easily obtained as 
a byproduct. In the latter case, the proof of the relevant KMS-conditions for all 
the n— point functions on the basis of the general postulates listed in 3-2 is more 
technical, since it necessitates the application of an analytic completion procedure, 
which we have given in [3,4] under the subtitle ‘‘proving the Bisognano-Wichmann 
analyticity property’^ As a matter of fact, for the case of the Unruh effect in 
Minkowski spacetime, the first general proof of the corresponding KMS-property 
for general interacting fields has been given in [14] by alternative methods based 
on operator algebras. In this connection one should also quote other presentations 
and analyses of the generalized Unruh effect in the approach of algebraic QFT, 
in particular those of [15] for de Sitter and [16] for anti-de Sitter (see also [17]). 
However, we did not pretend to give here a survey of all the investigations and 
possible viewpoints concerning the Unruh-type phenomenons. The purpose of our 
presentation (and this will be the conclusion to our first part) was to show that 
for Minkowski, dS and AdS spacetimes, there holds a unified viewpoint on the 
thermal aspects of privileged QFT‘s: it is clearly exhibited by the links between 
complex geometry and the global analyticity properties in spacetime variables of 
those (free or interacting) theories. 

4 Thermal quantum field theories in Minkowski spacetime: 
particle aspects 

In relativistic quantum field theories with a ground state, (or “theories at zero 
temperature”), also called previously “privileged QFT’s in Minkowski spacetime”, 
the particle aspect is linked with the occurrence of poles for the Green functions 
of the fields in momentum space. In particular, the “elementary particles” of the 
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field ^(:r) (if they exist) correspond to the occurrence of real poles for the two- 
point Green function in momentum space, or equivalently of discrete measures 
5{p^ — mf) in the corresponding energy- momentum spectrum. Moreover, this is 
closely related to the Haag-Ruelle construction of asymptotic free fields ^in (ar) 
and $out(a;) for the interacting field $(x). The choice of free propagators 
for starting perturbation theory is also justified by the latter facts. In thermal 
quantum field theories, all these notions must be reconsidered. 

After a brief summary of the postulates of thermal QFT, we shall give 

i) a general (nonperturbative) study of thermal two-point functions, resulting 
in a Kallen-Lehmann-type representation whose weight depends not only on 
the mass but also on the space variables x. 

ii) a related characterization of the constituent particles of the field in the ther- 
mal bath, in terms of thermal free two-point functions accompanied by x— 
dependent “damping factors” . The occurrence of such factors is equivalent to 
smoothing the usual pole singularity in momentum space 

iii) the introduction of “asymptotic generalized free fields” (AGFF) as substitutes 

to the usual free fields and ^out{x). These AGFF are associated with 

“asymptotic two-point functions” which depend through their damping factor 
both on the temperature and on the interacting field. 

Explicit nonperturbative computations of the asymptotic two-point functions in 
models such as the interaction are feasible. They allow one to give suggestions 
for the general type of singularities which might represent thermal particles in 
momentum space, as substitutes to the usual poles. 

4.1 General results on thermal two-point functions 

In the general approach of thermal quantum field theory at temperature T = 
one keeps from the relativistic framework the postulate of local commutativity 
(or Einstein causality); relativistic algebraic relations such as those which define 
the free fields are also preserved. However, the Lorentz symmetry is broken: there 
exists a privileged Lorentz frame, determined by the thermal bath corresponding 
to a choice of coordinates x = (xq,x). Time- translation invariance expresses the 
stability of thermal states and the spectral condition is replaced by the KMS- 
condition with periodicity i/3 in the complex time variable zq — xq iy^ (see Sec 
1 and 2). In the framework which we consider, the invariance (of the correlation 
n— point functions) under all space-translations and rotations is also maintained. 

The general study of the two-point function can be summarized as follows. 
Considering for simplicity a single scalar field, one introduces the correlation (or 
Wightman) functions W{x) =< 4>(x(i))4>(x(2)) >/3 (with x = a:(i) —x^ 2 )), W'{x) = 
W (— x), the commutator function C(x), the retarded and advanced functions R[x) 
and A(x), such that: 

W-W'^C = R-A, with R{x)=0{xo)C[x)] A{x) = -6{-x^)C{x) 
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and correspondingly, for the Fourier transforms in momentum space: 

W-W' = C = R-À. 

By applying the results described in Sec 1 (KMS-condition) , we can say that W and 
W' are the boundary values on the reals from the respective sides < 0 and 

> 0 of a function W{zq,x) holomorphic and ip— periodic in zq; its domain 
is the periodic cut-plane generated by the strip ~P< ^mzo < 0. Moreover the 
causality postulate implies the analyticity of W at all points zq = xo+iyo such that 
|x()| < 1^1, yo = inP (which implies the connectedness of the analyticity domain 
of W). 

Equivalently, there holds the following structural properties of the Fourier 
transforms in the complexified energy variable ko = po -h iqo (for all real momen- 
tum 



As usual, the Green functions R and A are boundary values of functions R{ko,'^ 
and A{ko^p) of the complex energy /cq = po + respectively holomorphic in 
the upper and lower half-planes. However, in contrast to the zero-temperature 
case, the relations of the latter with the Euclidean functions are “discretized” : the 
values of R{ko-,^ and A(/co,^ at the Matsubara energies ko = are equal to 
the spatial Fourier transforms of the Fourier coefficients of the Schwinger function 
W(iyo,x) (periodic in po)- 

Applying the previous structural properties to the scalar free field with mass 
m determines uniquely its thermal two-point function at temperature p~^ as being 
given by the following formula 

Wm,f3(xo,x) = j dp e{po)0{p‘^ ~ 1 _ e-/3po 

For general two-point functions (of interacting fields) at temperature P~^ , 
the full exploitation of the causality postulate implies the existence of a Kallen- 
Lehmann-type representation [6], which incorporates the whole analytic structure 
previously described and can be written as follows: 

^oo 

W{xo,x) = / dm D(3{x,m) Wm^(3{xo,x). 

Jo 

In the latter, the weight-function D^(x,m) is interpreted as a ‘^damping factor’’ 
which takes into account the dissipative propagation of the field in the thermal 
medium. In the usual Kallen-Lehmann representation of the zero-temperature case 
(which has a Lorentz-invariant form), this factor is constant with respect to x 
and reduces to an m— dependent weight p(m), while the thermal free two-point 
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function is replaced by the free positive-energy Wightman function — 

f dp 0{pa)5{p^ - 

Remark It is possible to justify [18] that (as in the zero-temperature csise) the 
function W has joint analyticity properties in all spacetime variables, as a “rem- 
nant of the relativistic spectral condition” (no longer true by itself in the thermal 
bath). This “relativistic KMS condition” (which incorporates the previous KMS- 
condition in the variable zq) states that W{z) is holomorphic in an (i/^eo— periodic) 
analyticity domain, which is generated by the “tube” {z = x iy\ y £ I/+, jSco — 
y G (where cq is the timelike unit vector (1,0)). This analyticity property is 
easily checked to be satisfied by the free-held functions Wm,i 3 - Its validity in the 
general case is also equivalent to the fact that the corresponding damping factor 
D( 3 {x,m) is holomorphic in x in the tube domain {z = x iy\ \y\ < f }■ 



4.2 Description of constituent particles 

By analogy with the zero- temperature case, we shall assume that the presence 
of a constituent particle with mass mo created by the held in the thermal bath 
is mathematically characterized by the occurrence of a discrete (5— term in the 
damping factor of the two-point function of that held, namely that one has: 

Df 3 {x,m) = Zp{x) S(m - mo) -\- 



where is smoother than 6 with respect to m. The damping function Zfs{x) 
will then represent the effect of the interaction of that constituent particle with 
the thermal bath. 

The presence of such a discrete term in Dp corresponds to the manifestation 
of an asymptotically dominant contribution at infinite real time xq in the two- 
point function Wp{x). This contribution, which is given by the distinguished term 
^ ^mo,/3(^), Gujoys the following asymptotic properties, which 
can be considered as physically satisfactory: at fixed x (i.e. at rest), it decreases as 
(the particle is not submitted to collisions from the thermal bath); along 
any direction x — vxo, it has a damped behaviour of the form Zp{vxo) x fyo |“2 
(the particle is submitted to collisions from the thermal bath). 

In energy-momentum space however, the usual “pole-particle” situation is 
no longer valid, since the Fourier transform of the previous dominant contribution 
f^^kes the form of a convolution product, namely: 



xxr(dom) 

mo,P 



{P0,P) 




g(Po)^ (Po - (p - u)‘^ - ml) 
1 - 



The effect of the latter is to wipe out the usual discrete mass shell contribution, 
and as a matter of fact, this feature is in agreement with a general theorem of 
thermal QFT [19] according to which the existence of a discrete S in the energy- 
momentum spectrum, corresponding to a particle with a sharp dispersion law, can 
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only occur in a field theory where the incoming and outgoing fields are the same, 
namely a held theory without interaction with the thermal bath. 

To the previous correlation function there corresponds the following 

retarded Green function in momentum space: 



^(dom) 



{ko,k) 




1 

ko~{k — — tUq ’ 



which yields after angular integration (assuming rotational invariance): 



^{dom) 



{ko,k) 



1 f°° 



X 



1 j g ^0 ~ - ml 

2\/W /Cq — — ttIq 



We shall see below that this formula allows one to maintain the possibility of 
analytic continuation of the retarded Green function (from the upper fco —plane 
across the reals) with the occurrence of “momentum space singularities associated 
to the constituent particle with mass mo”, which however will be generally more 
complicated than simple poles. 



4.3 Asymptotic generalized free fields: asymptotic dynamics 

The fact that no asymptotic free fields can exist in thermal QFT except if the 
theory is itself without interaction [19], together with our previous analysis of 
the asymptotically dominant behaviour of thermal two-point functions in the time 
variable have led us to introduce the concept of asymptotic generalized free held or 
asymptotic dynamics [7], by which we mean that at asymptotic times, the presence 
of the thermal bath maintains a manifestation of the dynamics of the interacting 
field, which is of course temperature-dependent and vanishes only in the limit of 
zero-temperature. 

Time-clustering assumption We are led to make an assumption on the asymptotic 
behaviour of the thermal n— point functions of the field < $(x(i)) . . . ^(x(„)) >f 3 , 
when the minimal internal time-interval A = inf{|x(j)o— ^(fc)ol ‘ j^k — 1 , . . . ,n,j ^ 
k} of the configuration (a:(i), . . . , X(y^)) tends to infinity. We assume that, as in 
the usual construction of asymptotic fields (although not in the same limiting 
procedure), this asymptotic behaviour is dominated by the terms containing the 
maximal number of two-point functions in the expansion of < ^(xi) . . . ^(x^) >/3 
in terms of its truncated (or connected) parts. In heuristic terms, this assump- 
tion means that there hold no collective memory effects in the underlying KMS 
state. (Note that in a satisfactory formulation, the exclusion of possible low-energy 
excitations must be performed at first by making use of some appropriate regu- 
larization of the field in the time- variable [7]). Moreover, the two-point functions 
are themselves supposed to be asymptotically dominated in time by a constituent- 
particle contribution lT^°^^(xo,x) of the form specified above, involving an un- 
known damping factor Zp(x). 
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So the timelike asymptotic description of an interacting field theory in a 
thermal state | >13 appears to be given by a thermal generalized free field 
entirely characterized by its two-point function ÿ{x) = Z^{x) X WmM- 
It turns out that the determination of Z( 3 {x) in a given specific field model such 
as the (hypothetic) ^ 4 — model is feasible in a simple way, on the following basis. 
At fixed X, $ 0 ( 3 : 0 ,^) behaves as \xq\~^ . Now the equation of the interacting field 
(of the form N{x) = d^d^^{x) ml^{x) + g"' {xY = 0) should only be satisfied 
in an asymptotic form by the asymptotic field ^o{x), which can be reasonably 
understood as follows. 

The field- function Nq{x) = dfj,d^^o{x) + itiq^o{x) + should be such 

that at fixed x, the product lxo |“2 x Nq{xo,x) tends to zero for \xq\ tending to 
infinity. (In writing that condition, one can give a meaning to the third power of 
this noninteracting field in a standard way). It turns out that the latter condition 
allows one to determine Zp(x) in a fully consistent way as a solution of the following 
Laplace-type equation: 

{-A + 3gk{p))Zp{x) = 0, 

where the (finite) quantity 

k{/3) = 2{27t)~^ j dpeipo)S{p^ - mo)(e'^P" - 1)“' 



tends to zero îov T — f] ^ tending to zero. One then obtains the following results: 
a) for ^ < 0 : 



V rîf'i sin(K(/?)|x|) - 5{u-k(P)) 

, Z/3{u) ^ — 






b) for ^ > 0 : 



Zgix) = Cst- 



-k{/3)\x\) 



-, Zp{u) = 



Cst 

V? -h ’ 



where k(/?) = [3\g\k{j3)]Y 



4.4 Conclusions 

The type of result which we obtain for Z( 3 {u) in the previous model indicates that 
(real or complex) singularities of the Green function {ko, k) can be generated 

by those of Zis{u) through the integral representation written above (at the end 
of 4-2). More precisely, one sees that these singularities of should always 

be localized on surfaces of toroidal shape with equation 

/cq — ( ± aY — rug = 0, 

where a can be either real or complex (in the example of 4-3, a was equal to k{P) 
for p < 0 and to iK{/3) for p > 0). Moreover these singularities should be in general 
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of logarithmic rather than polar type, and should also be accompanied by real cuts 
starting at ko = dimo- 

It is worth noticing that our proposal for characterizing the momentum space 
singularities of the particles in thermal quantum field theory is substantially dif- 
ferent from other recent proposals [20] where complex pole- approximations of the 
form (fcQ_£(^)(fco+£’*(p)j analytic) have been chosen as substitutes to 

the usual real poles as a starting point for perturbation theory. While for the stan- 
dard case of zero-temperature quantum field theory the formalism of perturbation 
theory is in agreement with all concepts and results of the general field-theoretical 
framework (discrete energy spectrum, asymptotic free fields, general form of the 
two-point functions, pole-particle concept), the situation is quite different for the 
case of thermal field theory. As shown above, all the previous concepts and re- 
sults are substantially modified (no discrete spectrum, no asymptotic free-fields 
but “generalized free-fields and asymptotic dynamics”, complex singularities of 
toroidal shape and smoother than poles to be associated with particles) : this sug- 
gests that a “reasonable” starting point for an analytic perturbation formalism in 
momentum space should already take into account the form of the interaction in 
the approximate form of the propagator. 
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Noncommutative Gravity 

Ali H. Chamseddine 



Abstract. Various approaches by the author and collaborators to define gravitational 
fluctuations associated with a noncommutative space are reviewed. 

Geometry of a noncommutative space is defined by the data {Ay H, D) where A is 
a noncommutative involutive algebra, H is a separable Hilbert space and D a self- 
adjoint operator on H referred to as Dirac operator [1]. Geometry on Riemannian 
manifolds could be recovered by specializing to the data 

A = C^(M), H = L\S), D = r{d^ + ^uj;'’yaby 

where is the spin-connection on a manifold M. To deserve the name geometry 
the operator D should satisfy certain conditions [2] . 

At present there are only few noncommutative spaces which are well under- 
stood such as the noncommutative space of the standard model, the noncommu- 
tative torus, deformed plane Rq and the noncommutative spheres 5^, 5^, S^. It 
is relatively easy to develop gauge theories on noncommutative spaces. To do this 
we first define the one-form 



p = a\b*eA, 

i 

then define an involutive representation n oî A on H such that 

7T aoda\ ■ • ■ = Ç 4 [D, 4] • ■ ■ [D, al] . 

The curvature is defined by 6 = dp and integration by 

j a = Tr^ (tt (q) lOr*^) , 

where Tr^ is the Dixmier trace and d is defined by the condition trn -hi) 
oo, V p > |. 

For gauge theories we consider a — 6‘^ . On a commutative space one gets, in 
d = Ay the action 

J éxTr 
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On the noncommutative torus [3] the triplet is taken to be (/ (A ) , I {H) ^ Z>) , where 
I is the left-twisting operator satisfying l(a.b) = /(a) * l{b) [4]. The star product 
is defined by 

f *g = f(^x + ^)g{x + ri) |ç=^=o- 

Then one gets [5] 

where A^ ^ A^, - Ai, ^ A^. 

The operator D includes the metric properties on the space. One can extract 
dynamics of fluctuations of the metric by one of two possibilities. The first is by 
using the spectral action principle which states that the physical action depends on 
the spectrum of D [6] . Good tests of this principle can be made by considering the 
standard model of particle physics and loop space of superstrings. As an example 
consider the noncommutative space of the spectral model defined by 

A = Ai<^A2, H = Z) = (g) 1 +75 

where {Ai^Hi^D\) is the triple associated with the Riemannian manifold M, 
{A 2 , H 2 , D 2 ) is the triple associated with the discrete space 

A2 = CeM2 (C)0M3 (C), 

H 2 is enumerated by quarks and leptons and D 2 contains information about the 
Yukawa couplings. The operator D satisfies the property that if G iZ then the 
fermionic action will be given by 



If = (V>, DrP) 

which includes all fermionic interaction terms. The bosonic action is then given by 

h = Tr(F(^ 



>2 

where A is a cut-off scale. At low energies, the arbitrariness in the choice of the 
function F would only reflect itself in having few measurable parameters. We can 
use heat kernel methods to evaluate the above trace. For example in d = 4 we first 

00 

write F{P) = and use the identity Tr{P~^) = ^7^ J dtP~^Tre~^^ and 

the expansion Tre~^^ ~ ^ f an {x^P) dv (x) to show that 

n>0 M 

h = /-2«0 + fod2 + /2Û4 H , 

where the an are the Seeley-deWit coefficients [7] and 



00 00 

/_2 = j uF(u)du, fo = j F(u)du, h = ^’'(O), • • • . 
0 0 
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From the structure of the quarks and leptons denoted hy Q = {uL,dL,dfi,UR) and 
L = one can determine the discrete triple {A 2 , H 2 , D 2 ) that will give 

rise to the Higgs field. For the leptonic sector one obtains 

'y^k^H \ 

^ \ r{D^ + igiB^) 

and a similar expression for the Dirac operator of the quarks sector. The bosonic 
action is then given by 

h = J ^aA'* + bA'^(^^R- 2y'^H^H 

+ glGl,G^>^‘' + 3^2 (H+ //)")) + O(^), 

where a, b, c are linearly related to /_ 2 , /o, /2 respectively, and y and 2 : are 
functions of the Yukawa couplings. By normalizing the kinetic energies of the gauge 
fields, one obtains a relation between the gauge coupling constants of SU (3), SU ( 2 ) 
and U{1) which is the same as that of SU{5)^ mainly that = \g\- After 

rescaling the Higgs field one gets a relation for the Higgs coupling A = Ifr pI- Using 
the fact that the dominant Yukawa coupling is that of the top quark this relation 
simplifies to A (A) = ^ 0:3 (A) . Combining this relation with the renormalization 
group equations one obtains the bound on the Higgs mass 160 Gev < m// < 200 
Gev. The unification of the couplings also implies that A ~ 10^^ Gev. The spectral 
action thus unifies gravity with gauge and Higgs interactions. 

The second possibility is to study the gravitational field of a noncommutative 
space from the structure of the spectral triple by defining the analogue of Rieman- 
nian geometry to be called noncommutative Riemannian geometry [11]. For this 
one must define connections, curvature, torsion, etc. For example if are basis 
and V a connection then we have 

^ E^, T{V)E^ = T^, 

R{SJ)E^ = -V‘^E^ =Ri® E^, 
which in component form gives 

= dE^ + R^^dEl^+Çï^Çfs. 

Applying these definitions to a product of a discrete two point space times a 
Riemannian manifold M one finds that the basis is given by E^ — 

. / 0 

and ( ^5 and the Dirac operator by 

[ 750 \ 

V 750 7“e^£>^ J ■ 




D = 
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The noncommutative Einstein-Hilbert action is 

I={EA,RiE’^) 

= 2 J d'^x^/g {R - 2d^aduffg^'') , 

M 

where (j) = e~^ . To this we can add a cosmological constant A (1) = A f d^Xy/g and 
allow matrix algebras for the discrete space. This would give rise to gauge fields 

750i/ 

750Ft 

of the gauge fields to the action is then [8] 

I = j ^ 

M 

+ d^^ad^ag^’' 

+R-2A + d^ad^ , 

which is the same at the Randall- Sundrum model [9] of 4 -h 1 dimensional space 
with four-dimensional brane boundaries. 

Allowing the Dirac operator D to fluctuate on noncommutative spaces based 
on deformed spheres or deformed poses a challenge. The operator D is not 
arbitrary, and the interesting problem to solve is to find whether there are gravi- 
tational fluctuations. The presence of a constant background B-field for D-branes 
leads to noncommutativity of space-time coordinates which could be realized by 
deforming the algebra of functions on the world volume. There are indications 
that the gravitational action on noncommutative branes in presence of constant 
background B-field is non covariant [10]. 

Deformed gravity could be constructed byyising the spectral action if one 
knows the form of the deformed Dirac operator D. At present this is not known, 
and all one can do is to probe for possibilities. If one assumes a constant background 
B-field then the commutator of space-time coordinates gives 

where and = Sj^. In this case the vielbein would be deformed 

to the form 

el {x, 0) = e“ {x) + iO''Pel^p (*) + ••• 

the complex part is dependent on If a metric is defined by g pu = then 

the metric will be complex. We can write gpu = G pu + iBpu and then impose 
the hermiticity of gpu which implies that G pu is a symmetric tensor and Bpu is 
antisymmetric tensor. This leads to complex gravity. At the linearized level, the 
field Bpu has the correct kinetic terms, but the ghost modes present in this field 



^ . The contribution 



with curvature 6 = dp -\- , where tt (p) = 
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propagate at the non-linear level. The reason for the inconsistency is that there is 
no symmetry similar to diffeomorphism invariance associated with the field 
[12]. 

An alternative approach is to develop a gauge theory for the field e® and 
formulate gravity as a noncommutative gauge theory without using a metric. This 
can be done without much complications in four-dimensions. The procedure is 
generalizable to higher dimensions, but the analysis will be more complicated. 
The idea is to start from the gauge group f/(2,2) in four-dimensions. The gauge 
field is expanded in the form [13] 

+ e“Fa + /“r„rs + \ojfVab. 

The gauge field strength is given hy F = dA+A^ where we have defined A = A^dx^ 
and F = ^F^j^dx^ A dx^. To make the system dynamical without using a metric a 
constraint is imposed. It is given by 

pa pa5 _ Q 

fj,u ' ^ fJ,l/ 

This breaks the symmetry to SL{2C), which is the relevant symmetry for gravity. 
The constraint is solved by 

/m = <. < = <K + /;), = 

An action which is invariant under the surviving SL{2C) is given by 

I = i j Tr (FgF A F) 

= '-J d^xe'^^^'^eabcd + 8 (1 - a^) e“et) {Rf^ + 8 (l - a^) e^e^) . 

This action consists of a Gauss-Bonnet topological term, an Einstein term and a 
cosmological term, provided that a ^ 1. The advantage of this formulation is that 
it immediately generalizes to a noncommutative action as the metricjs not used 
in the construction, but is generated as a function of Let A = Aj^^Tjdx^ be 
the gauge field over the noncommutative sp^ce where Ti are the group generators. 
The curvature is then given by F = dA -h A * A. Notice that we can write 

[T,,Tj] + ÀI *a M {T,,Tj}) dx^ A dx"', 

where the symmetric and antisymmetric star products are defined to be, respec- 
tively, even and odd functions of 0. Notice that this is consistent with imposing 
the constraint 



pa I pab _ 

IIU ' 
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because these correspond to generators with an odd number of gamma matrices 
and preserve the subgroup SL{2C). The noncommutative action is then given by 

I = i j Tr (FgF * F) 

= i J (2F^, F% + tabcdF^t *s 

The constraints could be solved by using the Seiberg-Witten map [5] which enables 
us to express all the deformed fields in terms of the undeformed ones. The final 
result is [13] 

I = ij {^abcdF^lF^^i + {2e1+eZ,Fl,^ + CabcdFf^FHJ) + 0{e^) 

where is the deformation to first order in 0 of . 

This deformed action gives deviations from the Einstein-action which are 
evaluated explicitly. In this deformation there are no additional propagating de- 
grees of freedom, all new terms being functions of the vierbein e" and its deriva- 
tives. This suggests that there should exist a formulation of noncommutative grav- 
ity on deformed BJq obtained by allowing fluctuations to the Dirac operator in the 
spectral triple of this space. The nature of such extension is presently under in- 
vestigation. 

To conclude, it is clear from the above discussion that gauge theories on 
noncommutative spaces are straightforward, but to define the gravitational action 
on such spaces is more difficult. At present there are only partial answers, and 
more intensive research in this direction is needed. 
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Half- Space Large Size Discrete Velocity Models 

Henri Cornille 



Abstract. We recall recent large size ’’physical” discrete velocity models (DVMs) 
results [1] for a gas in a half-plane {z > being the other coordinate) and present 
new ones. We consider both single-gas and binary mixtures with light mass l and 
heavy mass M. For the half-space (flow of a semi-infinite expanse of gas in contact 
with its condensed phase), the interface is located at z = 0 with only a z spatial 
dependence of the gas (densities with (±x, z) are equal) and no velocities parallel 
to the x-axis (or 2 = 0). We construct ’’physical” DVMs, (only mass, energy and 
momentum along the z-axis invariants, no spurious invariants), filling all integer 
coordinates 2 7^ 0 of the plane. The main new result for mixtures, is that we present 
different M models with the same geometrical structure in the plane with sums of 
the moduli of the coordinates either odd or even for heavy or light species. 



1 Introduction 

1) Recently planar physical DVMs (only binary collisions and physical invari- 
ants) DVMs, for binary mixtures (two species ) and for single-gas, (only one 
species), tiling all the integer coordinates of the plane, were presented [1] (Cornille- 
Cercignani). This was the outcome of recent interest [2] (Bobylev-Cercignani- 
Cornille) occurring for the construction of physical DVMs with only a finite num- 
ber of velocities vi (or momenta). However, from the physical point of view, a 
defect was that these models have no constraints in the plane for the fiows. 

Here we consider both single-gas and binary mixtures half-space models [1] 
(Cornille), for which velocities parallel to the interface (x-axis) must be excluded. 
The interface is located at 2 : = 0 and the z-axis perpendicular. As in the con- 
tinuum treatment of the problem and in DVMs [3] (Sone, Cabannes, Gatignol, 
d’ Almeida, Nicodin, Cornille), the spatial dependence of the gas is assumed to be 
one-dimensional depending upon the variable z > 0, associated physical properties 
of both the half-space and the two parallel interfaces problems were studied Here 
we only construct large size physical half-space models, filling all coordinates of 
the plane. In [1] (Cornille), except for M = 2, for other mixtures M = 4, 3/2, 5, one 
species was along octagons and the other filling all empty sites. Here, we present 
new results for M = 3, 3/2, 4 where, like for M = 2, the 2 species are either 
in |x| -I- |z| odd or even. 

2) The three main difficulties of the present work are: only binary collisions, only 
physical invariants (no spurious) and half-space for the flows. 

In the past, multiple collisions (ternary, collisions of order 4,5. . . ) were very 
often used in DVMs. More recently, only binary collisions are acceptable for the 
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community. The reason is that we try, for discrete Boltzman equations, collisions 
similar to the continuous case, in particular, like here, for DVMs filling all the 
plane. Consequently in the determination of physical DVMs models, without spu- 
rious invariants, we have more difficulties. For models with spurious invariants and 
only binary collisions, very often the introduction of multiple collisions is sufficient 
to destroy the nonphysical conservation laws. On the other hand, the determina- 
tion of physical starting model (like p = 1 square for the single-gas of section2) 
with only physical conservation laws is simpler with multiple collisions. Similarly 
for the mixtures models of section 3. 

We explain the strategy and tools for the construction of physical models 
(without spurious invariants). As illustration we recall 2 examples (Cercignani- 
Bobylev[2]), for two 13^^ and two 25vi mixtures models. For the 131;^, the spurious 
invariants were explained geometrically (Cornille: momenta only along 2 perpen- 
dicular axes). For the two 25vi with a great number of common collisions, it was 
found with powerful computers, that both have spurious (not physical) invariants. 
However, starting with more simple models with llu^, 15vi ( Cornille- Cercignani 
[2]), geometrical restrictions of these 25v{ models, it was shown why one of them 
was spurious contrary to the other (containing non common collisions which elim- 
inate the spurious invariants.) Consequently, it was clear that it is better to start 
with simple physical preliminary models (easier to check for the nonexistence of 
spurious invariants) with few Vi or momenta and to try to enlarge the number 
of momenta or associated densities with geometrical tools (presented below in 3). 
For a model with q physical invariants and p independent densities, it is useful to 
start with p — O' collisions, to check whether spurious invariants exist, to investigate 
their origin and to seek if they can be eliminated with other collisions. 

For the single-gas, section 2,the preliminary model 2.1, lemma la hcis 5 inde- 
pendent (ind.) densities and 2 collisions, eqs(2a-b-c),figla. For the mixtures, fig3, 
for M = 2, section3.3, the preliminary physical model has 7 ind. densities and 3 
collisions eqs(4a-b). Similarly for the M = 3, 3/2, sections 3.4-6, the preliminary 
physical models have 7 ind. densities and 3 collisions: eqs(5a-b) and eqs(6a-b), but 
for M = 4, 3.6a, eq7a, we have 8 ind. densities and 5 collisions. 

The most important tools (1 or 2 new densities), for the extensions of physical 
models are presented below in 3.1-2-3, figs la-b, 2a-b: for one species in squares and 
rectangles with 3 known densities or densities symmetric to the bisectors z = ±x. 
Sometimes these tools are not sufficient (see below 3.4) and for new densities we 
add collisions some being linked to the old physical densities. We multiply 
the new evolution equations of the new densities by arbitrary constants. Adding 
to the old physical conservation laws, we must verify that the constants can only 
have the correct values of the new conservation laws. We apply this method for 
the mixtures models in lemma 3al, 3.2, eqs(dl-d2) and M = 2, 3/2, eqs(4c-6c) in 
sections 3.2-5. 

The introduction of half-space means, models more physical, with constraints 
for the flows. The technical difficulty is due to the missing of momenta along the 
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interface, here x-axis. For instance the starting square p = 1 of section 2 requires 
a square of lengths 6 and 42 momenta, more simpler without this constraint. 
Furthermore , because we have no momenta with z = 0 ,x ^ 0 , the study was first 
with only 2: > 0, lemmala and second we include the 2: < 0 collisions. For the 
extensions into the whole plane the above constraints are not important. However 
in section 2.3, for Maxwellians at the interface, we must show how the impinging 
densities 2; < 0 can be deduced from the emerging ones 2: > 0. 

Here the great progress, comparing with [1], is that we could determine mix- 
tures models with the same geometrical structure of the momenta in the plane. 
Previously, studying different M mixtures models, we find different structures, like 
different octagons and only for M = 2, the heavy and light species in |o:| + 12:| either 
odd or even. Here we find this previous M = 2 structure, fig2c, for all other M 
values. Consequently, we have found that for the same M mixture model, different 
geometrical structures filling the plane are possible. In fact it is sufficient to prove 
this structure for small |x| -h |2:| = a values and the tools figs 2a-b, lemma 2a-b 
in section 3 are sufficient to expand these geometrical structures into the whole 
plane. For all M models a = 3 is sufficient for the heavy species but different a 
values (the reason being the nonexistence of momenta along the x-axis) for the 
light. Consequently if these different M have the same geometrical structure in the 
whole plane, on the contrary for small finite sizes (contrary to large sizes) of the 
starting models, the structures are different and we must give different proofs for 
the physical preliminary models of sections 3.3-4-5-6, fig 3. 

3) Tools in order to enlarge physical models. 

Starting with a preliminary simple physical model, we can for new ’’physical” add 
new momenta: 

3.1 4 momenta of the same species of a mixture or single-gas physical model being 
along rectangles or squares, to 3 {xi, Zj) belonging to the preliminary physical, we 
can include the last one (0:4, 2:4): With conservation laws satisfied, in order to 
eliminate this collision, we must add the last one. 

X) (a^i. 2»), y] A + a;- + zl ^ add {X4, 24) (1) 

2=1,2 2=3,4 2=1,2 2=3,4 

3.2 Along a square with the particle at rest at the center and 2 momenta of the 
preliminary model along the diagonals (or medians), we add 2 new ones along the 
medians (diagonals) . 

3.3 For one species of a physical model, we can add the densities symmetric to the 
bisectors 2: = ±x if, for the new densities, exist in the old model other densities 
with the same 2: (see [ 1 ]). 

3.4 These tools are not always sufficient to enlarge physical models, but in some 
cases we can include 2, 3. . . new densities. For a previous physical model we start 
with the old conservation laws which are invariants with old densities and old 
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collisions. With new densities i = 1, 2 . . . we have new collisions Xk (satisfying 
conservation laws) including some previous densities. 

For their evolution equations included into the old conservation laws we deduce 
sums of Xk- If these sums (at least for one conservation law) are all zero, then 
the new model will have spurious invariants. The reason being that this old con- 
servation is still an invariant with the new collisions, but another invariant exists 
with only the new densities. So these sums, for each old conservation law, must be 
different of zero (but this is not sufficient). For these sums, we get Yl^k^Xk with 
known numerical constants and (j) = 1, 2, 3 . . . for mass, energy, momentum 
conservation laws. Now for the new densities, the evolution equations h{Xk) give 
other sums of Xk with known coefficients. For each physical conservation law (j) we 
consider the sum = 0 where are arbitrary unknown 

constants. For each collision Xk, the sum of the constants must be zero. Solving 
this linear system we must verify, for the set bf , that only the numerical values 
of the new (j) physical conservation law are possible. Otherwise, we have spurious 
invariants. 

In the sequel, for a set ys, we define Vs and y^- = yi — yj. In 

the X, z plane, for all velocities v{±x ^ 0,z), the associated densities N{±x, z) are 
equal. So we write q ind. Ni for the number of really independent Ni. We have 
two successive studies: 

For single-gas, first for our starting models we write the equations of the evolution 
equations with the associated collisions. We must prove that with linear combi- 
nations of these collisions, only 3 are independent and only equivalent to the 3 
physical conservation laws. We begin by writing the collisions Xk quadratic in 
the densities Ni. Then we write the evolution equations k = {dt + Zidz)Ni (or 
ZidzNi) and write the nonlinear equations li — ^2^ Xi^k^k = 0 with Xi^k being 
known numerical constants. Then we write the Xk as linear combinations of the 
Ij that we substitute into the above li nonlinear equations and obtain invariants: 
~ with Vi^j known numerical constants. The sets Vi^j ^ 0 give the 
invariants and we prove that they are equivalent to the three conservation laws. 
Second with (1), we enlarge the physical models with squares, rectangles with3 
densities known (see below 3.1 the most important tool here) adding successively 
new densities. For mixtures, the strategy is similar but with light fy = Pi, fi and 
heavy Vj,Pj = MVj, Fj velocities, momenta, densities. 

With scaling we can generalize to half- integers,. . . . The presented models, 
filling all integer coordinates, have a regular grid (mesh-step 1). We can generalize 
with a mesh-step h finite: {v hv,P hP) and ^ (AF)^, P^ (hP)^ 
(collisions still valid). For the construction of physical models we present sets 
of p-th selected momenta and we know (advantage to the standard numerical 
discretizations) that in all intermediate steps we have no spurious invariants. 
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2 Physical single-gas DVMs filling all integers {z ^ 0) of the plane 

2.1 Physical p — 1 square, fig la, with Vi[x, z), x G —3, 3, z G —3, 3, 0 

Lemma la. We prove with binary collisions (iVo present or not) that the p = 1 
square model with 43 Vi(x,z) (42) and 25 (24) ind. N{ is physical. 

1) We begin with 7u/x, z) 5 ind. Ni, i = 1,3, 5, 6, 7 and x —x ^ Vk^\,k = 1, 3: 

ui(l,l), U 3 ( 1 , 2 ), U 5 ( 0 , 1 ), (2a) 

We call N^{t,z) (or N^{z)), k the densities, evolution equations associated to 
Vi{x,z) and A^fc+i = Nk,k — 1,3 in (2a). With the evolution equations {dt + 
Zidz)Ni = k linear combinations of two quadratic collisions Xk and X 2 linear 
combinations of the k, we deduce 3 invariants [I], [II], [III] which are equivalent to 
the mass [M], energy 2[E] and momentum [J] along z, conservation laws. 

Xi = N^Nj ^Nl X2= N3N5 - NqNu 
^ «7 = -Xi, -h = 2 X 2 + Xi, h = Xi“2, X2=h= k/2 = li J2 = -Z3,7 

^ [/] = h- 2h = 0, [//] = 2Zi + ll, = 0, [///] = Zi,3,7 = 0 (2b) 
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[M] = is, 6 , 7 + 2ii,3 = w + [II] + 2[III], 

[J] = 3h + 2ii,6 + 4ia +h = 2[7] + [77] + 4[777], 

2[E] = 4ii,6 + 9/7 + lOia + h= 4[7] + [77] + 10[777] (2c) 

2) We go on, fig la, with collisions in squares (3 known) and add successively; 

(i):N{±l,3) (n):JV(±2,2) 

(in) : N{±2, 3), N{±2, 1) {iv) : N{±3, 2) (u) ; N{±3, 3), N{±3, 1) (2d) 

In X 6 —3, 3, 2 € 1, 3 ^ 0, the N{x, z) belong to a physical model. 

3) For a link between the 2 < 0 and 2 > 0 densities, we begin with AT(0, —1). For 
the other 2 < 0, with collisions in rectangles and squares (3 N{x,z) known), we 
successively add a new N{x,z): 

{vi) : iV(-2, 1)A^(2, 1) - A^7A^(0, -1) ^ A^(0, -1), (vn) : N{±x, -1), 

(viii) : N{-1, -1)A^(3, 1) - N{2, 2)N{0, -2) ^ A^(0, -2) (ix) : N{±x, -2), 

(x) : 7V(-1, -2)iV(l, -2) - A^(0, -1)AT(0, -3) ^ iV(0, -3) (xi) : N{±x, -3) (2e) 

with X = 1, 2, 3. The with integers x e —3,3, 2 : G —3,3 ^ 0, belong to the 
physical p — 1 model. Finally, if we want to add N{0^ 0), the collision in the square 
7V(1, l)iV(-l, 1) - 7V(0, 2)AT(0, 0) is sufficient. 

2.2 Physical p model, fig lb, 2{2p + 3)(p -h 2)vi, 2{p + 2){p -h 3) ind.AT^ 

For the extensions to p = 2,3, .. . physical models, we use collisions in squares, 
rectangles (3 known, without Nq), adding one N(x, z). First, from the p=l square, 
we construct a physical p = 2 square with sides 8, fig lb. Then, from the p — I 
physical model, we obtain a physical p-th square, p integer arbitrary, limited by 
X = ±{p + 2),y = ±{p + 2), filling (p 00 ) all integer coordinates except the 
x-axis. We still write N{x,z) for the densities associated to v{x^z). 

Lemma lb. The p-th model, p integer arbitrary, fig lb, (sides 2p-|-4) is physical. 
From the p — 1 model we present the new N{x > 0, z > 0): N{i,p-\-2), A/’(p-l-2, j), 
i = 0, 1, . . . ,p + 2, j — 1, 2, . . . ,p + 1 which are successively included in squares, 
rectangles (3 densities known, add the last one). 

(*) : (-1,P+ 1) + (1,P+ 1) = (0,p) + (0,p + 2) 

(m) ; (i - 1 ,p + 2) + (i,p+ 1) = (i - 1 ,p+ 1) + (z,p+ 2),i = 1,. .. ,p+ 1 
(m) : (p+l,p) + (p+ l,p+ 2) = (p,p+ 1) + (p+ 2,p+ 1) 

(iv) : (p + l,p + 2) + (p + 2,p+ 1) = (p+ l,p+ 1) + (p + 2,p + 2), 

(p + 1,7) + (p + 2,7 + 1) = (p + 1,7 + 1) + (p + 2,7), 

j = p,p — 1 . . . 1 ^ addN{±x, z) : 

(0,p + 2), (i,p + 2), (p + 2,p + 1), (p + 2,p + 2), (p + 2,7). (2f) 
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2.3 Single-Gas DVMs Half-space [3] 

The DVMs is in equilibrium with the surface (wall or condensed phase) when 
the microscopic densities Ny^{x^z) of the Maxwellian state are associated with 
the macroscopic variables at the surface (mass, momentum, energy). 

At the interface, the particles leaving the condensed phase (emerging) are in 
Maxwellian equilibrium with it. At infinity, the gas is assumed to be uniform 
and in a Maxwellian equilibrium associated with the macroscopic variables of the 
fiow. There is evaporation (condensation) when the density of the vapor near the 
interface is lower (higher) than the saturation density. Here we discuss only the 
Maxwellians at the interface. We start with the emerging densities N^{x^z > 0) 
and with vanishing collision terms and vanishing momentum along the 2-axis we 
deduce the impinging 2 < 0 and could obtain all macroscopic quantities at 

the interface. We find, that the Ny^ with the same \v{x, z)\ are equal (for instance 
for emerging, impinging densities with opposite velocities or symmetric with the 
2 = ±x-axes). Furthermore, all with still Ny,{x, 2) = Nyj{—x, 2), are functions 
of 2 emerging densities and of |F|. We present results only for p = 1: 

First with vanishing collisions in rectangles parallel to the z-axis, 2 = 1, 2,3: 

X = ±1, ±2, ±3, z = l,2, 3, = Nyj{0, -z)lNyj{0, z) = Nyj{x, -z)/Nyj{x, z) 

Second with 2 squares we show that A3 = Aj and with 2 other A2 = X\ 

NUO, t3)JV^(0, ±1) = Nl{2, Tl) - A3 = iV^(0, -3)/iV^(0, 3) = 

[iV^(0, -1)/AT^(0, l)][iV^(2, -1)AT„(2, l)]^ = Af , iV^(0, ±2)AT^(2, ±2) = 
iV^(l,±3)JV^(l,±l) ^ A| = A1A3 ^ A2 = Xf (2g) 

Jw = 0, A^ = A^, 2 = 1, 2, 3, X — 1, 2, 3 ^ A = 1 and Nw{±x, ±2), \v{x, 2)^ = 
+ 2^ are equal: 

0 = Jtu = y^z[AT^(0, z) - A^to( 0, -2) + 2"^Nyj{x,z) - N^{x, -2)] = 

2 X 

^(1 - A^)[A^t.(0, z) + 2Y^ N^{x, z)]^X = 1, 

Z X 

Nyj(x, 2) = Nyj{—x, z) = Ny;{—x, ~ z) — Nyj{x, ~ z) aud (2h) 

N^{x,z)N^{-x, -2) = \N^{x,z)\'^ = Nu,{z,x)N^{-z,x) = \N-w{z,x)\'^ 

deduced from the rectangle (±x, ±2), (±2, ±x), parallel to a bisector. With the 
collisions Ai, A2 and (i), (ii), (iii) of (2a-d), we write Nyj{x,z) functions of the 3 
i = 1, 5,6 and only 2 with A = 1, (2g-h), giving a restriction (for brevity we 
write Nk for the Nyj^k of (2a-b)) 

JV3 = NqNi/N 5, N7 = NI/N5 = NlNf/Ni, 

{i)N^{l,3) = N.,Nr/Ne = NsNeNf/Nl 
{n)N^{2,2) = NMl,3)/Ne = NsNf/Nl, 

{iii)N^N^(2, 1) = JV| = NiN^{2,2) = ^ N 3 = Nf/Ni = 

NqNi/N5 - = Nw.eNi 5 : (2i) 
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Lemma Ic. We define Ny,{x,z) — Ny,{x,z)/N^,^ = Ni/N^, get: Nw = 

We have = l^Ni = (i and from (2i): Nq = /i^, Nj = For all 

other with collisions satisfying I, eq(l) and 3 Ny^ satisfying this relation, we can 
extend for the last: 



Nyj{XA,Z4) = N yj{Xl, Zl)N yj{X2, Z2) / N yj{x^, Z 2 ) = 

^\ vi \^ + \ V 2\‘^ -2 _ ^\ va { xa , za )\^-1 

2.4 Nicodin p-th squares [3], flglc, filling |x| , fy| odd integers 

aq = 2q-l, v{x,z) \{r]iaq,r} 2 as), ^4^5), 

ril = l,s = 0,l,...q q = l,2,...p, (2j) 

Lemma Id. The p = 2 model is physical, assuming the p — 1 physical, then the p-th 
is physical. We write {x^z) for v{x > —l^z > —1) and for the collisions, we must 
add {x,z) (z,x) (for instance below for p-th: (i), (ii)) and other x, 2: symmetries: 

p = 2 X = A^(1,1)A^(3,-1) - A/'(1,-1)A/'(3,1) ^ = /(±3,-l) = 

— /(±1, —1) = — /(±3, 1) 3 invariants and we add N{±1, ±3), A^(±3, ±3) with 

(=F1,±1) + (±3,±1)) = (dzl,=Fl) + (±l,±3), (ü)(±l,±3) + (±3,±l) =: (±1,±1) + 
(±3, ±3) 
p-th model with 

(i) (ttp_i , 3) + (dp— 1 , 1) ~ (d'p— 2 5 1 ) T (^p5 f ) 

(ii) (ap,l) + (ap-i,aj) = (ap_i,l) -h (ap,aj),j = 2, .p - 1 

(Hi) (cLp—\^CLp) T (cip^CLp—x) — ((lp—\^Q>p—\) T (dp^CLp) > 

add N (dp >1)5 X ( 1 , dp ) , N (dp , dj ) , N (dj , dp ) , N (dp , dp) . (2k) 

3 Physical mixtures DVMs filling all integers {z ^ 0) of the plane 

3.1 One species filling (squares, rectangles) \x\ + fy| odd or even 

We write (x,z) for the momenta and for collisions (1) with 3 (xi,Zi) known we 
add the last (x 4 ,Z 4 ) We begin, fig 2a, with few (x,z) of one species satisfying 
\x\ -j- \z\ — 1,3. We call p = 1 the physical model (mainly parallel to y — —x, first 
quadrant x > —l,z > —1) and with squares and rectangles (1), deduce physical 
2, 3, . . .p arbitrary with |x| + |2;| = 1 , 3, . . . 2p + 1, a: > —1, 2: > -1, 2: ^ 0. 
p = 1, 6 momenta: (0, 1), (0,3), (±1,2), (2,±1) (3al) 

Lemma 2a. With (1), the p = 2 model deduced from p = 1 (j — 2 in I, adding 
momenta |x|±|2:| = 5) and . . . the p-th models deduced fromp— 1 (adding |x|±|2;| = 
2p ± 1) are physical. We successively get new momenta /, //, III, IV: 

I: {p-j,p+j-l) + (p-j + 2,p+j~3) = (,p-j,P+j-3) + {p-j + 2,p+j-l), j = 
p,p- 4-p ^ II : (2p-2,3) + (2p-2,l) = (2p- 3,2) + (2p - 
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1,2), (2p-2,3) + (2p-2,-l) = (2p-4,l) + (2p,l), (0, 2p - 1) + (2, 2p - 1) = 
(l,2p-2) + (l,2p)^ 

III : (2p,l) + (2p- 2,-1) = (2p- 2,1) + (2p,-l), (-1, 2p - 2) + (1, 2p) = 
(l,2p-2) + (-l,2p)^ n^:(-l,2p) + (l,2p) = (0,2p-l) + (0,2p+l) (361) 

Geometrically the momenta of the 2p — 3 collisions I are on 3 lines parallel to 
y = —X and squares, parallel to the x, 2 axes and one top for the new p-th model. 

We go on, fig 2b, with few momenta (x, z) of one species satisfying |x| -\-\z\ = 
2,4 (geometrically similar to the previous p = 1 model), with x > —l,z > — 1, 
Similarly we deduce physical p — 2 and p arbitrary models but with |x| + \z\ — 
2, 4, . . . 2p + 2. We start with p = 1: 

p = l: 8 momenta: (0,2), (0,4), (±1,3), (2,2), (1,1), (3,±1) (3a2) 

Lemma 2b. the p = 2 (from p = 1) and p-th (from p— 1) models are physical, with 
the same geometrical explanation of the 2p — 2 collisions I as previously: 

I: (p - j,p + j) + {p - j + 2,p + j - 2) = (p-j,p + j~2) + {p-j + 2,p + j), j = 
p,p - 1,...2,1,. > 3 - p ^ // : (2p - 1,1) + (2p- 1,3) = (2p - 2,2) + 

(2p,2), (2p-l,3) + (2p-l,-l) = (2p-3,l) + (2p+l,l), (0, 2p) + (2, 2p) = 
(l,2p-l) + (l,2p+l) ^III : (2p-l,-l) + (2p+l,l) = (2p-l,l) + 

(2p+l,-l), (-l,2p-l) + (l,2p+l) = (l,2p-l) + (-l,2p+l) -^IV : 

(-l,2p+l) + (l,2p+l) = (0,2p) + (0,2p + 2) . (362) 

In conclusion, fig 2c let us consider any physical binary mixtures where the two 
species: heavy M > l,Pj, Fj, Lj and light: m = l,pi, /^, U (masses, momenta, 
densities, evolutions equations) contain the starting (3al-a2), and their x^z sym- 
metric densities (see (3c) and fig 3). Then with Lemmas 2a-b, we cant fill all the 
plane with either |x| + \z\ odd or even for the 2 species, 

light: po(0,0), pi(l,l) = -P3, P7(1,3) = -pg, pn(2,2) = -pis, pi7(3,l) = 
-Pi9, Pi{-x,z) pi+i, fi+i = fi i = 1,3,7,9,11,13.17,19, ps(0,2) = 

-P6, Pl5j0,4) = -pi6, 

heavy: Pi{l,2) = -P3,A(2, 1) = -Pu,Pj{-x,z) -> Pj+i, Fj+i = Fj j = 
1,11,P5(0,1) = -P6,^7(0,3) = -A . ^ (3c) 

In general for M fixed, as we shall see, we add other p^, Pj for the complete proof. 

3.2 Tools in order to enlarge one species, fig 3 

Lemma 3al. We assume that one species: Fj, j = 1, 2, 9, 10, (3c) and i"i7(l,4), 
Fi 8(— 1, 4), fig3, belongs to a physical model. We can add F5, P7, (3c) and Fi3(2, 3), 
±*14 (—2,3), fig3, for a new physical. With 3 arbitrary a, 6, c, we verify that they 
can only have the correct values. We write the collisions, Li and verify that the 
invariants are equivalent to the conservation laws: 

Qi = FjFis — ^1^17? ^^2 = P5±’l3 — ±7-^9? ^3 = ±5±7 ~ ^ ^17 = ^1, ±1 = 

f2i^3, Zyg ±5 2 O 2 ^3? Z /7 — ^^2,1 ^3? Z/X3 — ^1,2: ^ ~~ 

nZ/5±6Z/7±cZ/i3 = ^2(2(6— a)— c)— r2i(26±c)— fl3(a±6 ), [-^l] — 2Z/i^9^i7±A" = 

r^i(4 — 2b — c) F ^ 3(2 — a — 6 ) + ^ 2(2 — 2a + 26 — c)=^0-^a = 6 = l, c = 2 
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ok [2^;] = 34 Li7 + 10 Li,9 + X = Oi(44 - 26 - c) + f]2(10 - 2a + 26 - c) + 
^3(10 - a - 6) a = 1, 6 = 9, c = 26 ok [J] = SLu + 4Li + 2Lg + X = 
Ü3(4 - a - 6) -h Oi(12 - 26 - c) + ^ 2(2 -2a + 26-c)-0^a=l,6-3, c = 6 
ok {Ml) 

Lemma 3a2. We assume that one species: , j = 1, 2, 9, 10, (3c) and Fi3(2,3), 

Fi 4(— 2, 3), fig 3, belongs to a physical model. We can add F5, F7, (3c), for a new 
physical. Like previously with 2 arbitrary a, 6 (only correct values), we write the 
fli, Li and verify the conservation laws: 

Lli = F7F5 — Ff, ^2 = F7F9 — F5F13, Li3 = Ü2, Li = Oi, Lq = 

—LI 2 , F5 = 2f^2~^i5 ^7 — ~2fl2“^i? X = aL^^bLj == 2Ll2{ci — b) — Çli{a-\-b) 
[Ml] = 2 Li, 9 ,i 3 + X = -2^7i + X->a = 6 = lok [2F] = 26F13 + 10Li,9 + X = 
I6O2 F IOO2 ~h X — > a = 1, 6 = 9 ok [J] = 6F13 -1- AL\ -1- 2Lq X = 2 F X = 

0^a = l, 6 = 3ok ’ (3d2) 

3.3 Half-Space Physical M = 2, figs 3-2c, with light /o rest-particle, model 

We start with a preliminary physical 5fy, 6Fj model, extend to a starting model 
with the heavy (3.1) filling |x| + |z| odd and light (with tools) even. 

3.3a: Starting physical model, fig 3: 12Pj{x, z) and 7pi{x,z) satisfying (3al): 

In (3c), fig 3, we retain: 

Pi, z = 0,1, 2, 3, 4, 5,6, F^ j = l,2,3,4, 5,6, 7,8, 9,10,11,12. (4a) 

First, we study the 11 Fj.fi model: j = 1,...6 and i = 0, ...4, 7 ind. j = 

I, 3, 5, 6, 2 = 0, 1, 3 and 3 collisions, we write the Lj, k and the conservations laws: 
[Ml]-, [Ml] for the masses (heavy, light species), the energy [E] and [J], the z-axis 
momentum. We get 4 invariants, equivalent to the 4 conservation laws. 

L2 = Fefy— F5/3, F2 = foFi—fiF^, = /0F3 — fyFe, Li = (^i+2^^)F2 == -P2, 
L‘s = {dt — 2dz)F'^ = — F3, /q = dtfo = — 2F2,3, h — {dt + dz)fi = F 2 h = 
{dt - dz)h = T2,3, L5 = {dt + dz)F^ = 2F2,2, Le = {dt - dz)Fe = 2 r~^ ' [Mi] = 
/o + 2/2,3 = 0 , [Ml] = Fe^e + 21^2,3 = 0, 4[F] = 8/2,3 + 10L2,3 + L5,6 = 0, [‘^j = 

^^1,3 + ^^1,3 + ^5,6 ~ ^ 

For the /^, Lj, with arbitrary constants a, 6, c, we verify that they can only have 
the values for [Mi], [Ml]- For the k, z / 0, Lj, first we write F2 as linear combi- 
nations ol li, Lj, F2, F3. Second, we eliminate these collisions, get 3 invariants I, 

II, III with only 2 new ones [E], [J]. 

{ 1 ) li- 0 = /q + a/2 + 6/3 = F2(6 — a) + F2(a — 2) + F3(6— 2) — > a = 6 = 2 — > [^i ] ? 
(ii) Lj‘. 0 = F2 + aZ/3 + 6F5 + cLq = F2(6 — c) + F2(— 1 + 26) + F3(2c — a) 
^a=l = 26 = 2 c^ [Ml]/ 2. (iii) k,Lj not /q : Fi = h — +3 = ~h + L2 = 
L5/2 - F2 = -Lq /2 + F3 ^ /3 + L3 = -(/i + Li) = L5/2 + F2 = -Le/2 - L3 ^ 
[/] : ^i,3 + Li,3 = 0 8 [J] = 4[F]-[Ml], [//] : 2L2,3 + L5,6 = [Ml], [///] : 

/3 - L5/2 + L3 - L2 = 0 ^ AI 1 1 = 2[7] - [Ml] ~ [J] = 0 . (4c) 
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Second, with collisions in squares and ( 1 ), we add the last one Fj, j = 7, 8,9, 11 
/i, i = 5,6 giving a 19 Fj, fi physical model: 

Ff - F,F-r, Fi - FeFs, F^Fj - Fi, FjFg - Ff^ fi - foh, fi - fofe- (4rf) 
3.3b Physical models, including all Fj, fi of the plane (except x-axis) 

In eq(4a), the Fj,Fj j = 1,2,5,7,9,12 are the coordinates of 3.1 eq(3al), we 
can add \x\ + | 2 :| odd with {x > — 1 , 2 : > —1 ^ 0) for physical models and other 
quadrants with x, 2 : symmetries, the heavy fill all coordinates with \x\ -\-\z\ odd, 2 : 

0. For the light particles the 3.1, (3a2) momenta (0, 4), (dzl, 3), (2, 2), (3, ±1) 
are missing. But F(3, 4), F(3, 4), fig3, belongs to the physical model. We start with 
the 7/i, z = 0, 1, .6 of (4a) and F(3,4). With 5 collisions (3 known), we add the 
new ones: i == 7, 11, 15, 17, 19 written in (3c) 

/oF( 3 , 4) — /7F9, /7/1 — /5/11, /s/? — /5/155 / 4 /t — /2/17, /17/4 - /1/19 • (4e) 

We apply 3.1, (3a2), all |x| + \z\ even, x > — 1 , 2 : > — 1 , 7 ^ 0 and other quadrants 
with X, 2 : symmetries, are filled. All integers 2 : 7 ^ 0 are filled with either fi or Fj. 

3.4 Half-Space Physical M = 3, figs 3-2c, with light /q rest-particle, model 

We start with a preliminary physical 3 /^, 8 Fj model, extend to a starting model 
with the heavy (3.1) filling |x| -h \z\ odd and light (with tools) even. 

3.4a. Starting physical model, fig 3: 12Fj(x, 2 :) and 7pi{x,z) 

Pi, z = 0,1, 2, 3, 4, 5,6 F^, j- 1,2, 3, 4, 9,10,11,12, 17,18,19,20. (5a) 

First, we study the 11 Fj, fi model: j = 1, 2, 3,4.17, 18, 19, 20 and z = 0, 5 , 6 with 
7 ind. j = 1,3,17,19 i = 0,5,6. With 3 collisions, we write the Lj, k and the 
conservation laws. For the k, Lj, with constants a,b,c, they can only have the 
values for [Mi], [Ml]. For the li, i 0, Lj, we write Fa as linear combinations of 
li, Lj, get 3 invariants I, II, III but only 2 new [E], [J]. 

Fi = /oFi7 — /5Fi,F2 = /qFi9 — /6F3,Fs = feFi — /5F3, Li - L3 = 

T2,3, Lig — -F2, Fi 7 = — Fi, Iq = — 2Fi^ 2,^5 == 2Fi^3,/e = 2F2 3, [Ml] - /o,5,6 = 
0, [Ml] = 2Fi^3^i7^i9 = 0, 2[F] = 12/5^6+10Fi^3+34Fi7,i9 = 0, [J] = AL^ j-\-21^ q-\- 
SL^ 2 = 0 (i) li‘. 0 = IgFdl^FblQ = 2Fi(a — l)-t-2F2(è — l)-h2F3(a — 5) — > a — b = 

1 ^ [Ml], (ii) Fj: 0 = Fi FaFa + 5Fi7 + cFi9 = Fi(l — 6) + F2(5 — c) +Fa(a — 1) 
— >a = l = 6 = c^ [Ml]/2 . (hi) li, Lj not /q • F3 = — Fi^i7 = L^^ig = /5/2+F17 = 
— IqI2 — Fi9 [/] = Fi^3J7^19 = [Ml]/2 [II] = F,6 + 2Fi7^i9 — > 12[//] + 5[Ml] = 
2[F] [///] = 2F3 + 2F- ^17 - F ^ 6 [///] - 3[J] = 17[Ml]/2 = [E] . (56) 

Second, with collisions in squares (1), we add the last one: /0/5 — /f , /o/e ~ f 3 
new fi, i = 1,2, 3, 4 and also (2.3 same 2 : as /i,/a) we add Fj, j = 9,10,11,12 
symmetric 2 : = ±x of j = 1,2,3,4 ^ 19Fj, fi physical. Finally, from F\, Fg, Fu 
and Lemma 3al, we can add Fj, F 5 . 
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3.4b Physical models, including all Fj, fi of the plane (except x-axis) 

The Fj, Pj j = 1,2,5,7,9,12 are the coordinates of 3.1 eq(3al) and belong to 
the 3.4a physical model. We can add |x| + \z\ odd with {x > —1,2; > — 1 7^ 0) 
for physical models and other quadrants with x, 2; symmetries. The heavy fill 
all coordinates with |x| P\z\ odd, 2; / 0 in particular F(l,8), fig3. For the light 
particles fi, pi, with 3 collisions, we add fi, i = 15, 11, 7: 65 = 17 + 48, foF{l, 8) — 
fi 5 Fi 7 , fofi 5 - fn, /5/15 - fj - Now, for the light, we can apply 3.1, (3a2) for the 
first quadrant and all {x,z symmetries) are filled with light [xj -\-\z\ even. 

3.5 Half-Space Physical M = 3/2, figs 3-2c, with light /o rest-particle 

We start with a preliminary physical 5fi, SFj model, extend to a starting model 
with the heavy (3.1) filling |x| + \z\ odd and light (with tools) even. 

3.5a. Starting physical model, fig 3: 20 Pj (x, z) and 7 Pi{x, z) 

fig3 with: Pi, i = 0, 11,..., 16 Pj, j = 1,...,4 5,..., 8 9, . . . , 12 17, ...,20 
21,..., 24. (6a) 

First, we study the 13 Fj, model: j = l,2,3,4, 17, 18, 19, 20 and z = 0, 11, 12, 13, 
14 with 7 ind. densities j = 1,3, 17, 19 z = 0, 11, 13. With 3 collisions, we write 
the Lj, li and the conservation laws. For the U, Lj, with arbitrary a, b, c, they 
can only have the values for [Mi], [Ml]. For the li,i ^ 0, Lj, with Fs linear 
combinations of 4 li, Lj we get 3 invariants I, II, III but only 2 new [E], [J]. 

Fi = foLi? - fiiLi,T2 -- foFig - frsF3,Ts = fisFi — fiiFs,Li — F^ 3,F3 = 

F2,3, Liq = — F2 , Li 7 = -Fi,/o = — 2 Fi, 2,^11 = Fl 3 ,/i 3 := F2 3, [M/j = Iq -\- 
2^11,13 = 0, [Ml] = 2Li^347,i 9 = 0, 2[F] = 24/ii,i3 + lOLi^s + 34Li7,i9 = 

0, [«>^]/4: = -^1,3 F 2F27 19 + ^11^13 = 0 (i) Ip. 0 = Iq a/ll + ^^13 = Fi(a — 2) + 

F2(5 — 2) + F3(a — 6) — > a = b = 2 — > [M/], (ii) Lj: 0 = Fi + clL^ + bLu + cL\g = 
Fi(l -b) + F2(a - c) + F3(a -l)^a = l = 6 = c^ [Ml]/2 . (iii) k,Lj 

not Iq : Fs = — Fi^i7 = F349 = lu + L17 = — /13 — L\q I = Fi^3^i7^i9 = 
[Ml]/2 [7/] = /11,13 + Fi 7 ,i 9 ^ 12[/7] + 5[Ml] = 2[F] [7/7] = L3 + 2L^g + 

/i3 ^ 24[777] + 3[J] - 7 [Ml]/2 = [F] . (65) 

Second, we add 8 physical Fj, j = 9, . . . 12 and j = 21, ... 24 with 4 ind. j = 
9,11, 21,23 

F4 = /0F21 — /iiFii,F 5 = /0F23 — fuFg^Fli = F23F9 — FiiF2i,n2 = F17F19 — 
F23F21, L9 = F5 — Fl\, Lu = F4 + Fli, L21 — — F4 + f)i,2j L23 — — F5 + 
^2,1’ -^17,19 = 2fl2) ^11,13 ~ ^4,5 ^ X — X/j=:9,ll,21,23 ~ ^5(^9 ~ ^23) + 

F4(ttii — a2l) + f^l(ûll,21 — <^9,23) + ^^2<^21,23 (i)' [Ml] = . . . 2Fi7^19 F X = 

... — 4^2 + X — > 2 = a9 = ail = <^21 — ^^23 c)k (fi)'* [2F] = • • • + 34Fi7^i9 + 
24/iiq3 + .X^ = . . . 24F4^5 — 68fl2 + -^ — ^ cig — clii — 10, a2i = 0,23 — 34 ok (ifi)* 
[J] = • • • + 4:/ii 13 + 8L1Y 19 F X — • • • + 4F4 5 + X ^ a9 = —an = a2i = —^23 = 2 
ok (6c) 
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Third, from Fg, F 17 and LemmaSal, we can add F 7 , F 5 . The Fj, Pj j = 
1,2, 5,7, 9, 12 are the coordinates of 3.1 eq(3al), we can add \x\ + \z\ odd with 
(x > — 1 , 2 : > -1 / 0 ) for physical models (other quadrants with x, 2 : symmetries). 
For the light we add 

/o/i 5 - /u, / 0/16 - fi 3 new/i, i = 15, 16 ^ 27 Fj, fi physical model. (6d) 

3.5b Physical models, figs 3-2c, including all Fj , fi of the plane (except x-axis) 

The heavy Fj fill all coordinates with \x\ -\-\z\ odd, z 0 in particular F(4, 7), 
fig3. With an energy exchange collision /qF(4, 7) - /(2, 6 )Fg, 65 = 60 + 5, we first 
add /(d=2,6). Second in the light square /(±2,6), /n, / 12 , center /15 belonging 
to a physical model, we can add the tops of the medians, in particular /(O, 2) = 
Third, in the light physical square /o, / 15 , /n, / 12 , center /s, we add the tops 
of the medians (and with the symmetries) /i, i = 1,2, 3, 4, 7,8,9,10. With 
/ 4/11 - / 0 / 17 , / 4/17 - / 1 / 20 , we add / 17 , /20 finally. Now, for the light, we can 
apply 3.1, (3a2) for the first quadrant and all (x, 2 ; symmetries) are filled with light 
satisfying |x| -\-\z\ even. 

3.6 Half-Space Physical M=4, fig 3, with light /o rest-particle, model 

We start with a preliminary physical 7/i, SFj model, extend to a starting model 
with the heavy (3.1) filling |x| + | 2 | odd and light (with tools) even. 

3.6a: Starting physical model, fig 3: 

In fig 3, we retain 7 z = 0, 1, 2, 3, 4, 5, 6 and 

8 Fj, j = 1,2,3,4, 13,14,15,16 (7a) 

First, we study the 15 Fj, fi model (7a): with 8 ind. densities j = 1,3, 13, 15 i = 
0,1, 3, 5, 6. With 5 collisions, we write the Lj, li and the conservation laws. For 
the li^ Lj, with arbitrary constants a,b^c,d, we verify that they can only have 
the values for [M/], [Ml]. For the Z / 0, Lj, we write Fs, Ai,À 2 as linear 
combinations of /^, Lj and get 4 invariants I,. . . , IV but only 2 new [E], [Jj. 

Tl = /oFi 3 - /iFi,F 2 = /oFi 5 - / 3 F 3 ,F 3 - f^F, - / 5 F 3 , Ai - / 0/5 - /f, A 2 = 

fofe — fi, Li = F^ 3 ,Li3 == -Fi,Li5 = -F2,F3 = r2,3,Zs = 2F3 - Ai,/6 = 

— 2F3— A 2 , — /q = 2Fi^2+Ai^2, h = Fi+Ai, Z 3 = F 2 TA 2 , [M/] = Zo,5,6+2/i^3, [Ml] = 
2Ti^3^13 J 5 , 2[F] = 26Li3J5 + 10Li^ 3 -f 16 /i^3^5^6î [j] = ^“^13,15 T ^^1,3 "h ^^5 g + 2/^ 3 
(i) li 0 = Iq dll -h 6 Z 3 -f- cl^ ~h (Hq = Fi(a — 2 ) H- L‘2{b — 2 ) -t- 2 F 3 (c — d) + 
Ai(a - c - 1 ) + A 2 {b - d-1) ^a = b = 2, c = d = l-^ [Mi], (ii) 

0 = Li + clL^ + bL\^ + cLi 5 = Fi(l — 5) + F 2 (a — c) + F 3 (a — 1) — > a = 1 = 
b — c ^ [Ml]/2. (in) b,z ^ 0,Fj : F 3 = — T 143 == F 3 J 5 = (Zg -h Ai)/2 = 
— (Zg + A2)/2, Zi + Li3 = Ai,Z3 + Li5 — A 2 ^ [/] = 2Li^3^i3^i5 = [Ml], [II] = 
-Zi,5 + 2L3,i5 - Ai3 ^ 32[77] + 8 [J] + 2 [F] = 21[Ml] [777] = 2 L 3 + 3Li5 + ^ 

32[777] = -8[J] + 11[Ml] + 2[F] [IV] = Zi,3,5,6 + ^ 13,15 16[7V] = 2E ~ 5[Ml] 
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Second, we add Fj, j = 9, 10, 11, 12 symmetric 2; ±x to j = 1,2, 3, 4. Finally, we 
add F5, Fj with Lemma 3a2 and with Lemma 2a {x,z symmetries), all \x\ + \z\ 
odd are filled with heavy. 

3.6b Physical models, figs 3-2c, including all Fj, fi of the plane (except x-axis) 

The heavy F(3, 4), F(4, 7) 3.6a are included into a physical model and we can add 
successively fi,i = 7, 11, 15, 17, 20 

/oF( 4 , 7) — /t^(3, 4 ), /1/7 — /5/11 , fj — /5/15, /4/11 — /0/17, /4/17 — /1/20 • 
Finally, we can add all the light densities with |x| +|z| even. 
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Lattice Chiral Symmetry, CP- Violation 
and Majorana Fermions 

Kazuo Fujikawa 



Abstract. A brief summary of lattice fermions defined by the general Ginsparg- 
Wilson algebra is first given. It is then shown that those general class of fermion 
operators have a conflict with CP invariance in chiral gauge theory and with the 
definition of Majorana fermions in the presence of chiral-symmetric Yukawa cou- 
plings. The same conclusion holds for the domain- wall fermion also. 



1 Introduction 

The recent developments in the analysis of chiral symmetry in lattice theory is 
based on the so-called Ginsparg- Wilson relation [1]. Neuberger found an explicit 
construction of lattice fermion operator (the overlap operator) [2] , which was later 
shown to satisfy the Ginsparg- Wilson relation. This operator was then shown to 
satisfy an index theorem on the lattice under certain conditions [3]. This index 
relation was used by Liischer to derive the chiral anomaly as a Jacobian factor [4] 
just as in continuum theory [5]. These topological properties were later examined 
in further detail [6]. See Refs. [7] for reviews of these developments. 

To be more specific, we here work on the Ginsparg- Wilson relation defined 

by [8] 

75 (75^)) + (75^))75 = 2a2'=+i(75/?)'^+" ( 1 ) 

where D is the lattice Dirac operator and k stands for a non-negative integer; 
k — 0 corresponds to the ordinary Ginsparg- Wilson relation. 

We deal with a hermitian lattice operator 

H = aroD = i/t = (2) 

and the above algebra is written as 

75 /Ï + ^^75 = 2i/2'=+2 (3) 

where /c = 0 corresponds to conventional Ginsparg-Wilson relation. We also assume 
that the operator H is local in the sense that it is analytic in the entire Brillouin 
zone and free of species doublers, which is in fact shown for the free operator 
without gauge field [9]. One can then confirm the relation 



( 4 ) 
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The defining algebra is written in various ways such as 

TsH + HTs = 0, 'rsH + = 0, 7 I = 1 

where 

Ts = 75 - 75 = 75 - 

We can also show 



(7sr5)75 = 75(7sr5)- 

We now examine the action defined by 

^ — f ^ 2p{x)D{x, y)'ip{y) 



which is invariant under 



( 5 ^ = Sxp = '0Z675. 

If one considers the field re-definition 

V'' = 75rsV', = '0 

the above action is written as 




which is invariant under naive chiral transformation 



(5) 

(6) 

(7) 

( 8 ) 



(9) 

( 10 ) 



( 11 ) 



S^p' = ic75'0', Sip' = ip'ie^s. 



( 12 ) 



This chiral symmetry implies the relation 

(13) 

This naive chiral symmetry of the species doubler-free operator suggests the non- 
analytic behavior of the factor l/( 75 r 5 ) in the Brillouin zone. In fact, one can 
confirm that [9, 10] 



= 1 - = 0 (14) 

has solutions at the momentum variables corresponding to would-be species dou- 
blers, in the case of free operator without the gauge field, and also in the presence 
of topologically non-trivial gauge field. See also Ref. [11]. 
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We next recall the charge conjugation properties 

= -(7^f , C75C-i=7l’, 

C^C = 1, = -C. (15) 

We then have* 

= D(U)'^, = [75r5(t/)]^, 

= -[75iT(?7)75]^, 

wr5(c/CP)w-' = -hsr^iUhsF, 

w(r^/r)(u^^)w-^ = -[(75r575/r)(c/)]^' (i6) 

where 

r = yri = V(75r575)2 = (i7) 

Here we imposed the relation WD{U^^)W-^ = D{UY or [CD{U)f = -CD{U^) 
which is consistent with the defining Ginsparg-Wilson relation. 

We also have the property 

W75(17^‘')W^“' = - ■ (18) 

2 CP symmetry in lattice chiral gauge theory 

We now examine the CP symmetry in chiral gauge theory 

Cl = 'ipLD'ipL (19) 

where we defined the (general) projection operators 
D = PlDPl + PrDPr, 

'ipL,R = Pl.R"^, Ï^L,R = 'ipPL.R- ( 20 ) 

It was pointed out by Hasenfratz [12] that the conventional Ginsparg-Wilson op- 
erator when applied to chiral gauge theory has a difficulty with CP symmetry. We 
would like to examine this issue in more detail. Under the standard CP transfor- 
mation^ 



^ ^ ( 21 ) 

^We define the CP operation by W = C 70 = 72 with hermitian 72 and the CP transformed 
gauge field by and then W D{U^^)W~^ — D{U)^ , If the parity is realized in the standard 

way, we have CD{U^)C~^ = D{U)^ . 

^The vector-like theory is invariant under this CP transformation. 
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the chiral action is invariant only if 

WPlW~^ = PI, WPlW~^ = Pj. (22) 

It was shown in Ref. [13] that the unique solution for this condition in the frame- 
work of the Ginsparg-Wilson operators is given by 

Pl,r = ^(1 T Ts/r), Pi.fi = 1(1 ±7sr575/r), ( 23 ) 

but these projection operators suffer from singularities in 1/F, as we have already 
noted. Namely, it is impossible to maintain the manifest CP invariance of the local 
and chiral symmetric doubler- free Lagrangian [12, 13, 14]. 

If one stays in the well-defined local Lagrangian 

j^L = j (24) 

where 

P± = ^(1 ± 75 ), P± = 1(1 ± 75 ) (25) 

it is not invariant under the standard CP transformation as 

WP±W~^ = P^ jf^P^iU), 

WP±{U^^)W-^ = ^ t ] 757 | ( 11 ) 75 ] ^ ^ pT^ 

[WP+D{U^^)P-{U^^)W-^] ^ = 75-P+(tl)75P(t/)P- 
= P+D{U)P.{U) - D(P)[75 - r5(C/)l ^ P+DP-. (26) 

This generalizes the analysis of Hasenffatz in a more general setting^. 

3 Majorana fermion 

We next want to show that the general class of Cinsparg-Wilson operators have a 
difficulty to define Majorana fermions in the presence of chiral symmetric Yukawa 
couplings [10]. We start with 

C = PrD'iPr -h PlD'iPl + rn[ï)R'il)L + 

P2g[4)L(t)'ipR + 'ipR(l)^'ipL] 

= ipDxlj + 

+ (75rs75/r)A(r5/r) + z( 75 r 575 /r)s + iB{T5/r)]rp (27) 

^This analysis is extended to a more general class of operators defined by 75// 4- if 75 = 
2if^/(ii^), where /(ü^) is a regular and monotonous non-decreasing function [13] . 
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where 

'^L,R — Pb.R'ip, Ï^L,R — Ï^Pl.r (28) 

with the projection operators in (23), and we used (j) = [A -\- iB)l\j2. We then 
make the substitution [15, 16] 

^ = (x + *»?)/V2, 

^ - ii]^ C)fV2 (29) 



and obtain 

^ = ]pC^CD\ + ]^mx^C'f^Tf,x 

+^x"’c[A + (75r575/r)^(r5/r) + i(75r575/r)s + is(r5/r)]x 

+ (75r575/r)^(r5/r) + i(75r575/r)s + is(r5/r)]r)(.30) 

This relation shows that we can write the Dirac fermion operator as a sum of two 
Majorana operators. One can then define the Majorana fermion x (or r]) and the 
resulting Pfaffian as a square root of the determinant of lattice Dirac operator. But 
this formulation of the Majorana fermion inevitably suffers from the singularities 
of the modified chiral operators Ps/P and 75P575/P in the Brillouin zone, as we 
have already explained. 

We note that the condition, 

CPlC-'^ = Pi, CPlC-^ = Pi (31) 

which is required by the consistent CP property, is directly related to the condition 
of the consistent Majorana reduction for the term containing scalar field A(x), 

C(7sr575/r).4(a;)(r5/r) = -[C(75r575/r)^(x)(r5/r)]^ (32) 

in the Yukawa coupling^, if one recalls that the difference operators in P 5 and P 
do not commute with the field A{x). 

In other words, if one uses the projection operators which do not satisfy, 

CPlC-^ = Pi, CPlC-^ = Pi (33) 

the consistent Majorana reduction is not realized. For the chiral symmetric Yukawa 
couplings such as in supersymmetry the Majorana reduction is thus directly related 
to the condition of the CP invariance, provided that parity properties are the 
standard ones. 

■^If {CO)^ = —CO for a general operator O, the cross term vanishes rj^COx ~ = 0 

by using the anti-commuting property of x and 77. 
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4 Discussion and conclusion 

We have shown that both of the consistent definitions of CP symmetry in chiral 
gauge theory and the Majorana fermion in the presence of chiral symmetric Yukawa 
couplings are based on the same condition, 

CPlC-^ = Pj, CPlC-^ = Pj ( 34 ) 

and that the construction of projection operators, which are the unique solution, 

PL,R = \{iTr,/r), 

Pl,r = 2 ^^ ^ Isrsls/r) ( 35 ) 

inevitably suffers from singularities in the modified chiral operators (to be precise, 
in 1/r) for any Dirac operator D satisfying the algebraic relation 



75^/ + //75 = 2P2fc+2 



(36) 



with H = a'y^D. We find it interesting that the breaking of CP symmetry and 
a conflict with Majorana reduction are directly related to the basic notions of 
locality and species doubling in lattice theory. 

If one uses well-defined regular lattice operators 



J D'il; lViIjl exp 



(y ipP+DP-ip 



(37) 



with 



p ± = 2(1 =*= 75), 

P± = ^(l±7s) (38) 

the CP breaking in lattice chiral theory on the basis of Ginsparg- Wilson operator 
inevitably appears. As for the physical implications of this breaking of CP symme- 
try in pure lattice chiral theory, it was shown in Ref. [14] following the formulation 
in [17, 18, 19, 20] that the effects of CP breaking are isolated in the (almost) 
contact term of fermion propagator, which is connected to the external fermion 
sources. The CP breaking effects in all the loop diagrams are under well-control 
and they are absorbed into the weight factors related to various topological sec- 
tors in the fermionic path integral. In the presence of the Higgs coupling and in 
particular in the presence of the vacuum expectation value of the Higgs field, the 
analysis of CP breaking becomes more involved and it could be more serious [14]. 

It is also shown that this complication of CP breaking persists in the domain- 
wall fermion [21, 22, 23, 24, 25]. In fact, it is shown that the domain- wall fermion 
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in the limit N = oo, when applied to chiral theory, is valid only for the topolog- 
ically trivial sector and that it still suffers from the complications related to CP 
symmetry [26]. In the N = oo limit, the domain- wall fermion is related to the 
Ginsparg- Wilson fermion by 



(/' 



V'lpL'I^'ipLexp { / 'ipP^DP-ip 



/ 

= J T^QLVqiVQLVQReTip + J Qp^--^P_Q^{39) 



where q and q stand for the standard variables in domain- wall fermion, and Q and 
Q stand for the Pauli-Villars fields. Note the appearance of the non-local operator 
o/(75rs). 

It is customary to use the domain-wall fermion for finite N in practical ap- 
plications, but we consider that the finite N theory is not in a better situation 
with respect to CP symmetry either^, besides the ill-defined chiral symmetry. 
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Observations on Noncommuting Coordinates 
and on Fields Depending on Them 

R. Jackiw 



Abstract. The original ideas about noncommuting coordinates are recalled. The 
connection between U(l) gauge fields defined on noncommuting coordinates and 
fluid mechanics is explained. 

The idea that configuration-space coordinates may not commute 

= (1) 

where 0'^^ is a constant, anti-symmetric two-index object, has arisen recently from 
string theory, but in fact it has a longer history. Like many interesting quantum- 
mechanical ideas, it was first suggested by Heisenberg, in the late 1930s, who 
reasoned that coordinate noncommutativity would entail a coordinate uncertainty 
and would ameliorate short-distance singularities, which beset quantum fields. He 
told his idea to Peierls, who eventually made use of it when analyzing electronic 
systems in an external magnetic field, so strong that projection to the lowest 
Landau level is justified. But this phenomenological realization of Heisenberg’s 
idea did not address issues in fundamental science, so Peierls told Pauli about it, 
who in turn told Oppenheimer, who asked his student Snyder to work it out and 
this led to the first published paper on the subject [1]. 

The coordinate noncommutativity in the lowest Landau level is very similar 
to today’s string-theory origins of noncommutativity - both rely on the presence of 
a strong background field. Also, thus far, it is the only physically realized example 
of noncommuting coordinates, so let me describe it in a little detail [2]. We consider 
the motion of a charged (e) and massive (m) particle in a constant magnetic field 
{B) pointing along the 2 ; direction. All interesting physics is in the x~y plane. The 
Lagrangian for this planar motion is 

L = \m{x^ +ÿ^) + +vAy) - V{x,y) (2) 

where the vector potential A can be chosen as (Q,xB) and V{x,y) describes ad- 
ditional interactions (“impurities”). In the absence of V, the quantum spectrum 
consists of the well-known Landau levels | A", d), where N indexes the level’s energy 
eigenvalue, and d describes the infinite degeneracy of each level. The separation 
between levels is 0{B/m), so that in the strong magnetic field limit only the lowest 
Landau level |0,d) is relevant. But observe that the large B limit corresponds to 
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small m, so projection on the lowest Landau level is also achieved by setting m to 
zero in (2). In that limit the Lagrangian (2), in the chosen gauge, becomes 

LeLe = ^Bxy -V(x,y) . (3) 

This is of the form pq — H(p, q), and immediately identifies ^Bx and y as canonical 
conjugates, leading in the usual way to the commutator 

= ( 4 ) 

[The “Peierls substitution” consists of determining the effect of the impurity by 
computing the eigenvalues of V{x^y), where x and y are noncommuting.] 

For another perspective, consider a calculation of the lowest Landau level 
matrix element of the [x, y\ commutator. 

(0, d\xy - yx\Q, d') = M{d, d') - M*{d', d) (5) 

where 

M(d,d') = (0,d|x2/|0,d') . (6) 

We evaluate (6) by inserting intermediate states in product xy: 

M{d,d') = ^(0,d|æ|s)(s|2/|0,d') . (7) 

S 

If the sum is over all the degenerate Landau levels, then one finds that (5) vanishes: 
X and y do commute! But if one pretends that the world is restricted to the lowest 
Landau level and includes only that level (with its degeneracy) in the intermediate 
state sum 

M,m = ^<0, d|x|0, d")(0, d”\y\0, d') (8) 

d" 

one finds that in this truncated state space, eq. (7) becomes consistent with (4): 

T) a 

(0,d|[x,j/]|0,d') = (0,d|0,d') . (9) 

Let me now return to the general and abstract problem of noncommuting 
coordinates. When confronting the noncommutativity postulate (1), it is natural 
to ask which (infinitesimal) coordinate transformations 

= -f\x) (10) 

leave (1) unchanged. The answer is that the (infinitesimal) transformation vector 
function P{x) must be determined by a scalar f{x) through the expression [3] 



r{x) = o^^djf{x) . 



( 11 ) 
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Since dip(x) = 0, these are recognized as volume- preserving transformations. 
[They do not exhaust all volume preserving transformations, except in two dimen- 
sions. In dimensions greater two, (11) defines a subgroup of volume- preserving 
transforms that also leave 0^^ invariant.] 

The volume-preserving transformations form the link between noncommuting 
coordinates and fiuid mechanics. Since the theory of fluid mechanics is not widely 
known outside the circle of fluid mechanicians, let me put down some relevant facts 
[4]. There are two, physically equivalent descriptions of fluid motion: One is the 
Lagrange formulation, wherein the fluid elements are labeled, first by a discrete 
index n: Xn{t) is the position as a function of time of the nth fluid element. Then 
one passes to a continuous labeling variable n x : Xn{t) X{t,x), and x 
may be taken to be the position of the fluid element at initial time X{0,x) = 
X. This is a comoving description. Because labels can be arbitrarily rearranged, 
without affecting physical content, the continuum description is invariant against 
volume- preserving transformations of æ, and in particular, it is invariant against 
the specific volume- preserving transformations (11), provided the fluid coordinate 
X transforms as a scalar: 

6fX = = e^^diXdjf . (12) 

The common invariance of Lagrange fluids and of noncommuting coordinates 
is a strong hint of a connection between the two. 

Formula (12) will take a very suggestive form when we rewrite it in terms of 
a bracket defined for functions of x by 

{0,{x),02{x)} = 0^^^^Ol{x)^JO2{x) . (13) 

Note that with this bracket we have 

{x\x^}=0^^ . (14) 

So we can think of bracket relations as classical precursors of commutators for a 
noncommutative field theory - the latter obtained from the former by replacing 
brackets by —i times commutators, à la Dirac. More specifically, we shall see 
that the noncommuting field theory that emerges from the Lagrange fluid is a 
noncommuting U(l) gauge theory. 

This happens when the following steps are taken. We define the evolving 
portion of X by 

X\t,x) = x^ -\-e^^Aj{t,x) . (15) 

(It is assumed that 0^^ has an inverse.) Then (12) is equivalent to the suggestive 
expression 

SfA = difA{Aj} . (16) 

When the bracket is replaced by (— i) times the commutator, this is precisely the 
gauge transformation for a noncommuting U(l) gauge potential Ai. Moreover, the 
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gauge field Fij emerges from the bracket of two Lagrange coordinates 

{X\ X^] = + e^^O^^Fmn (17) 

F mn ~ ^n-^m “1“ '{'^m ? } • (1^) 

Again (18) is recognized from the analogous formula in noncommuting gauge 
theory. 

What can one learn from the parallelism of the formalism for a Lagrange fiuid 
and a noncommuting gauge field? One result that has been obtained addresses the 
question of what is a gauge field’s covariant response to a coordinate transforma- 
tion. This question can be put already for commuting, non- Abelian gauge fields, 
where conventionally the response is given in terms of a Lie derivative Lj: 

ôfx^ = -r{x) ( 19 ) 

5fA^ = LfA^ = rOo^A^ + d^rA^ . (20) 

But this implies 

6fF^, = LfF^, = rdaF^, + + d.rF^a (21) 

which is not covariant since the derivative in the first term on the right is not the 
covariant one. The cure in this, commuting, situation has been given some time 
ago [5]: Observe that (20) may be equivalently presented as 

5fAf, = LjAfj, = - d^Aa - i[Aa, A^]) 

+ f°‘d^,Aa - i[A^, r^Aa] + djj,f°‘Aa ( 22 ) 

= + D^irAc) . 

Thus, if the coordinate transformation generated by is supplemented by a gauge 
transformation generated by — the result is a gauge covariant coordinate 
transformation 

ô'fA^ = (23) 

and the modified response of F^jy involves the gauge-covariant Lie derivative 

S'fF^, = L'fF^, = rD^F^, + dJ^F^, + dJ^F^^ . (24) 

In the noncommuting situation, loss of covariance in the ordinary Lie deriva- 
tive is even greater, because in general the coordinate transformation functions 
do not commute with the fields A^^F^i^\ moreover, multiplication of ^-dependent 
quantities is not a covariant operation. All these issues can be addressed and re- 
solved by considering them in the fluid mechanical context, at least, for linear and 
volume-preserving diffeomorphisms. The analysis is technical and I refer you to the 
published papers [3, 6]. The final result for the covariant coordinate transformation 
on the noncommuting gauge potential A^, generated by f^(X)^ is 

s'fA^ = ^{nx),F^^y plus reordering terms. 



(25) 
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Note that the generating function f^{X) enters the anticommutator with covariant 
argument X. is restricted to be either linear or volume- preserving; in the latter 
case there are reordering terms, whose form is explicitly determined by the fluid 
mechanical antecedent. 

Next, I shall discuss the Seiberg- Witten map [7], which can be made very 
transparent by the fluid analogy. The Seiberg-Witten map replaces the noncom- 
muting vector potential by a nonlocal function of a commuting potential and 
of i.e., the former is viewed as a function of the latter. The relationship between 
the two follows from the requirement of stability against gauge transformations: a 
noncommuting gauge transformation of the noncommuting gauge potential should 
be equivalent to a commuting gauge transformation on the commuting vector po- 
tential on which the noncommuting potential depends. Formally: 

l^(a + dA)=l°(“’^)(a) . (26) 

Here A is the Abelian gauge transformation function that transforms the Abelian, 
commuting gauge potential (7(a, A) is the noncommuting gauge function that 
transforms the noncommuting gauge potential A^. G depends on and A, and 
one can show that it is a noncommuting 1-cocycle [8] . 

Moreover, when the action and the equations of motion of the noncommuting 
theory are transformed into commuting variables, the dynamical content is pre- 
served: the physics described by noncommuting variables is equivalently described 
by the commuting variables, albeit in a complicated, nonlocal fashion. 

The Seiberg-Witten map is intrinsically interesting in the unexpected equiva- 
lence that it establishes. Moreover, it is practically useful for the following reason. 
It is difficult to extract gauge invariant content from a noncommuting gauge theory 
because quantities constructed locally from are not gauge invariant; to achieve 
gauge invariance, one must integrate over space-time. Yet for physical analysis one 
wants local quantities: profiles of propagating waves, etc. Such local quantities can 
be extracted in a gauge invariant manner from the physically equivalent, Seiberg- 
Witten mapped commutative gauge theory [9]. 

Let me now use the fluid analogy to obtain an explicit formula for the Seiberg- 
Witten map; actually, we shall present the inverse map, expressing commuting 
fields in terms of noncommuting ones. For our development we must refer to a 
second, alternative formulation of fluid mechanics, the so-called Euler formulation. 
This is not a comoving description, rather the experimenter observes the fluid 
density p and velocity v at given point in space-time (f,r*). The current is pv and 
satisfies with p a continuity equation 

lp + V-(pt;) = 0. (27) 

The theory is completed by positing an “Euler equation” for dv/dt, but we shall 
not need this here. 
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Of interest to us is the relation between the Lagrange description and the 
Euler description. This is given by the formulas 

p{t,r) = J dx6[X{t,x) — r) (28a) 

/ d 

àx—X{t,x)5{X{t,x)-r). (28b) 

(The integration and the ^-function carry the dimensionality of space.) Observe 
that the continuity equation (27) follows from the definitions (28), which can be 
summarized by 

fit, r) = j dr ^^X^‘S{X - r) (29) 

X^ = t 

= 0 . (30) 

The (inverse) Seiberg-Witten map, for the case of two spatial dimensions, can 
be extracted from (29), (30) [3]. (The argument can be generalized to arbitrary 
dimensions, but there it is more complicated [3].) Observe that the right side of 
(29) depends on A through X [see (15)]. It is easy to check that the integral (29) is 
invariant under the transformations (12); equivalently viewed as a function of A, 
it is gauge invariant [see (16)]. Owing to the conservation of [see (30)], its dual 
satisfies a conventional, commuting Bianchi identity, and therefore can be 
written as the curl of an Abelian vector potential a^, apart from proportionality 
and additive constants: 

/ Q 

dx —X^S{X — r) 

. (31) 

diüj — djai + constant oc Sij / dx 6{X — r) = Sij p . 

This is the (inverse) Seiberg-Witten map, relating the a to A. 

Thus far operator noncommutativity has not been taken into account. To do 
so, we must provide an ordering for the (^-function depending on the operator — 
-f 0^^ Aj. This we do with the Weyl prescription by Fourier transforming. The 
final operator version of equation (31), restricted to the two-dimensional spatial 
components, reads 

J dr - djai) = [ J dx - (27r)2<5(fc)j (32) 

where the x integral over an operator {X) dependent integrand is interpreted as a 
trace. Here the additive and proportionality constants are determined by requiring 
agreement for weak noncommuting fields. 

Formula (32) has previously appeared in a direct analysis of the Seiberg- 
Witten relation [10]. Now we recognize it as the (quantized) expression relating 
Lagrange and Euler formulations for fluid mechanics. 
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Functional Renormalization Group 
for Interacting Fermi Systems 



Walter Metzner 



Abstract. We describe a Wick ordered functional renormalization group method 
for interacting Fermi systems, where the effective low-energy action is generated 
from the bare action of the microscopic model by a differential flow equation. We 
apply this renormalization group approach to a prototypical two-dimensional lat- 
tice electron system, the Hubbard model on a square lattice. The flow equation 
for the effective interactions is evaluated numerically on 1-loop level. The effective 
interactions diverge at a finite energy scale which is exponentially small for small 
bare interactions. To analyze the nature of the instabilities signaled by the diverg- 
ing interactions, the flow of the singlet superconducting susceptibilities for various 
pairing symmetries and also charge and spin density susceptibilities are computed. 
This and similar RG calculations by others conclusively establish the existence of 
d-wave superconductivity in the Hubbard model. 



1 Introduction 

Renormalization group (RG) methods provide the most reliable and efficient ap- 
proach to low dimensional Fermi systems with competing singularities at weak 
coupling. Such methods have been developed long ago for one- dimensional systems 
where, combined with exact solutions of fixed point models, they have been an im- 
portant source of physical insight [1, 2]. A major complication in two-dimensional 
systems compared to ID is that the effective interactions cannot be parameterized 
accurately by a small number of running couplings, even if irrelevant momentum 
and energy dependences are neglected, since the tangential momentum dependence 
of effective interactions along the Fermi “surface” (a line in 2D) is strong and im- 
portant in the low-energy limit. This has been demonstrated in particular in a 
1-loop RG study for a model system with two parallel flat Fermi surface pieces [3] . 
In a series of papers Zanchi and Schulz [4] have developed a new renormalization 
group approach for interacting Fermi systems, which is based on a flow equation 
derived by Polchinski [5] in the context of local quantum field theory. In this RG 
version the complete flow from the bare action of an arbitrary microscopic model 
to the effective low-energy action, as a function of a continuously decreasing in- 
frared energy cutoff, is given by an exact differential flow equation. Zanchi and 
Schulz have applied this approach to the 2D Hubbard model (with nearest neigh- 
bor hopping) in a 1-loop approximation, with a discretized tangential momentum 
dependence of the effective 2-particle interaction. The expected [6] presence of an- 
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tiferromagnetism and d-wave superconductivity as major instabilities of the model 
close to half-filling was thereby confirmed. 

The development of continuous renormalization group methods for interact- 
ing Fermi systems has made further progress with a work by Salmhofer [7] . By ex- 
panding the effective action in Wick-ordered monomials instead of bare monomials 
he obtained an exact flow equation for the effective interactions with a particu- 
larly convenient structure: The /^-function is bilinear in the effective interactions 
and local in the flow parameter, i.e. it does not depend on effective interactions at 
higher energy scales. 

Here we briefly review the Wick ordered functional renormalization group and 
discuss its application it to a prototypical two-dimensional electron system, the 
Hubbard model [9] with nearest and also next-nearest neighbor hopping amplitudes 
on a square lattice. We will focus on the electron density regime near half-filling, 
where the Hubbard model serves as a model for the copper-oxygen planes of high- 
temperature superconductors [10]. The flow of effective 2-particle interactions is 
evaluated on 1-loop level, neglecting the irrelevant energy dependence and also the 
irrelevant normal momentum dependence, but keeping the important tangential 
momentum dependence. The effective interactions diverge at a finite energy scale, 
which is exponentially small for a small bare interaction. To analyze the physi- 
cal nature of the instabilities signaled by the diverging interactions, Salmhofer’s 
flow equations are extended for the calculation of susceptibilities. Charge and spin 
susceptibilities as well as singlet superconducting susceptibilities for various pair- 
ing symmetries are computed. Depending on the choice of the model parameters, 
hopping amplitudes, interaction strength and band- filling, commensurate or in- 
commensurate antiferromagnetism or d-wave superconductivity turn out to be the 
leading instability. The results agree qualitatively with those by Zanchi and Schulz 
[4] (for = 0) , and also with results obtained from a one-particle irreducible ver- 
sion of the functional RG by Honerkamp et al. [8]. 

2 Two-dimensional Hubbard model 

The Hubbard model [9] 

= + . ( 1 ) 

ij ^ j 

is a lattice electron model with quantum mechanical hopping amplitudes tij and 
a local repulsion U > 0. Here and Ci^ are creation and annihilation operators 
for fermions with spin projection crç{î,|.}ona lattice site i, and njo- == A 

hopping amplitude —t between nearest neighbors and an amplitude —t' between 
next-nearest neighbors on a square lattice leads to the dispersion relation 



Ck = — 2^(cos kx + cos ky) — 4t'(cos kx cos ky) 



( 2 ) 
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Figure 1: The Fermi surfaces of the non-interacting 2D Hubbard model with t' —0 
(a) and t' = — 0.16t (b) for various densities. 



for single-particle states. This dispersion relation has saddle points at k = (0, tt) 
and (tt, 0), which lead to logarithmic van Hove singularities in the non-interacting 
density of states at the energy evH = 4t'. 

In Fig. 1 we show the Fermi surfaces of the non-interacting system for various 
electron densities n for the case without next-nearest neighbor hopping (t' = 0) and 
a case with < 0. For a chemical potential // = 6 vh the Fermi surface contains the 
van Hove points. In this case a perturbative calculation of the two-particle vertex 
function leads to several infrared divergencies already at second order in f/, i.e. at 
1-loop level [12, 13, 14]. In particular, the particle-particle channel diverges as log^ 
for vanishing total momentum ki-t-k2, and logarithmically for ki-hk2 = (tt, tt). The 
particle-hole channel diverges logarithmically for vanishing momentum transfer; for 
momentum transfer (tt, tt) it diverges logarithmically if 7^ 0 and as log^ in the 
special case t' = 0. Note that /x = 0 for = 0 corresponds to half-filling {n = 1). In 
this case there are also logarithmic divergences for all momentum transfers parallel 
to (tt, tt) or (tt, — tt) due to the strong nesting of the square shaped Fermi surface. 
For fi 7^ CvH only the usual logarithmic Cooper singularity at zero total momentum 
in the particle-particle channel remains. However, the additional singularities at 
/i = 6vH load clearly to largely enhanced contributions for small |/x— € vh|, especially 
if t' is also small. 

Hence, for small \fi — CvhI one has to deal with competing divergencies in 
different channels. This problem can be treated systematically by the RG method. 

3 Renormalization group approach 

We now briefly review the Wick ordered functional renormalization group ap- 
proach. For details, see Salmhofer [7] and Ref. [11]. 
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Figure 2: Diagrammatic representation of the flow equation for The line with 
a slash corresponds to dD^/dA, the others to all possible pairings leaving m 
ingoing and m outgoing external legs have to be summed. 



A) Wick ordered flow equations 

The infrared singularities are regularized by introducing an infrared cutoff 
A > 0 into the bare propagator such that contributions from momenta with 
kk — A^l < A are suppressed. All Green functions of the interacting system will 
then depend on A, and the true theory is recovered only in the limit A ^ 0. 
The RG equations are most conveniently obtained from the effective potential 
which is the generating functional for connected Green functions with bare 
propagators amputated from the external legs. Taking a A-derivative one obtains 
an exact functional flow equation for this quantity. Expanding on both sides 
of the flow equation in powers of the fermionic fields (i.e. Grassmann variables), 
and comparing coefficients, one obtains the so-called Polchinski equations [5] for 
the effective m-body interactions used by Zanchi and Schulz [4]. Salmhofer [7] 
has pointed out that an alternative expansion in terms of Wick (normal) ordered 
monomials of fermion fields yields flow equations for the corresponding m-body 
interactions with a particularly simple structure (see Fig. 2). The flow of is 
given as a bilinear form of other n-body interactions (at the same scale A), which 
are connected by lines corresponding to the propagator 

D^{k) = , (3) 

^^0 - Çk 

where ^k = ^k — and one line corresponding to dD^{k)/dA. For small A, the 
momentum integrals on the right hand side of the flow equation are thus restricted 
to momenta close to the Fermi surface (see Fig. 3). With the initial condition 
V^o — bare interaction, where Aq = max |^k|, the above flow equations determine 
the exact flow of the effective interactions as A sweeps over the entire energy range 
from the band edges down to the Fermi surface. The effective low-energy theory 
can thus be computed directly from the microscopic theory without introducing 
any ad hoc parameters. The structure of these flow equations is very convenient 
for a power counting analysis to arbitrary loop order [7], and also for a concrete 
numerical solution. 
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Figure 3: The support of the propagator in momentum space 



B) One- loop flow 

To detect instabilities of the system in the weak-coupling limit, it is sufficient 
to truncate the infinite hierarchy of flow equations described by Fig. 2 at one-loop 
level, and neglect all components of the effective interaction except the two- particle 
interaction The effective 2-particle interaction then reduces to the one- 
particle irreducible 2-particle vertex F^, and its flow is determined by F^ itself 
(no other m-body interactions enter). Putting arrows on the lines to distinguish 
creation and annihilation operators one thus obtains the flow equation for F^ 
shown graphically in Fig. 4. Flow equations for susceptibilities are obtained by 
considering the exact RG equations in the presence of suitable external fields, 
which leads to an additional 1-body term in the bare interaction, and expanding 
everything in powers of the external fields to sufficiently high order [11]. On 1-loop 
level one obtains the flow equations shown in Fig. 5. The flow of a susceptibility 
is determined by the corresponding response vertex the flow of which is in 
turn given by F^ and itself. The initial conditions are given by = 0 for the 
susceptibilities and by the bare response vertices for R^. For pairing susceptibilities 
only the particle-particle channel contributes to the propagator pair in Fig. 5, for 
charge and spin density susceptibilities only the particle-hole channel. 

It is clearly impossible to solve the flow equations with the full energy and 
momentum dependence of the vertex function, since F^ has three independent 
energy and momentum variables. The problem can however be much simplified 
by ignoring dependences which are irrelevant (in the RG sense) in the low en- 
ergy limit, namely the energy dependence and the momentum dependence normal 
to the Fermi surface. Hence, we compute the flow of the vertex function at zero 
energy and with at least three momenta on the Fermi surface (the forth being de- 
termined by momentum conservation). On the right hand side of the flow equation 
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Figure 4: Flow equation for the vertex function F"^ in 1-loop approximation with 
the particle- particle channel (PP) and the two particle- hole channels (PH and 

PH’). 




Figure 5: Flow equations for (a) the susceptibilities ^ind (b) the response ver- 
tices in 1-loop approximation. 



we approximate the vertex function by its zero energy value with three momenta 
projected on the Fermi surface (if not already there), as indicated in Fig. 6. This 
projection procedure is exact for the bare Hubbard vertex, and asymptotically ex- 
act in the low-energy regime, since only irrelevant dependences are neglected. The 
remaining tangential momentum dependence is discretized. Exploiting all symme- 
tries (time reversal, spin-rotation, lattice point group) one has to deal with 880 
’’running couplings” for a discretization with 16 points on the Fermi surface, and 
about 5000 for 32 points. Most of the results where obtained for 16 points, but it 
has been checked that increasing the number of points does not change our results 
too much [11]. 
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Figure 6: Projection of momenta on the Fermi surface; discretization and labeling 
of angle variables. 



4 Results 



The flow of the vertex function has been computed for many different model pa- 
rameters t' and U {t just fixes the absolute energy scale) and densities close to 
half-filling [11, 15]. In all cases the vertex function develops a strong momentum 
dependence for small A with divergencies for several momenta at some critical scale 
Ac > 0, which vanishes exponentially for C/ ^ 0. To see which physical instability 
is associated with the diverging vertex function, the following susceptibilities have 
been computed: commensurate and incommensurate spin susceptibilities X5 (q) 
with q = (7r,7r), q = {n — S,tt) and q = (1 — J)(7r,7r), where (5 is a function of 
density [16], the commensurate charge susceptibility Xc((7r,7r)), and singlet pair 
susceptibilities with form factors 



d(k) = { 



1 

^ (cos kx + cos ky) 
^{coskx - cosky) 

sin kx sin ky 



(s-wave) 

(extended s-wave) 
(d-wave dx^-y 2 ) 
(d-wave d^^). 



(4) 



Some of these susceptibilities diverge together with the vertex function at the scale 
Ac- Depending on the choice of U,t' and /i the strongest divergence is found for the 
commensurate or incommensurate spin susceptibility or for the pair susceptibility 
with dx 2 -y 2 symmetry [11, 15]. In Fig. 7 we show a typical result for the flow of the 
vertex function and susceptibilities as a function of A. Only the singlet part (from 
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0.01 0.1 1 

(b) A/t 

Figure 7: (a) The flow of the singlet vertex function as a function of A for 
several choices of the momenta k/?i, kF 2 and k^^ which are labelled according to 
the numbers in Fig. 6. The model parameters are U = t and t' = 0, the chemical 
potential is /i = — 0.02t; (b) the flow of the ratio of interacting and non-interacting 
susceptibilities, x"^/Xo ^ same system. 
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a spin singlet-triplet decomposition) of the vertex function is plotted, for various 
choices of the three independent external momenta. The triplet part of the vertex 
function flows generally more weakly than the singlet part. Note the threshold at 
A = 2|/i| below which the amplitudes for various scattering processes, especially 
umklapp scattering, renormalize only very slowly. The flow of the antiferromag- 
netic spin susceptibility is cut off at the same scale. The infinite slope singularity 
in some of the flow curves at scale A = |/i| is due to the van Hove singularity being 
crossed at that scale. The pairing susceptibility with -symmetry is obviously 

dominant here (note the logarithmic scale). 

Following the flow of the vertex function and susceptibilities, one can see 
that those interaction processes which enhance the antiferromagnetic spin sus- 
ceptibility (especially umklapp scattering) also build up an attractive interaction 
in the d^ 2 _y 2 pairing channel. This confirms the spin-fluctuation route to d-wave 
superconductivity [6]. 

5 Strong coupling problem 

Within the one-loop calculation described above, the renormalized interaction 
always diverges in some momentum channels at a finite energy scale Ac, even if 
the bare interaction U was very small. Hence one is always running into a strong 
coupling problem in the low-energy limit. 

If the vertex function diverges only in the Cooper channel, driven only by the 
particle-particle contribution to the flow, the strong coupling problem emerging 
in the low energy region can be controlled by exploiting Ac as a small parameter 
(Ac is small for a small bare interaction) [17]. The formation of a superconducting 
ground state can then be described essentially by a BCS theory with renormalized 
input parameters. One could also continue the RG flow below Ac down to A — > 0 
by including pairing counterterms into the effective action. This would remove 
the divergence of the vertex function at finite A. In the Hubbard model the pure 
Cooper channel instability is realized for ja ^ €vh at sufficiently small U. In that 
regime one can safely infer superconductivity with a d-wave order parameter from 
the divergence of the one-loop pairing susceptibility! At finite temperature the 
off-diagonal long-range order will of course turn into the quasi long-range order of 
a Kosterlitz-Thouless phase. 

In general the one-loop calculation can produce divergencies of the vertex 
function in various momentum channels, with big contributions from both particle- 
particle and particle-hole diagrams. In the weak coupling limit ^ ^ 0 this happens 
for pL 6vH, that is when the Fermi surface touches the van Hove points. In that 
case different possible instabilities can compete in a complicated way. Besides spin 
density wave and pairing instabilities one has to deal with ferromagnetism (at 
moderate \t' /t\ in the Hubbard model) [18, 19] and Pomeranchuk instabilities of 
the Fermi surface [15, 20] as alternative or coexisting candidates. Honerkamp et 
al. [8] have pointed out that for a sizable t' the behavior of the one-loop flow may 
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also signal the formation of an insulating spin liquid for certain densities close to 
half-filling. 

A complete theory of the effective strong coupling problem characterized by 
strong particle-particle and particle-hole fluctuations has still to be found. For 
weak bare coupling, one may again try to exploit the smallness of the scale Ac 
where strong fluctuations appear, to construct a tractable effective low-energy 
theory. 

What will change, if the bare coupling is big? Since strong particle-particle 
and particle-hole fluctuations appear already at weak coupling close to half-filling 
with a small t', it is not unplausible to conjecture that the Hubbard model might 
exhibit most (if not all) of its low-energy features qualitatively already at weak 
bare coupling. This is in fact the case in one dimension [2]. Of course bigger U 
will lead to larger energy scales for binding phenomena, and also the scales for 
pairing and pair condensation will separate, as the pairs get more tightly bound 
at stronger coupling. 

6 Conclusion 

In summary, the pairing instability obtained from a one-loop RG analysis of the 
two-dimensional Hubbard model conclusively establishes the existence of (1^2 _y 2 
superconductivity in this model, at least at weak coupling. 

A complete analysis of the weak coupling behavior of the Hubbard model, 
especially for a Fermi surface near the van Hove points, can be expected to yield 
further important clues for a better understanding of its low-energy behavior at 
moderate and possibly even at strong coupling. 
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Aperiodic Order and Quasicrystals: Spectral Properties 

Daniel Lenz and Peter Stollmann 



Abstract. We present spectral theoretic results for Hamiltonians associated with 
Delone sets. For a family of discrete models we characterize the appearance of 
jumps in the integrated density of states. For a family of continuum models on 
the set of all Delone sets with suitable parameters we prove that generically purely 
singular continuous spectrum occurs. 



Introduction 

In this paper we continue our study of aperiodic order and quasicrystals with 
results concerning the nature of the spectrum of the associated Hamiltonians. It 
is a sequel to [21, 22, 23, 24]. 

We use the framework of Delone (Delaunay) [5] sets to describe the aperi- 
odic order of positions of ions in a quasicrystal or a more general aperiodic solid. 
This approach is quite common, [1, 15, 16, 17, 28] with an equivalent alternative 
description available via tilings, [6, 7, 9, 10, 11, 18, 27]. We elaborate on this issue 
and introduce Delone dynamical systems in the next section. We also provide some 
necessary background concerning the natural topology on the set of all Delone sets. 
Moreover, we mention the von Neumann algebra of observables that can naturally 
be associated with a Delone dynamical system and an invariant measure on the 
latter. 

With these preparations the stage is set for the study of spectral properties we 
are aiming at. In Section 2 we are concerned with discrete models. We investigate 
the possibility of jumps in the integrated density of states of the Hamiltonians in 
question. This latter quantity is a function N{E) of the energy E that measures 
the number of eigenstates with eigenvalue lying below E. Since this quantity is self 
averaging and typically rather smooth it is at first sight very strange that it can 
exhibit jumps. For the Penrose tiling this effect had already been discovered and 
studied earlier [13, 14]. It turns out that it is based on the fact that the tilings in 
question allow for compactly supported eigenfunctions of the associated laplacian. 
Our results from [12] show two facts: firstly, the Penrose tiling is not exceptional 
in the sense that with a “local modification” any Delone set can be turned into 
one that shows compactly supported eigenfunctions. Secondly, discontinuities of 
the integrated density of states can only appear in that way, provided the Delone 
sets one starts with satisfies certain quite natural complexity conditions. As the 
main input, we use a particularly strong ergodic theorem from [24]. 
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In the last section we announce a new result on spectral properties, this time 
for continuum models. In fact we are able to prove that a certain family of models 
exhibits the spectral behaviour that is conjectured to be typical for quasicrys- 
tal Hamiltonians: purely singular continuous spectrum. To this end we consider 
a complete metric space consisting of all Delone sets with suitable parameters 
and associate a Hamiltonian with each of the sets. Our result then tells us that a 
certain energy region consists of purely singular continuous spectrum for a dense 
subset of the metric space in question. To our knowledge this is the first result 
establishing a particular spectral type for multidimensional Hamiltonians asso- 
ciated to Delone sets. The proof we sketch is based on Barry Simon’s “Alice in 
Wonderland” technique from [26]. 

1 Delone sets and Delone dynamical systems: topology and algebra 

We now sketch how to define a suitable topology on the set of all Delone sets 
following [22] to which the reader is referred for details. Let us start with 
the set of closed subsets of and recall that there is a natural action T of E^ 
on .F(E^) given by TtG = G -\-t. We aim at a topology on that fulfills two 

requirements; the action T should be continuous and two sets that are close to 
each other with respect to the topology are supposed to be such that their finite 
parts have small Hausdorff distance. The latter can be defined by 

dH{KuK2) := inf({6 > 0 : C U,{K2) A K2 C Ue{Ki)} U {!}), 

where Ki,K 2 are compact subsets of a metric space (X, d) and Ue{K) denotes the 
open e- neighbor hood around K. The extra 1 is to deal with the empty set that is 
included in /C(X) := {K <Z X : K compact}. It is well known that (/C(X),d//) is 
complete if (X, d) is complete and compact if (X, d) is compact. 

We use the stereographic projection to identify points x e E^ U {00} in the 
one-point-compact ification of E^ with the corresponding points x E Clearly, 
the latter denotes the d-dimensional unit sphere = {^ e E^^^ : ||^|| = 1}. Now 
carries the euclidian metric p. Since the unit sphere is compact and complete, 
we can associate a complete metric pn on X(§^) by what we said above. 

For F G X(E^) write F for the corresponding subset of and define 

p{F,G) := for F,G e X(E^). 

Although this constitutes a slight abuse of notation it makes sense since F U {00}, 
(7 U {00} are compact in provided F, G are closed in E^. 

We have the following result: 

Proposition 1.1 The metric p above induces the natural topology on F(E^). 
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Denote by Br{x) [Ur{x)) the closed (open) ball in around x with radius r. 

A subset u of is called a Delone set if there exist r{uj) > 0 and R{uj) > 0 
such that 2r{u) < \\x — y\\ whenever x,y G u; with x 7^ y, and Bjk^^^{x) fl cj 7^ 0 
for all X G R^. If 0 < r < r{uj) < R{u) < R we speak of an (r, R)-set and denote 
the set of all {r,R)-sets by T>r,R- From the basic properties of the metric p above 
it follows that T>r,R is a compact, complete metric space, a fact that will be useful 
in the sequel. The set of all Delone sets is denoted by V, 

The convergence of a sequence of Delone sets with respect to the natural 
topology can easily be visualized: 

Lemma 1.2 A sequence {un) of Delone sets converges to u e V in the natural 
topology if and only if there exists for any I > 0 an L > I such that the cOn Pi (7 l(0) 
converge to lü C\Ul{^) with respect to the Hausdorff distance as n ^ 00 . 

We call a closed, translation invariant subset Q C a Delone dynamical 
system^ DDS. Note that compactness and hence completeness of Q follows, since 
^(R^) is compact. A DDS is said to be an (r, R)- system if, furthermore, Q C Dr,R’ 
A further notion is to be introduced: we say that u is of finite local complexity^ 
if only finitely many different (up to translation) pattern of bounded diameter 
occur in a;, i.e., if the set {Ul{0) D (cj — x) : x G cj} is finite for every L > 0. 

We speak of a Delone dynamical system of finite type, DDSF, if this latter 
finiteness condition extends to all of D, i.e. if the set {UL{0)n{uj—x) : x e E D.} 
is finite for every L > 0. 

We often keep the translations in our notation and write (D,T) for a DDS. 
The next topic will concern a natural von Neumann algebra that contains the 
Hamiltonians we are interested in. For details and proofs see [23]. These Hamilto- 
nians are in fact families A = (A^j), indexed by the elements a; of a DDS (f^,T). 
Each Auj acts in the Hilbert space where denotes the counting mea- 

sure on the discrete subset a; C R"^. These spaces are “glued” together by the 
following bundle: 

A = {(ct;, x) G Ç : X G a;} C D X R^. 



Here is the definition: 

Definition 1.3 Let (D,T) he an {r, R) -system and let p be an invariant measure 
on Q. Denote by V\ the set of all f : X — ^ C which are measurable and satisfy 
/(cj, •) G a“") for every lü EÜ. 

A family {A^)ujen of bounded operators A^ : £^(cu,a^) — ^ £^(cv, is called 

measurable if u {f{u),{A^^g){uj))^^ is measurable for all f,gEVi.It is called 
bounded if the norms of the A^j are uniformly hounded. It is called covariant if it 
satisfies the covariance condition 



H^+t = UtH^U*, u>€Q,teR‘^, 
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where Ut : ^ is the unitary operator induced by translation. Now, 

we can define 

J\f{Q,T,p) := {A = covariant, measurable and bounded}/ 

where ^ means that we identify families which agree p almost everywhere. 

Obviously, M{Vt,T,p) depends on the measure class of p only. Hence, for 
uniquely ergodic Af{fl,T,p) =: gives a canonical algebra. This 

special case has been considered in [21, 22]. Apparently, T, p) is an involutive 
algebra under the obvious operations. The following result is taken from [23], where 
we prove it using Connes’ noncommutative integration theory, [3]. 

Theorem 1.4 Let be an [r,R)-system and let p be an invariant measure on 

Cl. Then p) is a weak-*- algebra. 

This algebra carries a particular trace that is related to the integrated density 
of states we will meet later. Now, choose a nonnegative measurable u on with 
compact support and J^^u{x)dx = 1. Letting f{u;,p) := u{p), it can be shown 
that the map 



T : J\f{Cl,T, p) — > C, r{A) = f ti{Au;Mu) dp{u) 

Jq 

does not depend on the choice of / viz u as long as the integral is one. Moreover, 
r is a trace on Af{Cl,T, p); see [23] for details. 

Let us now introduce the C*-subalgebra of J\f{Cl, T, p) that contains those 
operators that might be used as Hamiltonians for quasicrystals. We define 

A Xq A {(p, a;, q) G x Cl x : p, q e cu], 

which is a closed subspace of x Q x for any DDS Cl. 

Definition 1.5 A kernel of finite range is a function k G C{X Xq X) that satisfies 
the following properties: 

(i) k is bounded. 

(ii) k has finite range, i.e., there exists Rk> 0 such that k{p,cu,q) = 0, whenever 

\p-q\> Rk‘ 

(iii) k is invariant, i.e.. 



k{p-\-t,LU-\-t,q-\-t) = k{p,uj,q), 
for (p, (jü,q) G X x^^ X and t G R^. 

The set of these kernels is denoted by JC^^'^{Cl,T). 
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We record a few quite elementary observations. For any kernel k G 
denote by tt^/c := the operator K^^ G induced by 

(Ku;Sq\ôp) := k{p,u,q) for p,q eu. 

Clearly, the family K := tt/c, K = (Kaj)ujen, is bounded in the product (equipped 
with the supremum norm) Thus, it belongs to AA(i7,T, //). Now, 

pointwise sum, the convolution (matrix) product 

(fci ■k2){p,uj,q) := '^ki{p,uj,x)k2ix,u),q) 



and the involution k* (p, u,q) := k{q,u, p) make (0, T) into a *-algebra. Then, 
the mapping tt : /C-^^^(Q,T) — > is a faithful ^-representation. We 

denote := 7t(/C'^^^(Q, T)) and call it the operators of finite range. 

This gives a subalgebra of fi), as can easily be seen. The completion of 

with respect to the norm \\A\\ := sup^^^^ II^cjII is denoted by A(fl,T). 
This again is a subalgebra of A/*(Ü, T, ju). Moreover, it is not hard to see that the 
mapping tt^j : v4-^^^(0,T) ^ B{£‘^{üj)),K is a representation that extends 

by continuity to a representation of A{fl,T) that we denote by the same symbol. 

2 Integrated density of states and its discontinuities 

In this section, we first relate the abstract trace r defined above to the mean 
trace per unit volume. The latter object is quite often considered by physicists 
and bears the name integrated density of states. We then study discontinuities of 
the integrated density of states and characterize their occurence by existence of 
locally supported eigenfunctions. 

The proper definition of the integrated density of states rests on ergodicity; 
we need the notion of a van Hove sequence of sets. For s > 0 and Q C we 
denote by dgQ the set of points in whose distance to the boundary of Q is less 
than s. A sequence (Qn) of bounded subsets of is called a van Hove sequence 
if \Qn\~^\dsQn\ — ^ 0, n — > 0 for every s > 0. The following result from [23] 
establishes an identity that one might call an abstract Shubin’s trace formula. It 
says that the abstractly defined trace r is determined by the integrated density of 
states. The latter is the limit of the following eigenvalue counting measures. Let, 
for selfadjoint A G A{Q..T) and Q C 

{p[A^,Q\,q}) := |htr((^(^„lQ)),<^ e C(K). 

Its distribution function is denoted by n[A^^,Q\, i.e. n[A^^,Q]{E) gives the number 
of eigenvalues below E per volume (counting multiplicities). 
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Theorem 2.1 Let {^,T) be a uniquely ergodic (r, R)-system and fi its ergodic prob- 
ability measure. Then, for selfadjoint A G A{^,T) and any van Hove sequence 
{Qn), 

{p[A^,Qn], (f) r{(f{A)) as n-^ oo 

for every cp G C(R) and every a; G Consequently, the measures p[A^^] converge 
weakly to the measure pA defined above by (pa^t) every cj G fî. 

Remark 2.2 This generalizes results in Kellendonk [9] (see [7, 8] for related ma- 
terial as well) and is an analog of results of Bellissard [2] in the almost periodic 
setting. The proof is based on ideas from these works. 

For a special class of operators we can actually say more. This class is defined 
next. It includes all Hamiltonians based on a next neighbor Laplacian and a locally 
determined potential. 

Definition 2.3 Let {^^T) be a DDSF. A finite range operator A = {A^) is called 
locally constant if there exists a constant r a such that A^{x,y) = A^^^{x\y') when- 
ever {Br^ (x) U Br^ {y)) /\u) = t-\- (Br^ (x') U Br^ (y')) A Lo' for some t G R^. 

For such operators we can characterize the appearance of a discontinuity in 
the integrated density of states p^ (see [12] for details and proofs). 

Theorem 2.4 Let (fl, T) be a strictly ergodic DDSF. Let A be a locally constant 
finite range operator. Then E is a point of discontinuity of p^ if and only if there 
exists a locally supported eigenfunction of A^ to E for one (all) uo e D. 

Remark 2.5 Let us emphasize that locally supported eigenfunctions do exist for 
DDSF (see [12] and the references given in the introduction). Thus, the theorem 
gives that the integrated density of states is not continuous for DDSF. This con- 
trasts with what is known for one dimensional operators as well as for random 
operators. On the other hand the theorem also tells us that these discontinuities 
can only arise in a certain way which is linked to the geometry of the Delone sets 
in question. 

3 Generic purely singular continuous spectrum 

In the sequel we consider Hamiltonians in L^(R^) that can be considered for the 
description of aperiodic solids. We fix a compactly supported, bounded / < 0 that 
gives the attractive potential of an ion. If these ions are distributed according to 
the sites of a Delone set uj we arrive at 



tecü 
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Since we consider a fixed / 0 as above we omit it from the notation. We now fix 

0 < r < iZ in such a way that there exists lattices 7, 7 E with the property 
that the corresponding Hamiltonians and H{^) satisfy: 

a := inf a | — A + — ^ •— inf a I —A + /(• — t) 

\ tej / \ teÿ 

Our aim is the following result; details will appear elsewhere [25]. 

Theorem 3.1 There is an nonempty open interval I and a dense Gs-set Clgc such 
that for all u E Qsc the spectrum of H {to) contains I and is purely singular con- 
tinuous there. 

Let us stress that singular continuous spectrum is in fact what one expects 
for quasicrystals. One reason is that the latter are in between highly disordered 
media (for which pure point spectrum is typical) and ordered media (for which 
absolutely continuous spectrum is characteristic). Moreover, there are by now a 
number of rigorous results in this direction for one-dimensional operators, starting 
from [29]; references to more recent papers can be found in the survey papers 
[4, 30]. We are not aware of any result concerning higher dimensions, however. 

Of course, a more realistic Hamiltonian would allow for a finite number of 
different ions. The framework of colored Delone sets as considered in [23] provides 
the appropriate notions. 

The main tool to prove the above result is the following Theorem 2.1 from 
[26] that we include for the readers convenience. 

Theorem 3.2 Let X be a regular metric space of self-adjoint operators. Suppose 
that for some interval (a, h), we have that 

(i) {A E X\ has purely continuous spectrum on (a,^)} is dense in X. 

(ii) E X\ has purely singular spectrum on (a, 5)} is dense in X . 

(iii) {A E X\ has (a, 6 ) in its spectrum} is dense in X. 

Then {A\{a^ h) C cf{A)^ (a, h) fl cFpp{A) = 0, (a, b) fl CTac{A) — 0} is a dense Gs^ 

Clearly, the conclusion of this latter theorem is exactly the assertion of our The- 
orem 3.1. Thus, we have to associate the right space of operators with Vr^R and 
verify properties (i)-(iii) from the preceding theorem. 

Of course, the appropriate space is 

Xr,R = {H{uj)\uj e 

equipped with the metric from Vr,R. Metrizability and completeness have already 
been discussed in Section 1 ; they follow from the results in [22]. Moreover, one can 
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see that convergence with respect to the natural topology implies strong resolvent 
convergence of the respective operators so that is a regular metric space of 
self-adjoint operators in the sense of [26]. 

Let us finally sketch how to prove property (iii). Recall the lattices 7,7 from 
above. This gives the energy interval (a, h) in the following way: We take a, 6 as 
above and define h := mm{b,b*} where b* is the upper edge of the spectral band 
of — A + Yltej ~ starts at a, i.e., 

b* := sup{A|[a, A] C a + ^ /(• - i) j }■ 

We can now use suitable perturbations of 7 to see that (iii) is fulfilled. In fact, given 
^ ^ ^r,R find ujn that coincides with u in the box centered at 0 with sidelength n 
and coincides with 7 outside the box centered at 0 with sidelength 2n. By Lemma 
1.2 above we get convergence ^ a; in the natural topology. Moreover, the 
essential spectrum of H(ujn) coincides with the essential spectrum of H{uj)^ hence 
[a, b] C a {H{(jJn)) for all n G N. 
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Cosmological constant problem #1 Theoty-2002 
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Epidemic Thresholds on Scale- Free Graphs: 

the Interplay between Exponent and Preferential Choice 

Ph. Blanchard, C.-H. Chang and T. Kriiger 

Abstract. We study epidemic threshold properties in a scale-free random graph 
model. We show via a branching process approximation that the divergence of the 
second moment of the degree distribution equivalent to the absence of an epidemic 
threshold. We study further the relation between diameter and epidemic threshold. 
Absence of an epidemic threshold happens precisely when a positive fraction of the 
nodes form a cluster of bounded diameter. 

1 Introduction 

In the last decade there was an enormous increase of interest in phenomena which 
show for characteristic observables a power-law behavior. Power-law distributions 
are often called scale- free distributions due to the absence of characteristic sizes. 
Most prominent under the huge variety of examples became systems which show 
so-called selforganized criticality (SOC), [3] [4]. The basic idea to explain the ap- 
pearance of a power-law-distribution is via the linkage to phase-transitions where 
the appearance of scale-free structures is known since a long time. Despite many 
efforts only little is known about the origin of power-law-distributions on a math- 
ematical base. Partially due to the increasing knowledge of the structure of the 
internet there is a growing number of articles studying the structure of networks 
with a power law distribution for the degree (see [1] and the references therein). 

In this article we want to discuss a specific model of scale- free-graphs based on 
a partner choice strategy and its epidemic threshold properties. Related questions 
found recently much interest due to the possible absence of epidemic thresholds 
for a certain range of power law exponents [8] [9]. We were mainly motivated to 
our studies by the paper of Pastor- Sat orras and Vespignani [9], where on largely 
heuristic grounds the absence of an epidemic threshold for power-law-graphs with 
exponents less than 3 was claimed. Since the argumentation in this paper was quite 
doubtful from a mathematical point of view but somehow intuitively convincing 
we wanted to study the same question on more solid grounds. To get enough the- 
oretical control we developed a new model of power- law-graphs which seems to 
be of independent interest. Due to the strong independence properties - ais far as 
this is possible for a scale-free degree-distribution - we were able to carry out a 
rigorous analysis of certain aspects of the model. For a certain range of param- 
eters we get a power-law distribution for the total degree. For other parameter 
values there is an interesting domain where one has a fat tail degree distribution 
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with gaps. Here the integrated tail distribution is still of power-law form. In both 
cases the divergence of the second moment of the degree distribution is indeed 
equivalent to the absence of an epidemic threshold. We furthermore examine the 
relation between divergence of the second moment of the degree distribution and 
diameter-like properties of the relevant graph-spaces. Namely one has in the case 
of absence of an asymptotic epidemic threshold the striking property that there 
is a huge (meaning a positive fraction of the whole population) cluster of finite- 
size independent- diameter. This clusters carries the epidemics and forms a kind of 
very small world inside the small world of the largest connected component which 
can still be of logarithmic diameter. 

We further study some models with completely different degree distribution 
where the absence of an epidemic threshold is not necessary linked to the diver- 
gence of the second moment of the degree-distribution. 

2 A preferential-choice model of scale-free graphs 

In the following we want to describe the basic structure of the model we consider 
in this paper. Our first aim is to construct a family of static random graph models 
in which vertex degrees are distributed power-law like, while edges still have high 
degree of independence. As usual in random graph theory we will entirely deal 
with asymptotic properties in the sense that the graph size goes to infinity. 

We consider graphs with vertex set V = Vn — n} where an edge 

between the vertices x and y (denoted by x ^ y) is interpreted as a persistent 
contact between the two nodes. Given x G V, its degree is denoted by d{x). We 
will think of edges as generated by a pair-formation process in which each vertex x 
- often denoted as an individual chooses a set of partners according to a specified 
x-dependent rule. Therefore the set of individuals which have contact with a given 
vertex x can be divided into two -possible non-disjoint sets: the set of nodes which 
are chosen by x himself and the set of nodes which have chosen x as one of their 
partners. We call the size of the first set the outdegree dout(x)oî x and the size 
of the second one the indegree din{x) of x. Obviously d{x) < dout{x) + din{x) and 
if the choices are sufficiently independent one can expect equality to hold almost 
surely if n oo. 

We partition the set of vertices Vn into groups {Ci{n)}-^^ where all members 
of a group Ci{n) choose exactly i partners by themselves (dont = i on Ci{n)). Let 
(n, j) the probability for x to choose a fixed partner y G Cj (n) if n partners 
are available for the choice and just one choice will be made be 

Pa (n,j) = Aa{n) ^ . (2.1) 

Here Aa{n) is a normalization constant such that Aa{n) (Si>i l^i(”)l^) = 1 
and a is a real parameter. Since we want Aa{n) A^,as n ^ oo we need 
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tc) be bounded as a function of n which will impose later on con- 
straints on the constant a. The parameter a acts as an affinity parameter tuning 
the tendency to choose a partner with a high out degree or low out degree. If o = 0, 
choices are made without any preferences and Aa{n) = 1. For a > 0 the “highly 
active” individuals are preferred whereas for a < 0 the “low activity” individuals 
are favored. From this we obtain the basic probability 

2 ’ 

Pr[ X from Ci (n) has chosen y G Cj (n) as one of his partners] ~ Aa{n) 

Concerning the size of the sets Ci{n) we will make the following assumption: 



\Ci{n)\ 

n 



=: Pi (n) 

n-^oo 



C' 



( 2 . 2 ) 



With this choice we have to impose the restriction a < 7 — 1 to ensure 
the convergence of A^ (n). We require furthermore 7 > 2 throughout the paper 
since otherwise the expected indegree for individuals from a fixed group would 
diverge. The basic probabilities together with the fixed outdegree-distribution de- 
fine a probability distribution on each graph with vertex-set W, and therefore a 
random graph space Gn (<^,7). 

We want to investigate the threshold properties of epidemic processes on 
these random graphs as a function of the parameters 7 and a. Here we are mainly 
interested in the question whether the epidemics is asymptotically {n 00) sub-or 
overcritical. For the threshold study in this random graph model it is enough to 
investigate the epidemic process as a multi-group branching process since typical 
graphs have locally a tree structure. Absence of a threshold means here that for 
arbitrary small transition probability /i > 0 one still gets for n 00 an overcritical 
epidemic process. In section 3 we will compare the results for the above model 
with some specific graph models which have no power-law-distribution but still 
diverging second moments of the degree distribution. 

First we want to compute the important pairing probabilities which will be 
used frequently. We start with the easier case o = 0, 

Pr(x ~ 2/ I .T e Ci;2/ 6 Cfc) = • (2.3) 

n n^oo n 



Completely similar one can compute the corresponding probabilities for a ^ 
0. Dropping the simple details we just state the result: 



Pr (a: ~ 2/ I a; 6 Ci ; 2/ € Cfc) ~ as n ^ oo. (2.4) 

It turns out, that the typical graphs in this model still have for a < 2 a 
power-law distribution for the degree with an exponent which can be different 
from the exponent of the outdegree. For a > 2 we obtain a degree distribution 
which follows in mean a power law but has gaps. To compare both domains we 
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will use the integrated tail distribution Fk Fr{d{x) > k) . We will show that 
in both cases we get the same integrated tail distribution. Since we are interested 
in the dependence of the epidemic threshold from the power-law exponent of the 
total degree distribution we have to analyze how this exponent varies with the two 
parameters a and 7. Since the partner choice is sufficiently random and not too 
strongly biased toward high degree individuals (that’s the meaning of the condition 
q; < 7 — 1) it is easy to see that the indegree distribution of a vertex from group 
Ci converges for n — > oc to a Poisson distribution with mean const • There 
are essentially two regimes in the parameter space, one for which the expected 
indegree is of smaller order than the outdegree over all groups and one where 
the indegree is asymptotically of larger order. In the first case it is clear that 
the indegree is too small to have an effect on the degree distribution exponent. 
In other words: the set of individuals with degree k consists mainly of individuals 
whose outdegree is of order k. An easy estimation using the formula for the pairing 
probabilities shows that the expected indegree of individuals from group i is given 
by E{din (x) \ X e Ci) ~ const • asymptotically. Therefore the indegree is of 
smaller order than the outdegree if a < 1 . In the case 7 — 1 > a > 1 the set 
of individuals with degree k consists mainly of individuals from groups with an 
index of order k^ . Furthermore we will show that the condition for absence of an 
epidemic threshold is precisely given by the divergence of the second moment of 
the degree distribution. 

We will state now the main result of the paper which will be shown in the 
next sections. 

2.0.1 Main Results: 

a) for a < 1 the random graph On (a, 7) hats a power-law distribution for the 
degree with exponent 7 

b) for 1 < Of < min (2; 7 — 1) the random graph Gn{<^^'y) has a power-law 
distribution for the degree with exponent 7^ = 1 + J ~ ~ 

c) for 2 < o < 7 — 1 the random graph On (a, 7) has a power-law integrated tail 
distribution with exponent 7 _ i 

d) there is an epidemic threshold for the parameter set {o; < ^^^,7 > 3} and 
absence of an asymptotic threshold on the complement 

e) for a > or 7 < 3 the second moment of the degree distribution diverges. 

2.1 The degree distribution and second moment estimation 

In this paragraph we want to estimate the degree distribution, respectively the 
integrated tail distribution for the parameter domains 1 < < 2 and 2 < a < 7 — 1 

and give conditions for the divergence of the second moments. We first note that 
for a > 1 the indegree dominates the outdegree in every group Q. Furthermore 
for n 00 and any groupindex i the indegree distribution converges to a Poisson 
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distribution with expectation A (i) = const • Since the variance is of order it 
is clear that for a > 2 and large i there will be essential no overlap between the 
indegree distribution in group i and group i -f 1. 

We first deal with the indegree distribution for a < min (2; 7 — 1) : 



Vr{din{x)=^k) ^Y^¥v{din{x) = k\x £Ci) ( 2 . 5 ) 

i 

= (2.6) 

“A;! i'y ^ ^ 



The essential interval of indices i which contribute for sufficiently large k to the 
sum is given by the condition k G — a {k)i^ ; const • + a {k) z ^ "^2 ^ j 

where a{k) is a slowly growing function in k (for instance log A: is fine). Solving 
for z and denoting the above constant by A the we get for the boundaries of the 
essential index interval less {^) •= [^min {k) ;Zmax (A^)] • 



^min (A^) • 
^max (Aî) : 

For the length \Iess (A^)| of the 



a (k) 



_l_ / (fc) ^ 
A-2 - 4 À 



a {k) 



+ 



{k) k 
.4 A 



^A-2 

essential interval we get therefore: 



( 2 . 7 ) 

( 2 . 8 ) 



l^ess (A^)| — ^max {k) Zjjiin {k) — 





(2.9) 



For a < 2 it follows that less (k) C\ less (A^ + 1) 7^ 0 for all A: sufficiently large. In 
contrast to this we have for a > 2 the situation that certain k- values do essentially 
not appear since \Iess {k) \ converges to zero as k tends to infinity and less (k) has to 
contain an integer to give a contribution. We estimate now the sum in the indegree 
distribution for large k by restricting the summation to the essential intervals: 



Fv{din (x) = k) 



E 

ieless{k) 




Cl 

■ 



( 2 . 10 ) 



Using Stirlings formula A;! = (27 tA:) ^ (1 -h o (1)) and the approximation 

(1 + l)^ = for 1^1 k^ we obtain: 



Pr {din {x) = k) 



2a {k) i_i l-fo(l) 

— --k^ 2 . ^ 

^ (27tA:) 2 

1 



( 2 . 11 ) 



k^ + i-i+od) • 



( 2 . 12 ) 
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We get therefore an asymptotic power law exponent 7 ' = 1 + for the indegree 
distribution which is also the exponent of the total degree distribution in the case 
a > 1 since then 7' < 7 and hence the indegree dominates the outdegree. 

It is much more easy to estimate the integrated tail distribution. We will do 
this again for the indegree distribution and a > 2 to be able to compare with the 
situation for smaller values of a. Clearly one has 



Fk = Pr {din (x) > k) = (x) = Z I x e Ci) (2.13) 

i x^Ci l>k 

~ Pr (din {x) = l\ x eC^)^ . (2.14) 

i l>k 



Since for the degree I > k only vertices with indices i > 

contribute and in this case the sum (x) = I \ x E Ci) 

we get 



(l + o(A:i))j 
is essentially 1 , 



Fk 



(din (x) = I \ X e Ci) ^ 



^ E ^ 

i> ^ 1+o^A: ) J 



Cl 

c 



Cl 

ki (i + o())J" 



Cl 

^1 + 0 



(2.15) 

(2.16) 
(2.17) 



Therefore the integrated tail-degree distribution is still of power-law form. 

Next we want to estimate the second moments of the indegree distribu- 
tion for values 1 < a < 7 — 1. By definition we have 

y] ( 2 . 18 ) 

k i x£Ci 

= EE^'Pdd(a:) = fc|:reC'0| (2.19) 

k i 



and asymptotically since the second moment of a Poisson distribution with expec- 
tation A is given by A^— A : 






E £i 



{M-y 



■Ai^ 




const 

il-2a 



const 

i^-ot 



(2.20) 

( 2 . 21 ) 



The first term in the sum diverges for a > which is of course also the condition 
for the total degree second moment to diverge (in the case a < 1 the outdegree 
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dominates the indegree and the second moment diverges for 7 < 3). We show in 
the next paragraph that this is precisely the condition for absence of an epidemic 
threshold. 

2.2 The branching process approximation 

In the following we want to analyze the branching process approximation for the 
epidemic process on the above described random graph space Gn (ct? t)- Due to the 
absence of local cycles (in the large n limit) one gets an exact threshold estimation 
via the branching process approximation. Let be the transmission probability 
that an infection will be transmitted along a given edge of the graphs. Eliminating 
edges in Qn (a, 7) with probability 1 — ^ gives a new random graph space Gn (<^5 7) 
which we call the epidemic random graph. We will denote expectations with respect 
to Gn 7) by whereas E is reserved for expectations with respect to Gn (ct, 7)- 

Let T (a,j,n) = (aij (a,n)) be given by 

Gij (a, n) :=E^(H{y : X I X e Q (n),y e Cj (n)) (2.22) 

= y -E(tl {y : X yj I X e C^(n) ,y e Cj (n)) (2.23) 

and let aij (a) := lim^i-^oo ciij {a, n ) . Note that T (a, 7, n) is the transposed of the 

branching process transition matrix. In other words a^j {a) is the expected number 
of infected individuals in group j generated by one infected individual in group 
i. Actually the exact terms in the asymptotic branching process matrix are given 
by üij (a) — 1. Since the difference by one is irrelevant for our considerations we 
neglect this term in the computations. Prom formula (2.4) we conclude that 

üij {a, n) ^ Cl -fi- (2.24) 

where the constant C\ depends only on a and 7. Absence of an epidemic threshold 
for fixed a and 7 holds exactly if 

lim Amax {T (a, 7, n)) = 00 for > 0 (2.25) 

n— ’•CO 

with Amax (') being the largest eigenvalue in modulus. 

The regions in the parameter space for which we will get absence of an asymp- 
totic epidemic threshold are 1) 2 < 7 < 3 and arbitrary a and 2) the parameters 
with 7 > 3 and a > Finally we get for 7 > 3 and a < that there is a 
size independent threshold. 

Since the smallest row sum of a positive matrix gives a lower bound on the 
largest eigenvalue it is enough to show that for {bik (n)) := T ^ (a,j,n) and a and 
7 as in case 1) or 2) we have 



min bik (n) oc. 

i ^ n—^00 

k>\ 



( 2 . 26 ) 
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Note that aij (a, n) ~ atj (a) and therefore 
f>ik {n) 






ik^ -f 



= if^c.r 



, 2 /lk°‘ +l°ky^i‘^+^ 



• 2 a laua 



k^ ^ 



From this formula one immediately concludes the absence of an epidemic threshold 

in the above described domains since ^ ^ respectively ^ ^ diverge for 7 < 3 

i i 

respectively a > whereas the divergence of ^ is irrelevant since 
> 7 - 2 for 7 > 3 . 

To show the existence of an asymptotic threshold for parameters satisfying 
7 > 3 and a < we will demonstrate that for /i > 0 we have lim Am ax 

(T’^ (a, 7, n)) < 00 from which the assertion then follows. Let {gj^ {n)) T 
^ We will derive a recursion for (n) as follows. Since 



gr,{n)c,(tkC,r E 

f'm 



12'i'i i\^2 i\k^ kii 

il k'y 



we define variables 



g]^{n,x,y) := {fiCi)'^ E 



iZi il k^ 



and observe that 



(«- 2;, y) = yCi ■ 5, (n, x', y') 



x' = Bç)X + Biy ; y' = B2X + Boy ( 2 . 34 ) 

*2 

and Bo = Y 1 V“> ^1 = E V -^2 = E Since 5)^ (n, A:“, k) ~ (n) we 



< 1 to establish the boundedness 



only have to show that /iCi • Amax ( ) < 1 to establish the boundedness 

[\ ^2 ^oj 

of Amax (o, 7, ^))- Clearly if Bq, B\ and B2 stay bounded as a function of n we 
can match the requirement of the last sentence. But Bo, B\ and B2 stay bounded 
on the parameter set 7 > 3 and a < from which our assertion follows. 

The result has a nice heuristic explanation. The “tail” -groups (that is the 
groups Ci with i very large) act as a hub for the infection. Indeed given an infected 
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individual in group Ck it spreads the infection into the hub via it’s outdegree edges 
(for a large) from which the infection is backspread via the indegree edges of the 
hub to Ck or any other fixed group. It is precisely this infection path which is 
proportional to ^ and therefore makes the branching process divergent for 

a > 2'y 1. 

3 The very small world inside the small world 

In the following we will show that the distinction between absence and pres- 
ence of an epidemic threshold can be exactly related to properties of the di- 
ameter of the giant component of Of course the diameter 

of a random graph space is itself a random variable but it turns out that it 
has a small variation. For our purpose we therefore concentrate on the expected 
diameter. Given a graph Gn with n vertices and a subset .A of we define 
diam{Gn,A) \= max{d(a:, t/) \x,y^A] where d{x,y) is the distance on the 
graph Gn. Let the essential diameter of a graph Gn be defined as 

diarrff^ (Gn) := nun {diam (Gn,A) \ \A\ > (1 — e) \Vn\} • 

Similar if we have a sequence of random graph spaces {Gn} the quantity 
diaml^^ (Gn) is now a random variable depending on the realization Gn ^ Gn- 

The striking phenomenon in the case of absence of an epidemic threshold for 
the above described parameter region is the following property: 

For all s G (0, 1) there is a C (s) > 0 such that Pr {diam^^^ (Gn (<^?7)) < 
C (e)} 1 as n tends to infinity. 

In other words: A positive fraction of the vertices are located in a cluster 
of finite diameter. This cluster has a kind of fuzzy, onion like structure with no 
sharp boundary. At the “center” of the cluster are the very high degree groups 
placed which are entirely contained in the cluster. The union of shells which are 
at distance less than R from the center has positive mass as soon as R gets larger 
than a critical value Gq. Increasing R makes this fraction larger and larger but to 
get the whole giant connected component in the cluster requires to take R oo 
as n goes to infinity (something between log log n and logn should be the right 
growth rate). 

Before going into the details let us recall the general philosophy how to esti- 
mate the diameter of a (large) connected component of a random graph space. Fix- 
ing an individual x one tries to estimate the expectation ofTk := ^{y : d (x, y) = k} 
as good as possible. As long as <C n holds and the variation of the F^ 

is under control one can use in general a branching process approximation to get 
the right order of the expectation of F^. The smallest ko such that '^i^ko Ti ~ n 
gives then a very good estimation of diam^^^ (for ^ small). 

To see that the above statement is true let us note first that iterating the 
branching process matrix A:— times -denoted in the following by and being 
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the transposed of {a, 7) — gives the expected size of the number of individu- 
als in group i which are at distance A; to a random chosen individual in group j 

(k') 

provided that all expectations much smaller than \Ci (n)| for k' < k. 

3 

Assume now that the maximal outdegree respectively the largest group index in 
our model scales like for some 0< 0 < ^ and hence the size of the group 
with the maximal outdegree is of order const • . Since the dominating con- 



tribution in the terms comes from | ) we can safely iterate the 

branching process matrix /c— times as long as A:/? (1 — 7 -h 2a) <1 — 13^ (note 
that in the “no-threshold” case we always have 1 — 7 -h 2o; > 0). Let k^ be the 
largest k for which the above inequality holds. To get a good lower bound on the 
number of individuals at distances larger then ko one has to truncate the matrix 
properly in such a way that only paths are taken into account which make no 
use of groups which are smaller than the entries in the iterated branching pro- 
cess matrix. Therefore we consider in step A:q -|- A: only those groups Ci for which 
i < with 0k •= (i_^_^2aKA;o+fc)+7 ‘ leading term after A;q + A; iterations is 



now given by 7+2a) 7+2») _ ^ 

i<k 

Since the sum in the exponent diverges there is a size independent ki such that 
72 ^ 0 / 3 ( 1 - 7 + 20 ;) ^/ 3 / (i-7+2o;) ^ ^ Consequently a finite fraction of the vertex 



l<ki 

set stays within a bounded distance. 

Let us finally demonstrate that there cannot be a C* such that essentially 
all vertices of the giant component are within a distance less than C*. Fix / > 0 
and an individual x with d{x) < /. From formula (2.4.) it follows that there is 
a n independent p{f) > 0 such that the probability that x has only partners y 
with d{y) < f is larger p (/) > 0. Therefore the probability that at distance less 
equal C* from x there are only individuals with degree less than / is larger than 

r C7 -j- 1 

p {fy . Since there are const (/) • n individuals with degree less than / there 
is a small but positive fraction of individuals which are at distance larger than (7* 
from the high degree groups for arbitrary large C*. In a similar way one can show 
that there are individuals with distance of order In In n (but those form only a set 
of asymptotic zero density). 



4 A toy example 

In the following section we want to discuss a model in which the degree-distribution 
is somehow extreme, namely there are only two types of individuals with sizes C 2 
and The index 2 and / (72) stands for the degree of the two individual-types 

the Icist one depends on the size n of the vertex set. We will make the general 
requirement that the second moment of the degree distribution diverges whereas 




Vol. 4, 2003 



Epidemic Thresholds on Scale- Free Graphs 



S967 



the first moment converges: 

I I 

limsup/(n)-^ — < oo (4.1) 

n 

(4.2) 

We will look at the epidemic process initiated by one randomly infected indi- 
vidual in group and ask wether one has a persistent epidemics for arbitrary 

small transmission probability. For this model the question of existence of an epi- 
demic threshold is only interesting in the above conditional sense since due to our 
assumptions the asymptotic fraction of Cj(n) individuals on the total population 
tends to zero and no epidemics can be persistent among the C 2 population for 
sufficiently small transmission probabilities. 

We will investigate two different structures of edge formations. 

4.1 Case A 

In this subsection we want to separate the elements of as far as possible 

from each other which gives the epidemiologically “safest” structure in the sense 
that unless the first moments of the degree-distribution “almost” diverge one would 
expect an undercritical situation if the transmission probability is sufficiently small 
(note that if the first moment of the degree-distribution diverges one gets trivially 
no threshold). We want to separate all elements of by chains of uniform 

I c I 

length I (n) consisting of elements of C 2 . Since we have / (n) ^ chains (we drop 
all effects due to integer part requirements since they are asymptotically irrelevant) 
we get for the length I (n) = , + 1. The probability to transmit an infection 

\Cf(n) l/(^) 

along such a chain is and hence the expected number of individuals in Cf(^n) 
which will be infected from one initially infected individual from group CfÇri) 
equals / (n) To get absence of a threshold we need therefore / (n) > 1 

asymptotically. This is equivalent to log f (n) — C ^\C ^ ^ ^ ~ ^ ^ 

from which by (4.1) O (n) log / (n) > c - (n — |C/(n) |) follows. First we observe that 
it is impossible to find exponents /3,6 e (0,1) s.t. / (n) := and |C'/(n)| := 
and / (n) 00 under the assumption /3 -\- a < 1. Note that /3 + a = 1 is just 

the borderline where the expectation of the degree starts to become large. 

As a conclusion from the considerations in this section one should prob- 
ably take, that it is the balance between separation length and degree growth 
which matters for the epidemiological threshold-question. So if the elements from 
C 2 would not be as uniformly spread we should look at the following quantities: 
‘fii [/(«)] := [tJ {y € C/(„) : disto {x,y) = /)] where disto denotes the length of 
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the shortest path between x e C'y(n) and y which entirely consists of individ- 
uals from C2. For getting an overcritical epidemics we therefore need A{n) := 

E ‘Pi [f{n)]M‘ > 1- 

l>0 

4.2 Case B 

Assume now that we have stationary asymptotic probabilities poo? Poi, Pio and pn 
for the events that a single random chosen edge from an x G Cy^(n) points to an 
element from (Poo) or to an element from C2 (poi) respectively that an edge 

from an element from C2 points to (pio) or to an element from C2 (pii)- 

There are three conditions on the choice of the pij namely poo+Poi I7P10+P11 = 
1 and poi • / (^) |^/(n) I = Pio ' 2 IC2I. Since the first moments are bounded we have 
~ therefore IC2I ~ n. To get the threshold condition we have 

to estimate the expected number of secondary infections caused by an infected 
individual from which is precisely given by the quantity A (n). To compute 
the numbers (pi [/ (n)] observe that 



Pr (a fixed chain starting at x has length /,/>!)== PoiPn^Pio 


(4.3) 


and Pr (a fixed chain starting at 


X has length 1 ) = poo- Therefore we have 






/ (n) ( mo + iJ'PoiPn^Pio ) 


(4.4) 


i>i 


V (>i / 




= 


/ {n) upoo + f (n) p'^PoiPio , 

1 - PPn 


(4.5) 



(we dropped the n-dependence of the pij). Clearly the direct coupling between 
the Cf(^n) individuals has to be of order at most to avoid a nontrivial epi- 
demics inside the Cf(^ri) population which implies f (n) C2 and in turn 

Pio 0. Using the relations between the pij we have A > p? ' ------ . Prom 

the divergence of the second moments one gets - — ^ hence ab- 
sence of an epidemic threshold in this probabilistic setting. It remains to estimate 
the asymptotic number I (n) of infected individuals which equals the size of the 
largest component of the epidemic random graph. We first estimate the num- 
ber of infected individuals in denoted by If via the stationarity equation 

= If{n). Furthermore an infected individual from CfÇn) in- 
fects in mean = /(n) jzff individuals from C2 from which one 

obtains the following asymptotic bounds on the number of infected individuals: 

è |C/(n)l (1 - âm) / W îGt ^ ^ (1 “ f W rG- Therefore 

unless one has lim„_,oo > 0 (in this case one gets trivially absence of a 

threshold since poo > 0 ) fhe size of the asymptotic fraction of infected individuals 
in the case of absence of a threshold still converges to zero. 
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5 Conclusions and comments 

We have shown that for the preferential choice model described in this paper 
the divergence of the second moment is precisely the condition for absence of 
an epidemic threshold even in the case when the asymptotic distribution is not 
of a power-law form (but has gaps). In terms of graph geometry this is related 
to the presence or absence of a massive finite diameter cluster which carries the 
epidemics. It seems that for the case of an exponent larger than three one always 
has a size independent threshold if one additionally requires that the size of the 
largest component of the epidemic random graph is proportional to /i • n. Note 
that it is easy to get absence of threshold for exponents larger than three if one 
introduces horizontal preferences in the partner choice (for instance a strong bias 
to make connections into the own group) but in this case the size of the largest 
component scales like • n with an e > 0. It is a much more difficult task to 
estimate exactly the size of the giant component on the epidemic random graph 
which will be the subject of a more mathematical forthcoming paper [6] . Certainly 
it scales like const • /i • n. It would be furthermore very interesting to have random 
network growth principles which produce in a natural way the scale free graphs 
we described in this paper. For some results in this direction see [10]. In any case 
the model seems to be interesting in describing computer networks and associated 
spread of viruses on it. Caution is required if one wants to draw conclusions for 
sexually transmitted diseases like HIV on social networks even when these networks 
have a power law tail distribution for the degree. The main point here is that the 
transmission probability scales with the degree for small transmission probabilities- 
like in the AIDS-epidemics- since high degree individuals spend necessarily less 
time with each single partner [8], 

Finally we would like to mention that there is a natural variant of our model 
by defining independent directed edge probabilities between x E Q and y G Cj 
as Pij = with the same as above. For this model one gets the same 

threshold statements since it has the same branching process transition matrix. 
For some alternative strategies to choose partners and the corresponding threshold 
estimations see also[ll][5]. 

References 

[1] R. Albert, A.-L. Barabarasi, Statistical Mechanics of Complex Networks, Re- 
views of Modern Physics 74 , 47 (2002), arXiv:cond-mat/0106096. 

[2] R. Albert, A.-L. Barabarasi, Science 286 , 509 (1999). 

[3] P. Bak, C. Tang, K. Wiesenfeld, Self organized criticality: An explanation of 
1/f noise, Phys. Rev. Lett. 59 , 381 (1987). 

[4] P. Bak, How Nature Works, Springer-Verlag (1996). 




S970 



Ph. Blanchard, C.-H. Chang and T. Kriiger Ann. Henri Poincaré 



[5] Ph. Blanchard, G.-F. Bolz, T. Kriiger, Modeling A IDS -Epidemics or any vén- 
érai diseases on random graphs, in “Stochastic Processes in Epidemic The- 
ory”, Lecture Notes in Biomathematics 86 , Springer-Verlag (1990). 

[6] Ph. Blanchard, C.-H. Chang, T. Kriiger, St. Reiman, Epidemic thresholds on 
scale-free graphs, BiBoS Preprint, University of Bielefeld (2002). 

[7] B. Bollobais, O. Riordan, J. Spencer, G. Tusnady, The degree sequence of a 
scale- free random graph. Random Struc. Alg. 18, 279 (2001). 

[8] Z. Deszo, A.-L. Barabasi, Halting viruses in scale-free networks, arXivicond- 
mat/0107430v2 (2002). 

[9] R. Pastor- Satorras, A. Vespignani, Epidemic Spreading in Scale-Free Net- 
works, Phys. Rev. Lett.%%, 3200 (2001). 

[10] D. Volchenkov, Ph. Blanchard, An algorithm generating random graphs with 
power law distributions, to appear in Physica A (2003). 

[11] D. Volchenkov, L. Volchenkova, Ph. Blanchard, The role of social structure 
and immunization policy in disease spreading in scale free networks, submitted 
to Phys. Rev. E . 

Ph. Blanchard 
Universitat Bielefeld 
Fakultat für Physik, 

Universtatsstr. 25 
D-33619 Bielefeld 
Germany 

C.-H. Chang 

National Center for Theoretical Sciences 
Physics Division, NTHU 
101. Section 2 
Hsinchu, Taiwan 300 

R.O.C. 

T. Kriiger 

Universitat Bielefeld 
Fakultat für Mathematik 
Universitatsstr. 25 
D-33619 Bielefeld 
Germany 




I Annales Henri Poincaré 



Ann. Henri Poincaré 4, Suppl. 2 (2003) S971 - S976 
@ Birkhâuser Verlag, Basel, 2003 
1424-0637/03/02S971-6 
DOI 10.1007/s00023-003-0976-0 



The First Regional Center for Mathematical Sciences 



Ali H. Chamseddine 



Introduction 

The Center for Advanced Mathematical Sciences (CAMS) at the American Uni- 
versity of Beirut (AUB) promotes research in disciplines that make significant use 
of mathematical techniques, such as Mathematics, Physics, Chemistry and Natu- 
ral Sciences, Engineering, Computer Science, Economics and Business. Situated in 
the oldest and largest American- Chartered University in the Middle East, CAMS 
offers to visitors and researchers a magnificent view on the Mediterranean, and 
refreshing walks in the beautiful campus of AUB. The Center is located in Beirut, 
a cosmopolitan city with five thousand years of history and culture, where one can 
enjoy many tourist attractions. 

CAMS was founded in 1999 through the efforts of an international group 
of scientists with the primary goal of becoming the premier center of excellence 
for research in the mathematical sciences in the Middle East. The purpose of the 
Center, according to its official mission statement, is to promote research and grad- 
uate studies in mathematics and to serve as a focal point for collaboration among 
mathematicians and scientists in Lebanon and throughout the region. CAMS has 
a unique role to play as a leading regional institute for mathematical research. 
Its aims are to create opportunities for top-quality research and teaching, to en- 
courage academic collaboration and interdisciplinary research at AUB and in the 
region, to serve as a flagship institute within AUB academic plan to revitalize 
scholarship, and to attract exceptional faculty to the university. 

In February 2002, CAMS was named an associate of the Millennium Science 
Initiative (MSI) for the Middle East. The goal of MSI is to create and nurture 
world-class science and scientific talent in the developing world. 



CAMS Activities 

CAMS has a core faculty who hold joint appointments at the Center and in their 
respective departments at AUB. The faculty includes a director who manages the 
Center in consultation with the university Provost and an International Advisory 
Committee of distinguished scholars. In addition to its core faculty, the Center 
has 86 Associate Scholars (as of September 2002), who conduct research in related 
fields such as physics, mathematics, chemistry, civil engineering, electrical and 
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Figure 1: The first conference 



communications engineering, biomédical engineering, and business and manage- 
ment. CAMS Associates use the research and computing facilities of the Center, 
participate in its scientific programs, and shape research directions to be under- 
taken at CAMS. In the course of their research work, they interact with AUB 
faculty and students, and typically give seminars and workshops. CAMS also of- 
fers good library facilities with access to many electronic journals and data basis, 
and a good Visiting Scholars program. Prom 1999 to 2002, the Center received 
around 50 Visiting Scholars from North America, Europe, and the Middle East, 
who conducted research and offered seminars, lectures, and short courses. CAMS 
hosts a regular series of seminars, conferences and lectures, and invites speakers 
in various disciplines, from a number of universities, to talk about their field of 
specialty. More than 100 seminars were organized from 1999 to 2002, covering a 
wide kaleidoscope of current scientific topics. 

CAMS is equipped with the state-of-the-art technology to best accommodate 
associate and guest researchers. High-speed Internet connectivity, excellent access 
to electronic sources of scientific literature and a fully integrated seminar room 
are available, as well as high performance clusters for scientists looking to perform 
demanding computations. Recently, CAMS has installed a 16 processor IBM p630 
Cluster, procured by utilizing a grant from the American Schools and Hospitals 
Abroad (ASHA). This High Performance Computing platform, named IbnSina, 
will consolidate the role of CAMS in becoming a regional center for scientific 
computing. The Cluster comes as a significant addition to the already existing 
commodity-built clusters: a 4 node Intel PHI Beowulf cluster and a 4 node Digital 
Alpha cluster. With an annual budget of $200,000, the range of activities of CAMS 
remains modest. However, the high quality of the research and training activity 
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Figure 2: The cluster 



at the Center, as well as the installation of a computer system that is the most 
powerful in the region, bodes well for the future. 

Challenges faced by CAMS 

CAMS suffers from the bad reputation of the 15 years of civil war (which ended in 
1991), which earned Lebanon an image of violence. Many scientists who have been 
invited to visit the Center have been reluctant to come as a result. Although the 
country is now very stable and many tourists are flowing in, the negative picture 
persists. Moreover, the Middle East is a region of conflict. Events that occur in 
neighboring countries give the impression that they affect Lebanon, thus dissuading 
many scientists from visiting the Center. To become a genuinely regional center, 
CAMS must aim to triple the funding available to it. More fund raising is needed 
especially from Arab countries. CAMS also needs stronger support from Europe. 
The Center is also affected by the relative lack of overlap of research interests 
between different researchers in Lebanon. This has hindered collaborative efforts 
between scientists, as the country does not have a critical mass of researchers in 
any one field of specialization. 

The problems mentioned above can be solved through the following steps: 
establish more links with USA and European funding agencies to support visits, 
conferences, workshops and summer schools; add more contacts of scientists inter- 
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ested in helping science in developing countries; encourage more visits to CAMS 
(all who visited were pleasantly surprised); start an effective Ph.D. program to 
attract good students from Lebanon and the region; encourage regional collabora- 
tions by supporting extended visits from scientists in the region and experts from 
the West; and unite the efforts of libraries by having combined subscriptions to 
electronic journals. 

Conclusion 

CAMS has now established itself as a premier center of excellence for research in 
the mathematical sciences in the Middle East. CAMS needs however a vigorous 
effort in fund raising to increase its size while maintaining its qualities. 

For more information, check out the website of the Center: 
http: / /www.cams.aub.edu.lb 



Acknowledgments 

I would like to thank Professors Daniel lagolnitzer, Jean Zinn-Justin and Vincent 
Rivasseau for giving me the opportunity to present CAMS at the special session 
on “Physics in Developing Countries” during the conference TH-2002. 1 would also 
like to thank Professor Kamal Khuri-Makdisi for help in editing of this article. 



Appendix A: CAMS Faculty 

1. Ali Chamseddine, Director, Ph.D. 1976, Imperial College, London. Research 
area: supersymmetry, string theory and noncommutative geometry. 

2. Kamal Khuri-Makdisi, Ph.D. 1993, Princeton University. Research area: 
Number theory and Automorphic forms. 

3. Wafic Sabra, Ph.D. 1991, University of London. Research area: String theory, 
quantum gravity. 

4. Jihad Tourna, Ph.D. 1993, Massachusetts Institute of Technology. Research 
area: Non-linear dynamics and Chaos. 

Appendix B: International Advisory Committee 

The CAMS International Advisory Committee (I AC) is an international group of 
highly distinguished mathematicians and physicists, who evaluate the academic 
activities of CAMS annually, and recommend the appointment of its faculty and 
director. The I AC members are highly experienced scientific leaders who have 
been involved in the administration of some of the world’s top institutions in 
mathematics and physics. They are: 
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1. Sir Michael Atiyah, lAC Chairman: one of the world’s leading mathemati- 
cians (Fields Medal, 1966); former President, Royal Society, UK; Honorary 
Fellow, Mathematics Department, University of Edinburgh. 

2. Dr. Luis Alvarez-Gaume, Senior Scientist, European Organization for Nu- 
clear Research (CERN), Geneva. 

3. Dr. Jean Pierre Bourguignon, Director, Institut des Hautes Etudes Scien- 
tifiques, Bures-sur- Yvette, France. 

4. Dr. Jürg Frohlich, Professor, Institute for Theoretical Physics, Swiss Federal 
Institute of Technology (ETH), Zurich. 

5. Dr. Roman W. Jackiw, Jerrold Zacharias Professor of Physics, Massachusetts 
Institute of Technology. 

6. Dr. Nicola N. Khuri, Professor and Head, Laboratory of Theoretical Physics, 
Rockefeller University, New York. 

7. Sir James Mirrlees, 1996 Nobel Prize Winner in Economics; Fellow in Eco- 
nomics, Trinity College, Cambridge University. 

8. Dr. Edoardo Vesentini, President, Accademia Nazionale dei Lincei, Rome, 
Italy; Professor of Mathematics, Politecnico di Torino. 

Appendix C: Major Conferences 

1. Symposium on Computational Science, May 13-15, 1998. 

2. Flow, Friction, and Fracture, July 1-7, 1998. 

3. The Mathematical Sciences after the Year 2000: A Prospective View, January 
11-15, 1999. 

4. First Beit-Mery Workshop on Mathematical Sciences: Geometry and Physics, 
January 11-15, 2000. 

5. Workshop on String Theory and Noncommutative Geometry, May 31-June 
5, 2000. 

6. Workshop on Finite Element Methods, December 16-20, 2000. 

7. Summer School and Workshop: Dirac Operators, Yesterday and Today, Au- 
gust 27-September 7, 2001. 

8. Summer School on Parallel, Distributed, Mobile and Internet Computing, 
July 8-19, 2002. 



9. First Workshop on Dynamics and its Applications, October 21-25, 2002. 
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10. International Conference on the Discrete Simulation of Fluid Dynamics, 
planned for August 2003. 

11. Many more are planned. 

Appendix D: Institutional grants to CAMS 

1. Lounsbery Foundation, New York. Major grant, 1999-2001. 

2. Schlumberger Corporation. Major grant, 2002-2003. 

3. ASHA (American Schools and Hospitals Abroad). 

4. International Center for Theoretical Physics (ICTP), Trieste, Italy. 

5. Geraldine Dodge Foundation, New Jersey. 

6. UNESCO, Cairo office 

7. Clay Mathematics Institute, Cambridge, Massachusetts. 

8. Banque Audi, Beirut, Lebanon. 

CAMS has also benefited from generous support by private donors. 

Ali H. Chamseddine 
CAMS, AUB 

The Center for Advanced Mathematical Sciences (CAMS) 
at the American University of Beirut (AUB) 
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Abstract. I briefly review the African Institute for Mathematical Sciences project, 
a new educational centre now being established in Cape Town, South Africa with 
the goal of strengthening scientific and technological capacity across the African 
continent. 



1 What is AIMS? 

The African Institute for Mathematical Sciences (AIMS) is an exciting new initia- 
tive involving African and international Universities, to establish a new centre for 
postgraduate teaching and research in Cape Town, South Africa. It will initially 
focus on a unique, intensive nine- month postgraduate course developing research 
skills as well as providing an overview of many of the most exciting areas of math- 
ematical science. 

The course will lead to a Postgraduate Diploma in the Mathematical Sci- 
ences, formally accredited by the three partner South African Universities and 
endorsed by the Faculty of Mathematics of the University of Cambridge. Recently, 
the Universities of Orsay and Oxford have joined the project as partners. The 
AIMS institute will be formally opened in September 2003. 

The AIMS project has received enthusiastic support from the international 
academic community. Outstanding African and international scientists and educa- 
tors have volunteered to teach at AIMS, ensuring a course of the highest interna- 
tional quality. 30 students have been recruited for the first year of the course from 
15 African countries, from Algeria to Zimbabwe. Our intention is for the AIMS 
course to serve as a stepping stone to careers in research, education, industry or 
government. The pan- African character of the student body and the international 
mix of lecturers will offer unique opportunities for building friendships across cul- 
tural and national boundaries, contributing positively to bridging the North-South 
divide. AIMS graduates will remain in contact long after they have completed the 
course, providing a powerful network working together for African progress. 

AIMS will be located in Muizenberg, Cape Town, in a beautiful building 
(formerly an 80- room art-deco style hotel) which has been donated to the project. 
Students and visiting lecturers will be housed within the building, allowing for 
maximal interaction in an informal and collegiate setting. AIMS will also house a 
computer laboratory, a library and excellent teaching facilities. 
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The Institute will be located in the twin white buildings at the left of this picture 

Muizenberg is an area of outstanding natural beauty within a half-hour drive 
of each of the local Universities and Cape Town international airport. There is a 
direct train service to the centre of Cape Town. Several important museums, a 
theater and tertiary educational colleges further enhance the setting.The beach on 
False Bay, 50 yards from AIMS, is one of the finest in the Cape. 

2 The AIMS Course 

The AIMS curriculum will provide both a strong foundation in mathematical and 
computing research skills, and an overview of cutting edge developments in the 
mathematical sciences. 

Skills courses will include: 

• The Art of Scientific Approximation 

• Mathematical Problem Solving 

• Probability and Statistics 

• Mathematical Modeling 

• Methods of Mathematical Physics 
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Review courses will when possible be given in tandem by overseas lectur- 
ers and local academics. They will cover some of the most exciting areas of sci- 
ence today, with mathematics as a unifying thread. Topics include Fluid mechan- 
ics, Quantum physics. Epidemiology, Astronomy and astrophysics. Fundamental 
physics, Modeling of biological systems. Complex systems, Numerical analysis. Ap- 
plications of complex variables, Geometry and topology, Financial mathematics. 

The teaching style will be highly interactive and responsive to the students’ 
needs. As a rule, students will study only one subject at a time for two or three 
weeks, with morning lectures and related afternoon problem solving and computing 
sessions. Additional lectures may be held in the evenings. 

The goal will be to develop well-rounded scientists, with creative problem- 
solving skills and a strong grounding in generally applicable mathematical and 
computational methods. A course of such breadth is unique worldwide, offering 
students an exposure to a range of sciences and allowing them to make an informed 
choice as to their future scientific speciality. 

3 AIMS Structure and Funding 

The AIMS Board of Directors includes representatives of the partner Universities: 

- Prof. Jan van Bever Donker, University of the Western Cape, 

- Prof. Fritz Hahne, University of Stellenbosch, 

- Prof. Keith Moffatt, University of Cambridge, 

- Prof. Daya Reddy, University of Cape Town, 

- Prof. Graham Richards, University of Oxford, 

- Prof. Vincent Rivasseau, University of Orsay, 

- Prof. Neil Turok, University of Cambridge. 

There is also an Advisory Board of African academics chaired by Prof. Hans 
Eggers, and we are developing a list of AIMS representatives across Africa. 

AIMS is made possible by funding from: 

- The Gatsby Charitable Foundation, UK. 

- The Vodafone Group Foundation, UK. 

- The Vodacom Foundation, South Africa. 

- Department of Arts, Culture, Science and Technology, South Africa. 

- Cambridge University Press. 

- The David and Elaine Potter Charitable Foundation 

- The Daniel lagolnitzer Foundation and the Foundation la Ferthé, under aegis 
of the Foundation de France 

- The University of Cambridge Local Examinations Syndicate 

- The Ford Foundation 

- Seardel Investment Corporation 

- The University of Stellenbosch 
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Over US$1.5 million has been secured for the project so far, of a total of $2.0 
million required for the first three years of operation. Further details including 
a list of lecturer volunteers, student application forms, details of the first years 
course and other materials may be found on the AIMS web-site: 
http: / / www.aimsforafrica.org 

4 What Supporters say. . . 

John Samuel, Chief Executive, The Nelson Mandela Foundation: 

“We have been informed of the proposed African Institute for Mathematical 
Sciences (AIMS) and find it one of the most important projects presented to us. 
Mr Mandela has repeatedly declared his concern about the advancement of science 
teaching and research in South Africa. This initiative is an outstanding instance 
of concrete collaboration between leading scientists in the advanced countries and 
our own scientists. This Foundation enthusiastically supports AIMS in the name 
of the former president Mandela.” 

Mosibudi Mangena, Deputy Minister of Education, South Africa: 

“Our country and continent badly need a broad, concerted effort to im- 
prove the quality, participation and performance of South African and African 
researchers and learners in the mathematical sciences. The African Institute for 
Mathematical Sciences is destined to become a flagship institution, giving concrete 
expression to this need and ensuring that Africa’s contribution to the mathematical 
sciences receives the recognition it deserves. The in-service training of mathematics 
teachers is indeed a top priority of the Government’s educational transformation 
plan, Tirisano - hence my unequivocal support for the establishment of this Insti- 
tute.” 

Professor Chris Brink, Vice-Chancellor, University of Stellenbosch, SA: 

“The development of the mathematical sciences is a visionary and cost- 
effective investment in Africa’s future. All credit to the AIMS team.” 

Professor Sir Alec Broers, Vice Chancellor, University of Cambridge, UK: 

“Africa’s problems are awesome and their solution must be multifactoral. But 
there can be no doubt that training in the sciences and technology, based on the 
impressive foundations already existing in South Africa, must be one of the keys 
to unlocking the continent’s potential and empowering its citizens. This project is 
an impressive and well-thought-out approach to a new era for African science and 
mathematics.” 

Professor George Ellis, former President of the Royal Society of South Africa: 

“The economic development of Africa this century is inevitably tied in to its 
technological development, which in turn is crucially dependent on high quality 
technical and scientific and technical education. The AIMS project can play an 
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exciting part in the much needed development of South African and African scien- 
tific and technical capacity, vitally needed to support the bold ideas of an African 
Renaissance and NEPAD.” 

Professor Sir Michael Berry, Physics Department, University of Bristol, UK: 

“A brave and exciting project, that will expose some of the brightest people 
in Africa to scientific culture at the highest level and thereby help them acquire 
the tools to develop desperately-needed fundamental new technologies” 

Professor John Ellis, former head. Theory Division, CERN, Geneva, Switzerland: 
“Among the keys to Africa’s social and economic progress are the devel- 
opment of its human resources and its participation in advances in science and 
technology. The proposed project addresses both these priorities in a well-focused 
way that uses and builds on the local resources in the Western Cape, while being 
open to other parts of Africa and the rest of the world.” 

Professor David Gross, Director, Institute for Theoretical Physics, Santa Barbara, 
USA: 

“The African Institute for the Mathematical Sciences, by bringing students 
in contact with excellent local and foreign scientists in a program that covers 
the many of the most exciting areas of modern science, offers an innovative and 
creative approach to developing scientific talent in Africa. The potential benefits 
for scientific and technological development in Africa are enormous.” 

Professor Sir Martin Rees, Astronomer Royal, Institute of Astronomy, Cambridge, 
UK: 

“This venture, initiated by an outstanding group of committed researchers 
and teachers, could be extraordinarily cost-effective in fostering youthful talent 
and scientific excellence South Africa and beyond.” 

Professor Stephen Hawking, Cambridge, UK: 

“This institute will bring Africa to the cutting edge of science.” 

Professor Paul Steinhardt, Princeton University, USA: 

“By offering a unique, interdisciplinary introduction to topics at the fore- 
fronts of science, the African Institute for the Mathematical Sciences will bring 
together students and faculty with diverse interests and backgrounds, creating in- 
tellectual kinships that will have a long-lasting impact and perhaps will inspire 
new breakthroughs. I have no doubt that the young students sharing knowledge 
across intellectual and international boundaries will make a profound contribution 
to freedom, education and prosperity in Africa.” 

Sir John Sulston, former Director, Sanger Institute and winner of the 2002 Nobel 
Prize for Medicine: 

“The broad dissemination of knowledge, understanding and expertise is a 
precondition for reducing global inequity. Disease and famine of the body call for 
action so powerfully that it is easy to forget the equal need to alleviate famine of 
the mind, without which the future holds no promise.” 
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5 Future 

AIMS will encourage its alumni to remain in contact, in order to collaboratively 
employ their skills, knowledge and contacts to advance education and development 
across the continent. AIMS will develop a network of partner institutes across 
Africa, encouraging African researchers to teach at AIMS and recruit students onto 
their research projects. AIMS will strongly encourage outstanding international 
scientists lecturing at AIMS to visit other African educational and research centres. 

AIMS is seeking funding to launch a Schools Enrichment Center (AIMS-SEC) 
which will be located in the Muizenberg facility. This project is being headed by 
Toni Beardon, founder of the NRICH project for web-based mathematics. The 
goal is to support schoolteachers by providing web-based and paper materials to 
assist mathematics and science learning. There will also be in-service courses for 
teachers held during the months when the Diploma course is in recess (June, July 
and August). 

AIMS will host workshops and long term visitor programs encouraging re- 
search collaboration between African and international scientists. A model for 
AIMS’s long-term future is the International Center for Theoretical Physics, in 
Trieste, Italy. 

Neil Turok 

Centre for Mathematical Sciences, 

University of Cambridge, 

Wilber force Road, 

Cambridge CB4 OWA 
UK 

Email: N.G.Turok@damtp.cam.ac.uk 
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